Learning Representations by Maximizing Mutual Information Across Views

Bachman, Philip; Hjelm, R Devon; Buchwalter, William

doi:10.48550/arxiv.1906.00910

preprintarXiv (Cornell University)Jun 3, 2019GREEN OA

Learning Representations by Maximizing Mutual Information Across Views

PBPhilip Bachman RDR Devon Hjelm WBWilliam Buchwalter

Microsoft Research (United Kingdom)

Indexed inarxivdatacite

Abstract

We propose an approach to self-supervised representation learning based on maximizing mutual information between features extracted from multiple views of a shared context. For example, one could produce multiple views of a local spatio-temporal context by observing it from different locations (e.g., camera positions within a scene), and via different modalities (e.g., tactile, auditory, or visual). Or, an ImageNet image could provide a context from which one produces multiple views by repeatedly applying data augmentation. Maximizing mutual information between features extracted from these views requires capturing information about high-level factors whose influence spans multiple views -- e.g., presence of…

Citation impact

678

total citations

FWCI: —
Percentile: —
References: 45

Citations per year

Authors

3

Topics & keywords

Topics

Keywords

Mutual information
Computer science
Psychology
Data science
Artificial intelligence

No related works found for this paper.