Big Self-Supervised Models are Strong Semi-Supervised Learners

Chen, Ting; Kornblith, Simon; Swersky, Kevin; Norouzi, Mohammad; Hinton, Geoffrey E.

doi:10.48550/arxiv.2006.10029

preprintarXiv (Cornell University)Jun 17, 2020GREEN OA

Big Self-Supervised Models are Strong Semi-Supervised Learners

TCTing Chen SKSimon Kornblith KSKevin Swersky MNMohammad Norouzi GEGeoffrey E. Hinton

Google (United States)

Indexed inarxivdatacite

Abstract

One paradigm for learning from few labeled examples while making best use of a large amount of unlabeled data is unsupervised pretraining followed by supervised fine-tuning. Although this paradigm uses unlabeled data in a task-agnostic way, in contrast to common approaches to semi-supervised learning for computer vision, we show that it is surprisingly effective for semi-supervised learning on ImageNet. A key ingredient of our approach is the use of big (deep and wide) networks during pretraining and fine-tuning. We find that, the fewer the labels, the more this approach (task-agnostic use of unlabeled data) benefits from a bigger network. After fine-tuning, the big network can be further improved and…

Citation impact

476

total citations

FWCI: —
Percentile: —
References: 66

Citations per year

Authors

5

Topics & keywords

Topics

Keywords

Artificial intelligence
Computer science
Task (project management)
Machine learning
Labeled data
Supervised learning
Pattern recognition (psychology)
Semi-supervised learning

UN Sustainable Development Goals

Quality Education

No related works found for this paper.