preprintarXiv (Cornell University)Jun 17, 2020GREEN OA

Big Self-Supervised Models are Strong Semi-Supervised Learners

Google (United States)

Indexed inarxivdatacite

Abstract

One paradigm for learning from few labeled examples while making best use of a large amount of unlabeled data is unsupervised pretraining followed by supervised fine-tuning. Although this paradigm uses unlabeled data in a task-agnostic way, in contrast to common approaches to semi-supervised learning for computer vision, we show that it is surprisingly effective for semi-supervised learning on ImageNet. A key ingredient of our approach is the use of big (deep and wide) networks during pretraining and fine-tuning. We find that, the fewer the labels, the more this approach (task-agnostic use of unlabeled data) benefits from a bigger network. After fine-tuning, the big network can be further improved and…

Citation impact

476
total citations
FWCI
Percentile
References
66
Citations per year

Authors

5

Topics & keywords

Keywords
  • Artificial intelligence
  • Computer science
  • Task (project management)
  • Machine learning
  • Labeled data
  • Supervised learning
  • Pattern recognition (psychology)
  • Semi-supervised learning
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.