Big Self-Supervised Models are Strong Semi-Supervised Learners
Indexed inarxivdatacite
Abstract
One paradigm for learning from few labeled examples while making best use of a large amount of unlabeled data is unsupervised pretraining followed by supervised fine-tuning. Although this paradigm uses unlabeled data in a task-agnostic way, in contrast to common approaches to semi-supervised learning for computer vision, we show that it is surprisingly effective for semi-supervised learning on ImageNet. A key ingredient of our approach is the use of big (deep and wide) networks during pretraining and fine-tuning. We find that, the fewer the labels, the more this approach (task-agnostic use of unlabeled data) benefits from a bigger network. After fine-tuning, the big network can be further improved and…
Citation impact
476
total citations
- FWCI
- —
- Percentile
- —
- References
- 66
Citations per year
Authors
5Topics & keywords
Topics
Keywords
- Artificial intelligence
- Computer science
- Task (project management)
- Machine learning
- Labeled data
- Supervised learning
- Pattern recognition (psychology)
- Semi-supervised learning
UN Sustainable Development Goals
- Quality Education
No related works found for this paper.