articleJun 1, 2020Closed access

Self-Training With Noisy Student Improves ImageNet Classification

Google (United States) · Brain (Germany) · +1 more institution

Indexed incrossref

Abstract

We present a simple self-training method that achieves 88.4% top-1 accuracy on ImageNet, which is 2.0% better than the state-of-the-art model that requires 3.5B weakly labeled Instagram images. On robustness test sets, it improves ImageNet-A top-1 accuracy from 61.0% to 83.7%, reduces ImageNet-C mean corruption error from 45.7 to 28.3, and reduces ImageNet-P mean flip rate from 27.8 to 12.2. To achieve this result, we first train an EfficientNet model on labeled ImageNet images and use it as a teacher to generate pseudo labels on 300M unlabeled images. We then train a larger EfficientNet as a student model on the combination of labeled and pseudo labeled images. We iterate this process by putting back the…

Citation impact

2,268
total citations
FWCI
158.69
Percentile
100%
References
163
Citations per year

Authors

4

Topics & keywords

Keywords
  • Robustness (evolution)
  • Computer science
  • Artificial intelligence
  • Dropout (neural networks)
  • Training set
  • Machine learning
  • Noise (video)
  • Process (computing)
UN Sustainable Development Goals
  • Peace, Justice and strong institutions
No related works found for this paper.