Self-Training With Noisy Student Improves ImageNet Classification
Google (United States) · Brain (Germany) · +1 more institution
Abstract
We present a simple self-training method that achieves 88.4% top-1 accuracy on ImageNet, which is 2.0% better than the state-of-the-art model that requires 3.5B weakly labeled Instagram images. On robustness test sets, it improves ImageNet-A top-1 accuracy from 61.0% to 83.7%, reduces ImageNet-C mean corruption error from 45.7 to 28.3, and reduces ImageNet-P mean flip rate from 27.8 to 12.2. To achieve this result, we first train an EfficientNet model on labeled ImageNet images and use it as a teacher to generate pseudo labels on 300M unlabeled images. We then train a larger EfficientNet as a student model on the combination of labeled and pseudo labeled images. We iterate this process by putting back the…
Citation impact
- FWCI
- 158.69
- Percentile
- 100%
- References
- 163
Authors
4Topics & keywords
- Robustness (evolution)
- Computer science
- Artificial intelligence
- Dropout (neural networks)
- Training set
- Machine learning
- Noise (video)
- Process (computing)
- Peace, Justice and strong institutions