articlearXiv (Cornell University)Mar 6, 2017GREEN OA

Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results

Dalle Molle Institute for Artificial Intelligence Research

Indexed inarxivdatacite

Abstract

The recently proposed Temporal Ensembling has achieved state-of-the-art results in several semi-supervised learning benchmarks. It maintains an exponential moving average of label predictions on each training example, and penalizes predictions that are inconsistent with this target. However, because the targets change only once per epoch, Temporal Ensembling becomes unwieldy when learning large datasets. To overcome this problem, we propose Mean Teacher, a method that averages model weights instead of label predictions. As an additional benefit, Mean Teacher improves test accuracy and enables training with fewer labels than Temporal Ensembling. Without changing the network architecture, Mean Teacher achieves…

Citation impact

1,592
total citations
FWCI
Percentile
References
0
Citations per year

Authors

2

Topics & keywords

Keywords
  • Computer science
  • Consistency (knowledge bases)
  • Artificial intelligence
  • Residual
  • Machine learning
  • Algorithm
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.