preprintarXiv (Cornell University)Mar 6, 2017GREEN OA

Mean teachers are better role models: Weight-averaged consistency\n targets improve semi-supervised deep learning results

Indexed inarxiv

Abstract

The recently proposed Temporal Ensembling has achieved state-of-the-art\nresults in several semi-supervised learning benchmarks. It maintains an\nexponential moving average of label predictions on each training example, and\npenalizes predictions that are inconsistent with this target. However, because\nthe targets change only once per epoch, Temporal Ensembling becomes unwieldy\nwhen learning large datasets. To overcome this problem, we propose Mean\nTeacher, a method that averages model weights instead of label predictions. As\nan additional benefit, Mean Teacher improves test accuracy and enables training\nwith fewer labels than Temporal Ensembling. Without changing the network\narchitecture, Mean Teacher…

Citation impact

2,520
total citations
FWCI
Percentile
References
28
Citations per year

Authors

2

Topics & keywords

Keywords
  • Consistency (knowledge bases)
  • Computer science
  • Artificial intelligence
  • Residual
  • Machine learning
  • Algorithm
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.