Mean teachers are better role models: Weight-averaged consistency\n targets improve semi-supervised deep learning results

Tarvainen, Antti; Valpola, Harri

doi:10.48550/arxiv.1703.01780

preprintarXiv (Cornell University)Mar 6, 2017GREEN OA

Mean teachers are better role models: Weight-averaged consistency\n targets improve semi-supervised deep learning results

ATAntti Tarvainen HVHarri Valpola

Indexed inarxiv

Abstract

The recently proposed Temporal Ensembling has achieved state-of-the-art\nresults in several semi-supervised learning benchmarks. It maintains an\nexponential moving average of label predictions on each training example, and\npenalizes predictions that are inconsistent with this target. However, because\nthe targets change only once per epoch, Temporal Ensembling becomes unwieldy\nwhen learning large datasets. To overcome this problem, we propose Mean\nTeacher, a method that averages model weights instead of label predictions. As\nan additional benefit, Mean Teacher improves test accuracy and enables training\nwith fewer labels than Temporal Ensembling. Without changing the network\narchitecture, Mean Teacher…

Citation impact

2,520

total citations

FWCI: —
Percentile: —
References: 28

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

Consistency (knowledge bases)
Computer science
Artificial intelligence
Residual
Machine learning
Algorithm

UN Sustainable Development Goals

Quality Education

No related works found for this paper.