Mean teachers are better role models: Weight-averaged consistency\n targets improve semi-supervised deep learning results
Indexed inarxiv
Abstract
The recently proposed Temporal Ensembling has achieved state-of-the-art\nresults in several semi-supervised learning benchmarks. It maintains an\nexponential moving average of label predictions on each training example, and\npenalizes predictions that are inconsistent with this target. However, because\nthe targets change only once per epoch, Temporal Ensembling becomes unwieldy\nwhen learning large datasets. To overcome this problem, we propose Mean\nTeacher, a method that averages model weights instead of label predictions. As\nan additional benefit, Mean Teacher improves test accuracy and enables training\nwith fewer labels than Temporal Ensembling. Without changing the network\narchitecture, Mean Teacher…
Citation impact
2,520
total citations
- FWCI
- —
- Percentile
- —
- References
- 28
Citations per year
Authors
2Topics & keywords
Topics
Keywords
- Consistency (knowledge bases)
- Computer science
- Artificial intelligence
- Residual
- Machine learning
- Algorithm
UN Sustainable Development Goals
- Quality Education
No related works found for this paper.