preprintarXiv (Cornell University)May 6, 2019GREEN OA

MixMatch: A Holistic Approach to Semi-Supervised Learning

Google (United States)

Indexed inarxivdatacite

Abstract

Semi-supervised learning has proven to be a powerful paradigm for leveraging unlabeled data to mitigate the reliance on large labeled datasets. In this work, we unify the current dominant approaches for semi-supervised learning to produce a new algorithm, MixMatch, that works by guessing low-entropy labels for data-augmented unlabeled examples and mixing labeled and unlabeled data using MixUp. We show that MixMatch obtains state-of-the-art results by a large margin across many datasets and labeled data amounts. For example, on CIFAR-10 with 250 labels, we reduce error rate by a factor of 4 (from 38% to 11%) and by a factor of 2 on STL-10. We also demonstrate how MixMatch can help achieve a dramatically better…

Citation impact

604
total citations
FWCI
Percentile
References
47
Citations per year

Authors

6

Topics & keywords

Keywords
  • Computer science
  • Margin (machine learning)
  • Labeled data
  • Semi-supervised learning
  • Machine learning
  • Artificial intelligence
  • Supervised learning
  • Factor (programming language)
No related works found for this paper.