preprintarXiv (Cornell University)Feb 18, 2020GREEN OA

DivideMix: Learning with Noisy Labels as Semi-supervised Learning

Indexed inarxivdatacite

Abstract

Deep neural networks are known to be annotation-hungry. Numerous efforts have been devoted to reducing the annotation cost when learning with deep networks. Two prominent directions include learning with noisy labels and semi-supervised learning by exploiting unlabeled data. In this work, we propose DivideMix, a novel framework for learning with noisy labels by leveraging semi-supervised learning techniques. In particular, DivideMix models the per-sample loss distribution with a mixture model to dynamically divide the training data into a labeled set with clean samples and an unlabeled set with noisy samples, and trains the model on both the labeled and unlabeled data in a semi-supervised manner. To avoid…

Citation impact

500
total citations
FWCI
Percentile
References
44
Citations per year

Authors

3

Topics & keywords

Keywords
  • Computer science
  • Artificial intelligence
  • Benchmark (surveying)
  • Annotation
  • Machine learning
  • Semi-supervised learning
  • Supervised learning
  • Deep learning
No related works found for this paper.