Pseudo-labeling and confirmation bias in deep semi-supervised learning

Dublin City University

Abstract

Semi-supervised learning, i.e. jointly learning from labeled and unlabeled samples, is an active research topic due to its key role on relaxing human supervision. In the context of image classification, recent advances to learn from unlabeled samples are mainly focused on consistency regularization methods that encourage invariant predictions for different perturbations of unlabeled samples. We, conversely, propose to learn from unlabeled data by generating soft pseudo-labels using the network predictions. We show that a naive pseudo-labeling overfits to incorrect pseudo-labels due to the so-called confirmation bias and demonstrate that mixup augmentation and setting a minimum number of labeled samples per…

Citation impact

763
total citations
FWCI
Percentile
References
57
Citations per year

Authors

5

Topics & keywords

Keywords
  • Regularization (linguistics)
  • Computer science
  • Labeled data
  • Artificial intelligence
  • Machine learning
  • Consistency (knowledge bases)
  • Source code
  • Semi-supervised learning
No related works found for this paper.