preprintarXiv (Cornell University)Jun 6, 2019GREEN OA

When Does Label Smoothing Help?

Indexed inarxivdatacite

Abstract

The generalization and learning speed of a multi-class neural network can often be significantly improved by using soft targets that are a weighted average of the hard targets and the uniform distribution over labels. Smoothing the labels in this way prevents the network from becoming over-confident and label smoothing has been used in many state-of-the-art models, including image classification, language translation and speech recognition. Despite its widespread use, label smoothing is still poorly understood. Here we show empirically that in addition to improving generalization, label smoothing improves model calibration which can significantly improve beam-search. However, we also observe that if a teacher…

Citation impact

884
total citations
FWCI
Percentile
References
17
Citations per year

Authors

3

Topics & keywords

Keywords
  • Smoothing
  • Computer science
  • Mathematics
  • Business
  • Statistics
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.