Abstract
Large deep neural networks are powerful, but exhibit undesirable behaviors such as memorization and sensitivity to adversarial examples. In this work, we propose mixup, a simple learning principle to alleviate these issues. In essence, mixup trains a neural network on convex combinations of pairs of examples and their labels. By doing so, mixup regularizes the neural network to favor simple linear behavior in-between training examples. Our experiments on the ImageNet-2012, CIFAR-10, CIFAR-100, Google commands and UCI datasets show that mixup improves the generalization of state-of-the-art neural network architectures. We also find that mixup reduces the memorization of corrupt labels, increases the robustness…
Citation impact
961
total citations
- FWCI
- —
- Percentile
- —
- References
- 0
Citations per year
Authors
1Topics & keywords
Topics
Keywords
- Computer science
- Adversarial system
- Robustness (evolution)
- Artificial intelligence
- Machine learning
- Artificial neural network
- Memorization
- Deep neural networks
UN Sustainable Development Goals
- Peace, Justice and strong institutions
No related works found for this paper.