Be Your Own Teacher: Improve the Performance of Convolutional Neural Networks via Self Distillation
Tsinghua University · Silicon Technologies (United States)
Abstract
Convolutional neural networks have been widely deployed in various application scenarios. In order to extend the applications' boundaries to some accuracy-crucial domains, researchers have been investigating approaches to boost accuracy through either deeper or wider network structures, which brings with them the exponential increment of the computational and storage cost, delaying the responding time. In this paper, we propose a general training framework named self distillation, which notably enhances the performance (accuracy) of convolutional neural networks through shrinking the size of the network rather than aggrandizing it. Different from traditional knowledge distillation - a knowledge transformation…
Citation impact
- FWCI
- 31.13
- Percentile
- 100%
- References
- 80
Authors
6Topics & keywords
- Computer science
- Distillation
- Softmax function
- Convolutional neural network
- Artificial intelligence
- Flexibility (engineering)
- Artificial neural network
- Generalization