Dreaming to Distill: Data-Free Knowledge Transfer via DeepInversion
Princeton University · University of Illinois Urbana-Champaign
Abstract
We introduce DeepInversion, a new method for synthesizing images from the image distribution used to train a deep neural network. We ``invert'' a trained network (teacher) to synthesize class-conditional input images starting from random noise, without using any additional information about the training dataset. Keeping the teacher fixed, our method optimizes the input while regularizing the distribution of intermediate feature maps using information stored in the batch normalization layers of the teacher. Further, we improve the diversity of synthesized images using Adaptive DeepInversion, which maximizes the Jensen-Shannon divergence between the teacher and student network logits. The resulting synthesized…
Citation impact
- FWCI
- 41.75
- Percentile
- 100%
- References
- 126
Authors
8Topics & keywords
- Computer science
- Normalization (sociology)
- Artificial intelligence
- Artificial neural network
- Divergence (linguistics)
- Transfer of learning
- Pattern recognition (psychology)
- Image (mathematics)
- Quality Education