Diffusion Autoencoders: Toward a Meaningful and Decodable Representation
Vidyasirimedhi Institute of Science and Technology
Abstract
Diffusion probabilistic models (DPMs) have achieved remarkable quality in image generation that rivals GANs'. But unlike GANs, DPMs use a set of latent variables that lack semantic meaning and cannot serve as a useful representation for other tasks. This paper explores the possibility of using DPMs for representation learning and seeks to extract a meaningful and decodable representation of an input image via autoencoding. Our key idea is to use a learnable encoder for discovering the high-level semantics, and a DPM as the decoder for modeling the remaining stochastic variations. Our method can encode any image into a two-part latent code where the first part is semantically meaningful and linear, and the…
Citation impact
- FWCI
- 15.52
- Percentile
- 100%
- References
- 72
Authors
4Topics & keywords
- Computer science
- Representation (politics)
- Encoder
- Semantics (computer science)
- Encoding (memory)
- Probabilistic logic
- Artificial intelligence
- Code (set theory)