Diffusion Autoencoders: Toward a Meaningful and Decodable Representation

Preechakul, Konpat; Chatthee, Nattanat; Wizadwongsa, Suttisak; Suwajanakorn, Supasorn

doi:10.1109/cvpr52688.2022.01036

Article; 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); Jun 1, 2022; Closed access

Vidyasirimedhi Institute of Science and Technology

Indexed in Crossref

Abstract

Diffusion probabilistic models (DPMs) have achieved remarkable quality in image generation that rivals GANs'. But unlike GANs, DPMs use a set of latent variables that lack semantic meaning and cannot serve as a useful representation for other tasks. This paper explores the possibility of using DPMs for representation learning and seeks to extract a meaningful and decodable representation of an input image via autoencoding. Our key idea is to use a learnable encoder for discovering the high-level semantics, and a DPM as the decoder for modeling the remaining stochastic variations. Our method can encode any image into a two-part latent code where the first part is semantically meaningful and linear, and the…

Citation impact

Total citations: 281
FWCI: 15.52
Percentile: 100%
References: 72

Authors: 4

Topics & keywords

Keywords

Computer science
Representation (politics)
Encoder
Semantics (computer science)
Encoding (memory)
Probabilistic logic
Artificial intelligence
Code (set theory)
