More Control for Free! Image Synthesis with Semantic Diffusion Guidance

University of California, Berkeley · University of Hong Kong · +1 more institution

Indexed incrossref

Abstract

Controllable image synthesis models allow creation of diverse images based on text instructions or guidance from a reference image. Recently, denoising diffusion probabilistic models have been shown to generate more realistic imagery than prior methods, and have been successfully demonstrated in unconditional and class-conditional settings. We investigate fine-grained, continuous control of this model class, and introduce a novel unified framework for semantic diffusion guidance, which allows either language or image guidance, or both. Guidance is injected into a pretrained unconditional diffusion model using the gradient of image-text or image matching scores, without re-training the diffusion model. We…

Citation impact

176
total citations
FWCI
10.46
Percentile
100%
References
82
Citations per year

Authors

9

Topics & keywords

Keywords
  • Computer science
  • Image (mathematics)
  • Image synthesis
  • Class (philosophy)
  • Matching (statistics)
  • Artificial intelligence
  • Diffusion
  • Probabilistic logic
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.