More Control for Free! Image Synthesis with Semantic Diffusion Guidance
University of California, Berkeley · University of Hong Kong · +1 more institution
Abstract
Controllable image synthesis models allow creation of diverse images based on text instructions or guidance from a reference image. Recently, denoising diffusion probabilistic models have been shown to generate more realistic imagery than prior methods, and have been successfully demonstrated in unconditional and class-conditional settings. We investigate fine-grained, continuous control of this model class, and introduce a novel unified framework for semantic diffusion guidance, which allows either language or image guidance, or both. Guidance is injected into a pretrained unconditional diffusion model using the gradient of image-text or image matching scores, without re-training the diffusion model. We…
Citation impact
- FWCI
- 10.46
- Percentile
- 100%
- References
- 82
Authors
9Topics & keywords
- Computer science
- Image (mathematics)
- Image synthesis
- Class (philosophy)
- Matching (statistics)
- Artificial intelligence
- Diffusion
- Probabilistic logic
- Quality Education