Imagic: Text-Based Real Image Editing with Diffusion Models
Technion – Israel Institute of Technology · Google (United States) · +1 more institution
Abstract
Text-conditioned image editing has recently attracted considerable interest. However, most methods are currently limited to one of the following: specific editing types (e.g., object overlay, style transfer), synthetically generated images, or requiring multiple input images of a common object. In this paper we demonstrate, for the very first time, the ability to apply complex (e.g., non-rigid) text-based semantic edits to a single real image. For example, we can change the posture and composition of one or multiple objects inside an image, while preserving its original characteristics. Our method can make a standing dog sit down, cause a bird to spread its wings, etc. – each within its single high-resolution…
Citation impact
- FWCI
- 77.82
- Percentile
- 100%
- References
- 106
Authors
8Topics & keywords
- Computer science
- Image editing
- Image (mathematics)
- Embedding
- Object (grammar)
- Artificial intelligence
- Computer vision
- Benchmark (surveying)