Zero-shot Image-to-Image Translation
Carnegie Mellon University · Adobe Systems (United States)
Abstract
Large-scale text-to-image generative models have shown their remarkable ability to synthesize diverse, high-quality images. However, directly applying these models for real image editing remains challenging for two reasons. First, it is hard for users to craft a perfect text prompt depicting every visual detail in the input image. Second, while existing models can introduce desirable changes in certain regions, they often dramatically alter the input content and introduce unexpected changes in unwanted regions. In this work, we introduce pix2pix-zero, an image-to-image translation method that can preserve the original image’s content without manual prompting. We first automatically discover editing directions…
Citation impact
- FWCI
- 36.80
- Percentile
- 100%
- References
- 21
Authors
6Topics & keywords
- Image editing
- Computer science
- Image (mathematics)
- Image translation
- Translation (biology)
- Generative grammar
- Artificial intelligence
- Embedding
- Quality Education