articleJul 19, 2023GOLD OA

Zero-shot Image-to-Image Translation

Carnegie Mellon University · Adobe Systems (United States)

Indexed incrossref

Abstract

Large-scale text-to-image generative models have shown their remarkable ability to synthesize diverse, high-quality images. However, directly applying these models for real image editing remains challenging for two reasons. First, it is hard for users to craft a perfect text prompt depicting every visual detail in the input image. Second, while existing models can introduce desirable changes in certain regions, they often dramatically alter the input content and introduce unexpected changes in unwanted regions. In this work, we introduce pix2pix-zero, an image-to-image translation method that can preserve the original image’s content without manual prompting. We first automatically discover editing directions…

Citation impact

322
total citations
FWCI
36.80
Percentile
100%
References
21
Citations per year

Authors

6

Topics & keywords

Keywords
  • Image editing
  • Computer science
  • Image (mathematics)
  • Image translation
  • Translation (biology)
  • Generative grammar
  • Artificial intelligence
  • Embedding
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.

Funding