Zero-shot Image-to-Image Translation

Parmar, Gaurav; Singh, Krishna Kumar; Zhang, Richard; Li, Yijun; Lu, Jingwan; Zhu, Jun-Yan

doi:10.1145/3588432.3591513

articleJul 19, 2023GOLD OA

Zero-shot Image-to-Image Translation

GPGaurav Parmar KKKrishna Kumar Singh RZRichard Zhang YLYijun Li JLJingwan Lu

Carnegie Mellon University · Adobe Systems (United States)

Indexed incrossref

Abstract

Large-scale text-to-image generative models have shown their remarkable ability to synthesize diverse, high-quality images. However, directly applying these models for real image editing remains challenging for two reasons. First, it is hard for users to craft a perfect text prompt depicting every visual detail in the input image. Second, while existing models can introduce desirable changes in certain regions, they often dramatically alter the input content and introduce unexpected changes in unwanted regions. In this work, we introduce pix2pix-zero, an image-to-image translation method that can preserve the original image’s content without manual prompting. We first automatically discover editing directions…

Citation impact

322

total citations

FWCI: 36.80
Percentile: 100%
References: 21

Citations per year

Authors

6

Topics & keywords

Topics

Keywords

Image editing
Computer science
Image (mathematics)
Image translation
Translation (biology)
Generative grammar
Artificial intelligence
Embedding

UN Sustainable Development Goals

Quality Education

No related works found for this paper.

Funding

AS
Adobe Systems