StackGAN: Text to Photo-Realistic Image Synthesis with Stacked Generative Adversarial Networks

Zhang, Han; Xu, Tao; Li, Hongsheng; Zhang, Shaoting; Wang, Xiaogang; Huang, Xiaolei; Metaxas, Dimitris

doi:10.1109/iccv.2017.629

preprintOct 1, 2017GREEN OA

StackGAN: Text to Photo-Realistic Image Synthesis with Stacked Generative Adversarial Networks

HZHan Zhang TXTao Xu HLHongsheng Li SZShaoting Zhang XWXiaogang Wang

Rutgers Sexual and Reproductive Health and Rights · Lehigh University · +2 more institutions

Indexed inarxivcrossrefdatacite

Abstract

Synthesizing high-quality images from text descriptions is a challenging problem in computer vision and has many practical applications. Samples generated by existing textto- image approaches can roughly reflect the meaning of the given descriptions, but they fail to contain necessary details and vivid object parts. In this paper, we propose Stacked Generative Adversarial Networks (StackGAN) to generate 256.256 photo-realistic images conditioned on text descriptions. We decompose the hard problem into more manageable sub-problems through a sketch-refinement process. The Stage-I GAN sketches the primitive shape and colors of the object based on the given text description, yielding Stage-I low-resolution images.…

Citation impact

706

total citations

FWCI: 45.16
Percentile: 100%
References: 53

Citations per year

Authors

7

Topics & keywords

Topics

Keywords

Computer science
Generative grammar
Benchmark (surveying)
Sketch
Object (grammar)
Image (mathematics)
Artificial intelligence
Process (computing)

No related works found for this paper.