MirrorGAN: Learning Text-To-Image Generation by Redescription
University of Technology Sydney · Zhejiang University · +3 more institutions
Abstract
Generating an image from a given text description has two goals: visual realism and semantic consistency. Although significant progress has been made in generating high-quality and visually realistic images using generative adversarial networks, guaranteeing semantic consistency between the text description and visual content remains very challenging. In this paper, we address this problem by proposing a novel global-local attentive and semantic-preserving text-to-image-to-text framework called MirrorGAN. MirrorGAN exploits the idea of learning text-to-image generation by redescription and consists of three modules: a semantic text embedding module (STEM), a global-local collaborative attentive module for…
Citation impact
- FWCI
- 31.84
- Percentile
- 100%
- References
- 57
Authors
4- TQTingting QiaoCorresponding
University of Technology Sydney, Zhejiang University, University of Sydney, Zhejiang University of Science and Technology
- JZJing Zhang
Hangzhou Dianzi University, University of Sydney, University of Technology Sydney
- DXDuanqing Xu
Zhejiang University of Science and Technology, University of Sydney
- DTDacheng Tao
University of Technology Sydney, Zhejiang University, University of Sydney
Topics & keywords
- Computer science
- Consistency (knowledge bases)
- Artificial intelligence
- Sentence
- Benchmark (surveying)
- Natural language processing
- Word (group theory)
- Image (mathematics)
- Quality Education