Photographic Image Synthesis with Cascaded Refinement Networks
Intel (United States) · Stanford University
Abstract
We present an approach to synthesizing photographic images conditioned on semantic layouts. Given a semantic label map, our approach produces an image with photographic appearance that conforms to the input layout. The approach thus functions as a rendering engine that takes a two-dimensional semantic specification of the scene and produces a corresponding photographic image. Unlike recent and contemporaneous work, our approach does not rely on adversarial training. We show that photographic images can be synthesized from semantic layouts by a single feedforward network with appropriate structure, trained end-to-end with a direct regression objective. The presented approach scales seamlessly to high…
Citation impact
- FWCI
- 41.67
- Percentile
- 100%
- References
- 97
Authors
2Topics & keywords
- Computer science
- Rendering (computer graphics)
- Artificial intelligence
- Image synthesis
- Computer vision
- Image (mathematics)
- Image resolution
- Generative adversarial network
- Life below water