Deep Convolutional Inverse Graphics Network

Kulkarni, Tejas D.; WHITNEY, WILLIAM F.; Kohli, Pushmeet; Tenenbaum, Joshua B.

doi:10.48550/arxiv.1503.03167

articlearXiv (Cornell University)Mar 11, 2015GREEN OA

Deep Convolutional Inverse Graphics Network

TDTejas D. Kulkarni WFWILLIAM F. WHITNEY PKPushmeet Kohli JBJoshua B. Tenenbaum

Massachusetts Institute of Technology · Microsoft (United States)

Indexed inarxivdatacite

Abstract

This paper presents the Deep Convolution Inverse Graphics Network (DC-IGN), a model that learns an interpretable representation of images. This representation is disentangled with respect to transformations such as out-of-plane rotations and lighting variations. The DC-IGN model is composed of multiple layers of convolution and de-convolution operators and is trained using the Stochastic Gradient Variational Bayes (SGVB) algorithm. We propose a training procedure to encourage neurons in the graphics code layer to represent a specific transformation (e.g. pose or light). Given a single input image, our model can generate new images of the same object with variations in pose and lighting. We present qualitative…

Citation impact

751

total citations

FWCI: —
Percentile: —
References: 23

Citations per year

Authors

4

Topics & keywords

Topics

Keywords

Computer science
Artificial intelligence
Graphics
Rendering (computer graphics)
Convolution (computer science)
Convolutional neural network
Computer vision
Computer graphics

No related works found for this paper.