articleOct 1, 2023Closed access

Zero-1-to-3: Zero-shot One Image to 3D Object

Columbia University · Toyota Research Institute

Indexed incrossref

Abstract

We introduce Zero-1-to-3, a framework for changing the camera viewpoint of an object given just a single RGB image. To perform novel view synthesis in this under-constrained setting, we capitalize on the geometric priors that large-scale diffusion models learn about natural images. Our conditional diffusion model uses a synthetic dataset to learn controls of the relative camera viewpoint, which allow new images to be generated of the same object under a specified camera transformation. Even though it is trained on a synthetic dataset, our model retains a strong zero-shot generalization ability to out-of-distribution datasets as well as in-the-wild images, including impressionist paintings. Our…

Citation impact

678
total citations
FWCI
77.18
Percentile
100%
References
73
Citations per year

Authors

6

Topics & keywords

Keywords
  • Artificial intelligence
  • Computer science
  • Computer vision
  • Object (grammar)
  • Generalization
  • Transformation (genetics)
  • Geometric transformation
  • Prior probability
UN Sustainable Development Goals
  • Sustainable cities and communities
No related works found for this paper.

Funding