Zero-Shot Text-Guided Object Generation with Dream Fields

Jain, Ajay N.; Mildenhall, Ben; Barron, Jonathan T.; Abbeel, Pieter; Poole, Ben

doi:10.1109/cvpr52688.2022.00094

article2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)Jun 1, 2022Closed access

Zero-Shot Text-Guided Object Generation with Dream Fields

ANAjay N. Jain BMBen Mildenhall JTJonathan T. Barron PAPieter Abbeel BPBen Poole

Berkeley College · Google (United States) · +1 more institution

Indexed incrossref

Abstract

We combine neural rendering with multi-modal image and text representations to synthesize diverse 3D objects solely from natural language descriptions. Our method, Dream Fields, can generate the geometry and color of a wide range of objects without 3D supervision. Due to the scarcity of diverse, captioned 3D data, prior methods only generate objectsfrom a handful of categories, such as ShapeNet. Instead, we guide generation with image-text models pre-trained on large datasets of captioned images from the web. Our method optimizes a Neural Radiance Field from many camera views so that rendered images score highly with a target caption according to a pre-trained CLIP model. To improve fidelity and visual…

Citation impact

362

total citations

FWCI: 70.87
Percentile: 100%
References: 88

Citations per year

Authors

5

Topics & keywords

Topics

Keywords

Computer science
Artificial intelligence
Computer vision
Rendering (computer graphics)
Radiance
Fidelity
Convolutional neural network
Prior probability

UN Sustainable Development Goals

Quality Education

No related works found for this paper.