SDFusion: Multimodal 3D Shape Completion, Reconstruction, and Generation

Cheng, Yen-Chi; Lee, Hsin-Ying; Tulyakov, Sergey; Schwing, Alexander G.; Gui, Liang-Yan

doi:10.1109/cvpr52729.2023.00433

articleJun 1, 2023Closed access

SDFusion: Multimodal 3D Shape Completion, Reconstruction, and Generation

YCYen-Chi Cheng HLHsin-Ying Lee STSergey Tulyakov AGAlexander G. Schwing LGLiang-Yan Gui

University of Illinois Urbana-Champaign · Snap (United States)

Indexed incrossref

Abstract

In this work, we present a novel framework built to sim-plify 3D asset generation for amateur users. To enable interactive generation, our method supports a variety of input modalities that can be easily provided by a human, in-cluding images, text, partially observed shapes and combinations of these, further allowing to adjust the strength of each input. At the core of our approach is an encoder-decoder, compressing 3D shapes into a compact latent representation, upon which a diffusion model is learned. To enable a variety of multimodal inputs, we employ task-specific encoders with dropout followed by a cross-attention mechanism. Due to its flexibility, our model naturally supports a variety of tasks,…

Citation impact

174

total citations

FWCI: 35.13
Percentile: 100%
References: 80

Citations per year

Authors

5

Topics & keywords

Topics

Keywords

Computer science
Encoder
Interactivity
Artificial intelligence
Flexibility (engineering)
Variety (cybernetics)
3D reconstruction
Representation (politics)

No related works found for this paper.

Funding

NS
National Science Foundation
Award: 2008387,2045586,2106825