Magic3D: High-Resolution Text-to-3D Content Creation

Lin, Chen-Hsuan; Gao, Jun; Tang, Luming; Takikawa, Towaki; Zeng, Xiaohui; Huang, Xun; Kreis, Karsten; Fidler, Sanja; Liu, Ming-Yu; Lin, Tsung-Yi

doi:10.1109/cvpr52729.2023.00037

articleJun 1, 2023Closed access

Magic3D: High-Resolution Text-to-3D Content Creation

CLChen-Hsuan Lin JGJun Gao LTLuming Tang TTTowaki Takikawa XZXiaohui Zeng

Nvidia (United States)

Indexed incrossref

Abstract

DreamFusion [31] has recently demonstrated the utility of a pretrained text-to-image diffusion model to optimize Neural Radiance Fields (NeRF) [23], achieving remarkable text-to-3D synthesis results. However, the method has two inherent limitations: (a) extremely slow optimization of NeRF and (b) low-resolution image space supervision on NeRF, leading to low-quality 3D models with a long processing time. In this paper, we address these limitations by utilizing a two-stage optimization framework. First, we obtain a coarse model using a low-resolution diffusion prior and accelerate with a sparse 3D hash grid structure. Using the coarse representation as the initialization, we further optimize a textured 3D mesh…

Citation impact

711

total citations

FWCI: 341.00
Percentile: 100%
References: 74

Citations per year

Authors

10

Topics & keywords

Topics

Keywords

Computer science
Initialization
Hash function
Representation (politics)
Artificial intelligence

No related works found for this paper.