articleJun 1, 2023Closed access

Magic3D: High-Resolution Text-to-3D Content Creation

Nvidia (United States)

Indexed incrossref

Abstract

DreamFusion [31] has recently demonstrated the utility of a pretrained text-to-image diffusion model to optimize Neural Radiance Fields (NeRF) [23], achieving remarkable text-to-3D synthesis results. However, the method has two inherent limitations: (a) extremely slow optimization of NeRF and (b) low-resolution image space supervision on NeRF, leading to low-quality 3D models with a long processing time. In this paper, we address these limitations by utilizing a two-stage optimization framework. First, we obtain a coarse model using a low-resolution diffusion prior and accelerate with a sparse 3D hash grid structure. Using the coarse representation as the initialization, we further optimize a textured 3D mesh…

Citation impact

711
total citations
FWCI
341.00
Percentile
100%
References
74
Citations per year

Authors

10

Topics & keywords

Keywords
  • Computer science
  • Initialization
  • Hash function
  • Representation (politics)
  • Artificial intelligence
No related works found for this paper.