RODIN: A Generative Model for Sculpting 3D Digital Avatars Using Diffusion
Hong Kong University of Science and Technology · Microsoft Research (United Kingdom)
Abstract
This paper presents a 3D diffusion model that automatically generates 3D digital avatars represented as neural radiance fields (NeRFs). A significant challenge for 3D diffusion is that the memory and processing costs are prohibitive for producing high-quality results with rich details. To tackle this problem, we propose the roll-out diffusion network (RODIN), which takes a 3D NeRF model represented as multiple 2D feature maps and rolls out them onto a single 2D feature plane within which we perform 3D-aware diffusion. The RODIN model brings much-needed computational efficiency while preserving the integrity of 3D diffusion by using 3D-aware convolution that attends to projected features in the 2D plane…
Citation impact
- FWCI
- 21.23
- Percentile
- 100%
- References
- 119
Authors
11Topics & keywords
- Computer science
- Feature (linguistics)
- Avatar
- Convolution (computer science)
- Artificial intelligence
- 3D modeling
- Generative model
- Diffusion