articleJul 1, 2017Closed access

ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes

Stanford University · Princeton University

Indexed incrossref

Abstract

A key requirement for leveraging supervised deep learning methods is the availability of large, labeled datasets. Unfortunately, in the context of RGB-D scene understanding, very little data is available - current datasets cover a small range of scene views and have limited semantic annotations. To address this issue, we introduce ScanNet, an RGB-D video dataset containing 2.5M views in 1513 scenes annotated with 3D camera poses, surface reconstructions, and semantic segmentations. To collect this data, we designed an easy-to-use and scalable RGB-D capture system that includes automated surface reconstruction and crowd-sourced semantic annotation.We show that using this data helps achieve state-of-the-art…

Citation impact

3,977
total citations
FWCI
378.74
Percentile
100%
References
109
Citations per year

Authors

6

Topics & keywords

Keywords
  • Computer science
  • Computer graphics (images)
  • Computer vision
  • Geology
  • Artificial intelligence
No related works found for this paper.

Funding