MonoScene: Monocular 3D Semantic Scene Completion

Cao, Anh-Quan; Charette, Raoul de

doi:10.1109/cvpr52688.2022.00396

article2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)Jun 1, 2022GREEN OA

MonoScene: Monocular 3D Semantic Scene Completion

ACAnh-Quan Cao RDRaoul de Charette

Institut national de recherche en informatique et en automatique

Indexed inarxivcrossref

Abstract

MonoScene proposes a 3D Semantic Scene Completion (SSC) framework, where the dense geometry and semantics of a scene are inferred from a single monocular RGB image. Different from the SSC literature, relying on 2.5 or 3D input, we solve the complex problem of 2D to 3D scene reconstruction while jointly inferring its semantics. Our framework relies on successive 2D and 3D UNets, bridged by a novel 2D-3D features projection inspired by optics, and introduces a 3D context relation prior to enforce spatio-semantic consistency. Along with architectural contributions, we introduce novel global scene and local frustums losses. Experiments show we outperform the literature on all metries and datasets while…

Citation impact

254

total citations

FWCI: 13.51
Percentile: 100%
References: 134

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

Computer science
Hallucinating
Semantics (computer science)
Artificial intelligence
Context (archaeology)
Monocular
Computer vision
Relation (database)

UN Sustainable Development Goals

Sustainable cities and communities

No related works found for this paper.