preprintJun 1, 2016Closed access

Deep Sliding Shapes for Amodal 3D Object Detection in RGB-D Images

Princeton University

Indexed incrossref

Abstract

We focus on the task of amodal 3D object detection in RGB-D images, which aims to produce a 3D bounding box of an object in metric form at its full extent. We introduce Deep Sliding Shapes, a 3D ConvNet formulation that takes a 3D volumetric scene from a RGB-D image as input and outputs 3D object bounding boxes. In our approach, we propose the first 3D Region Proposal Network (RPN) to learn objectness from geometric shapes and the first joint Object Recognition Network (ORN) to extract geometric features in 3D and color features in 2D. In particular, we handle objects of various sizes by training an amodal RPN at two different scales and an ORN to regress 3D bounding boxes. Experiments show that our algorithm…

Citation impact

719
total citations
FWCI
50.35
Percentile
100%
References
47
Citations per year

Authors

2

Topics & keywords

Keywords
  • Artificial intelligence
  • Amodal perception
  • Computer vision
  • Computer science
  • Minimum bounding box
  • Bounding overwatch
  • Object (grammar)
  • Focus (optics)
No related works found for this paper.