Mask3D: Mask Transformer for 3D Semantic Instance Segmentation

Schult, Jonas; Engelmann, Francis; Hermans, Alexander; Litany, Or; Tang, Siyu; Leibe, Bastian

doi:10.1109/icra48891.2023.10160590

articleMay 29, 2023Closed access

Mask3D: Mask Transformer for 3D Semantic Instance Segmentation

JSJonas Schult FEFrancis Engelmann AHAlexander Hermans OLOr Litany STSiyu Tang

RWTH Aachen University · Nvidia (United States)

Indexed incrossref

Abstract

Modern 3D semantic instance segmentation approaches predominantly rely on specialized voting mechanisms followed by carefully designed geometric clustering techniques. Building on the successes of recent Transformer-based methods for object detection and image segmentation, we propose the first Transformer-based approach for 3D semantic instance segmentation. We show that we can leverage generic Transformer building blocks to directly predict instance masks from 3D point clouds. In our model - called Mask3D - each object instance is represented as an instance query. Using Transformer decoders, the instance queries are learned by iteratively attending to point cloud features at multiple scales. Combined with…

Citation impact

188

total citations

FWCI: 38.25
Percentile: 100%
References: 72

Citations per year

Authors

6

Topics & keywords

Topics

Keywords

Computer science
Segmentation
Point cloud
Transformer
Leverage (statistics)
Artificial intelligence
Voting
Cluster analysis

No related works found for this paper.