Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point Modeling

Yu, Xumin; Tang, Lulu; Rao, Yongming; Huang, Tiejun; Zhou, Jie; Lu, Jiwen

doi:10.1109/cvpr52688.2022.01871

article2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)Jun 1, 2022Closed access

Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point Modeling

XYXumin Yu LTLulu Tang YRYongming Rao THTiejun Huang JZJie Zhou

Tsinghua University · Peking University

Indexed incrossref

Abstract

We present Point-BERT, a new paradigm for learning Transformers to generalize the concept of BERT [8] to 3D point cloud. Inspired by BERT, we devise a Masked Point Modeling (MPM) task to pre-train point cloud Transformers. Specifically, we first divide a point cloud into several local point patches, and a point cloud Tokenizer with a discrete Variational AutoEncoder (dVAE) is designed to generate discrete point tokens containing meaningful local information. Then, we randomly mask out some patches of input point clouds and feed them into the backbone Transformers. The pre-training objective is to recover the original point tokens at the masked locations under the supervision of point tokens obtained by the…

Citation impact

699

total citations

FWCI: 129.71
Percentile: 100%
References: 103

Citations per year

Authors

6

Topics & keywords

Topics

Keywords

Point cloud
Computer science
Transformer
Autoencoder
Point (geometry)
Point-to-point
Artificial intelligence
Deep learning

No related works found for this paper.