Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point Modeling
Tsinghua University · Peking University
Abstract
We present Point-BERT, a new paradigm for learning Transformers to generalize the concept of BERT [8] to 3D point cloud. Inspired by BERT, we devise a Masked Point Modeling (MPM) task to pre-train point cloud Transformers. Specifically, we first divide a point cloud into several local point patches, and a point cloud Tokenizer with a discrete Variational AutoEncoder (dVAE) is designed to generate discrete point tokens containing meaningful local information. Then, we randomly mask out some patches of input point clouds and feed them into the backbone Transformers. The pre-training objective is to recover the original point tokens at the masked locations under the supervision of point tokens obtained by the…
Citation impact
- FWCI
- 129.71
- Percentile
- 100%
- References
- 103
Authors
6Topics & keywords
- Point cloud
- Computer science
- Transformer
- Autoencoder
- Point (geometry)
- Point-to-point
- Artificial intelligence
- Deep learning