3D Graph Neural Networks for RGBD Semantic Segmentation
Chinese University of Hong Kong · University of Toronto · +2 more institutions
Abstract
RGBD semantic segmentation requires joint reasoning about 2D appearance and 3D geometric information. In this paper we propose a 3D graph neural network (3DGNN) that builds a k-nearest neighbor graph on top of 3D point cloud. Each node in the graph corresponds to a set of points and is associated with a hidden representation vector initialized with an appearance feature extracted by a unary CNN from 2D images. Relying on recurrent functions, every node dynamically updates its hidden representation based on the current status and incoming messages from its neighbors. This propagation model is unrolled for a certain number of time steps and the final per-node representation is used for predicting the semantic…
Citation impact
- FWCI
- 19.03
- Percentile
- 100%
- References
- 66
Authors
5- XQXiaojuan QiCorresponding
Chinese University of Hong Kong
- RLRenjie Liao
University of Toronto, Advanced Technologies Group (United States)
- JJJiaya Jia
Chinese University of Hong Kong, Tencent (China)
- SFSanja Fidler
University of Toronto
- RURaquel Urtasun
University of Toronto, Advanced Technologies Group (United States)
Topics & keywords
- Unary operation
- Computer science
- Segmentation
- Artificial intelligence
- Point cloud
- Pattern recognition (psychology)
- Graph
- Representation (politics)