ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and Planning
University of Toronto · Université de Montréal · +2 more institutions
Abstract
For robots to perform a wide variety of tasks, they require a 3D representation of the world that is semantically rich, yet compact and efficient for task-driven perception and planning. Recent approaches have attempted to leverage features from large vision-language models to encode semantics in 3D representations. However, these approaches tend to produce maps with per-point feature vectors, which do not scale well in larger environments, nor do they contain semantic spatial relationships between entities in the environment, which are useful for downstream planning. In this work, we propose ConceptGraphs, an open-vocabulary graph-structured representation for 3D scenes. ConceptGraphs is built by leveraging…
Citation impact
- FWCI
- 229.26
- Percentile
- 100%
- References
- 75
Authors
16Topics & keywords
- Computer science
- Vocabulary
- Perception
- Artificial intelligence
- Computer vision
- Natural language processing
- Linguistics
- Psychology