LISA: Reasoning Segmentation via Large Language Model
University of Hong Kong · Start Making A Reader Today
Abstract
Although perception systems have made remarkable ad-vancements in recent years, they still rely on explicit human instruction or pre-defined categories to identify the target objects before executing visual recognition tasks. Such systems cannot actively reason and comprehend implicit user intention. In this work, we propose a new segmentation task - reasoning segmentation. The task is designed to output a segmentation mask given a complex and implicit query text. Furthermore, we establish a benchmark comprising over one thousand image-instruction-mask data samples, incorporating intricate reasoning and world knowledge for evaluation purposes. Finally, we present LISA: large Language Instructed Segmentation…
Citation impact
- FWCI
- 82.29
- Percentile
- 100%
- References
- 83
Authors
7Topics & keywords
- Computer science
- Segmentation
- Natural language processing
- Artificial intelligence