articleJun 16, 2024Closed access

LISA: Reasoning Segmentation via Large Language Model

University of Hong Kong · Start Making A Reader Today

Indexed incrossref

Abstract

Although perception systems have made remarkable ad-vancements in recent years, they still rely on explicit human instruction or pre-defined categories to identify the target objects before executing visual recognition tasks. Such systems cannot actively reason and comprehend implicit user intention. In this work, we propose a new segmentation task - reasoning segmentation. The task is designed to output a segmentation mask given a complex and implicit query text. Furthermore, we establish a benchmark comprising over one thousand image-instruction-mask data samples, incorporating intricate reasoning and world knowledge for evaluation purposes. Finally, we present LISA: large Language Instructed Segmentation…

Citation impact

255
total citations
FWCI
82.29
Percentile
100%
References
83
Citations per year

Authors

7

Topics & keywords

Keywords
  • Computer science
  • Segmentation
  • Natural language processing
  • Artificial intelligence
No related works found for this paper.