AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models
Institute of Automation · University of Chinese Academy of Sciences
Abstract
Large Vision-Language Models (LVLMs) such as MiniGPT-4 and LLaVA have demonstrated the capability of understanding images and achieved remarkable performance in various visual tasks. Despite their strong abilities in recognizing common objects due to extensive training datasets, they lack specific domain knowledge and have a weaker understanding of localized details within objects, which hinders their effectiveness in the Industrial Anomaly Detection (IAD) task. On the other hand, most existing IAD methods only provide anomaly scores and necessitate the manual setting of thresholds to distinguish between normal and abnormal samples, which restricts their practical implementation. In this paper, we explore the…
Citation impact
- FWCI
- 26.24
- Percentile
- 100%
- References
- 48
Authors
6- ZGZhaopeng GuCorresponding
Institute of Automation, University of Chinese Academy of Sciences
- BZBingke Zhu
Institute of Automation
- GZGuibo Zhu
Institute of Automation, University of Chinese Academy of Sciences
- YCYingying Chen
Institute of Automation
- MTMing Tang
Institute of Automation, University of Chinese Academy of Sciences
Topics & keywords
- Artificial intelligence
- Computer science
- Computer vision
- Natural language processing