VLCIM: A Vision-Language Cyclic Interaction Model for Industrial Defect Detection

Shen, Xiangkai; Li, Lei; Ma, Yushan; Xu, Shaofeng; Liu, Jinhai; Yang, Zhiming; Shi, Yan

doi:10.1109/tim.2025.3583364

articleIEEE Transactions on Instrumentation and MeasurementJan 1, 2025Closed access

VLCIM: A Vision-Language Cyclic Interaction Model for Industrial Defect Detection

XSXiangkai Shen LLLei Li YMYushan Ma SXShaofeng Xu JLJinhai Liu

Beihang University · State Key Laboratory of Synthetical Automation for Process Industries

Indexed incrossref

Abstract

Accurate defect detection is an important element in ensuring product quality and safe equipment operation. However, due to the lack of deep cross-modal interactions during vision feature extraction, existing methods often suffer from attention bias, which ultimately limits detection accuracy. To address this issue, this paper proposes a Vision-Language Cyclic Interaction Model (VLCIM), which progressively optimizes vision feature extraction by integrating domain prior knowledge and generic large model, effectively bridging the dual-domain barrier between “generic-specific” and “vision-language”. Specifically, progressive cyclic interaction learning is proposed for the first time, which integrates a recursive…

Citation impact

44

total citations

FWCI: 43.13
Percentile: 100%
References: 41

Citations per year

Authors

7

Topics & keywords

Topics

Industrial Vision Systems and Defect Detection100%

Keywords

Computer vision
Computer science
Artificial intelligence
Machine vision
Natural language processing

No related works found for this paper.