Visual-Language Prompt Tuning with Knowledge-Guided Context Optimization

Yao, Hantao; Zhang, Rui; Xu, Changsheng

doi:10.1109/cvpr52729.2023.00653

articleJun 1, 2023Closed access

Visual-Language Prompt Tuning with Knowledge-Guided Context Optimization

HYHantao Yao RZRui Zhang CXChangsheng Xu

Artificial Intelligence in Medicine (Canada) · Shandong Institute of Automation · +2 more institutions

Indexed incrossref

Abstract

Prompt tuning is an effective way to adapt the pretrained visual-language model (VLM) to the downstream task using task-related textual tokens. Representative CoOp-based work combines the learnable textual tokens with the class tokens to obtain specific textual knowledge. However, the specific textual knowledge is worse generalization to the unseen classes because it forgets the essential general textual knowledge having a strong generalization ability. To tackle this issue, we introduce a novel Knowledge-guided Context Optimization (KgCoOp) to enhance the generalization ability of the learnable prompt for unseen classes. The key insight of KgCoOp is that the forgetting about essential knowledge can be…

Citation impact

197

total citations

FWCI: 22.55
Percentile: 100%
References: 60

Citations per year

Authors

3

Topics & keywords

Topics

Keywords

Computer science
Generalization
Discriminative model
Task (project management)
Artificial intelligence
Context (archaeology)
Forgetting
Natural language processing

UN Sustainable Development Goals

Reduced inequalities

No related works found for this paper.