articleOct 1, 2023Closed access

Prompt-aligned Gradient for Prompt Tuning

Nanyang Technological University · Columbia University · +1 more institution

Indexed incrossref

Abstract

Thanks to the large pre-trained vision-language models (VLMs) like CLIP [37], we can craft a zero-shot classifier by discrete prompt design, e.g., the confidence score of an image being "[CLASS]" can be obtained by using the VLM provided similarity between the image and the prompt sentence "a photo of a [CLASS]". Furthermore, prompting shows great potential for fast adaptation of VLMs to downstream tasks if we fine-tune the soft prompts with few samples. However, we find a common failure that improper fine-tuning or learning with extremely few-shot samples may even under-perform the zero-shot prediction. Existing methods still address this problem by using traditional anti-overfitting techniques such as early…

Citation impact

228
total citations
FWCI
38.01
Percentile
100%
References
67
Citations per year

Authors

5

Topics & keywords

Keywords
  • Overfitting
  • Computer science
  • Artificial intelligence
  • Generalization
  • Classifier (UML)
  • Domain adaptation
  • Transfer of learning
  • Machine learning
No related works found for this paper.

Funding