Prompt-aligned Gradient for Prompt Tuning
Nanyang Technological University · Columbia University · +1 more institution
Abstract
Thanks to the large pre-trained vision-language models (VLMs) like CLIP [37], we can craft a zero-shot classifier by discrete prompt design, e.g., the confidence score of an image being "[CLASS]" can be obtained by using the VLM provided similarity between the image and the prompt sentence "a photo of a [CLASS]". Furthermore, prompting shows great potential for fast adaptation of VLMs to downstream tasks if we fine-tune the soft prompts with few samples. However, we find a common failure that improper fine-tuning or learning with extremely few-shot samples may even under-perform the zero-shot prediction. Existing methods still address this problem by using traditional anti-overfitting techniques such as early…
Citation impact
- FWCI
- 38.01
- Percentile
- 100%
- References
- 67
Authors
5Topics & keywords
- Overfitting
- Computer science
- Artificial intelligence
- Generalization
- Classifier (UML)
- Domain adaptation
- Transfer of learning
- Machine learning