Robust fine-tuning of zero-shot models
University of Washington · OpenAI (United States) · +3 more institutions
Abstract
Large pre-trained models such as CLIP or ALIGN offer consistent accuracy across a range of data distributions when performing zero-shot inference (i.e., without fine-tuning on a specific dataset). Although existing fine-tuning methods substantially improve accuracy on a given target distribution, they often reduce robustness to distribution shifts. We address this tension by introducing a simple and effective method for improving robustness while fine-tuning: ensembling the weights of the zero-shot and fine-tuned models (WiSE-FT). Compared to standard fine-tuning, WiSE-FT provides large accuracy improvements under distribution shift, while preserving high accuracy on the target distribution. On ImageNet and…
Citation impact
- FWCI
- 36.58
- Percentile
- 100%
- References
- 205
Authors
11Topics & keywords
- Robustness (evolution)
- Fine-tuning
- Computer science
- Inference
- Range (aeronautics)
- Algorithm
- Artificial intelligence
- Engineering