Robust fine-tuning of zero-shot models

University of Washington · OpenAI (United States) · +3 more institutions

Indexed incrossref

Abstract

Large pre-trained models such as CLIP or ALIGN offer consistent accuracy across a range of data distributions when performing zero-shot inference (i.e., without fine-tuning on a specific dataset). Although existing fine-tuning methods substantially improve accuracy on a given target distribution, they often reduce robustness to distribution shifts. We address this tension by introducing a simple and effective method for improving robustness while fine-tuning: ensembling the weights of the zero-shot and fine-tuned models (WiSE-FT). Compared to standard fine-tuning, WiSE-FT provides large accuracy improvements under distribution shift, while preserving high accuracy on the target distribution. On ImageNet and…

Citation impact

367
total citations
FWCI
36.58
Percentile
100%
References
205
Citations per year

Authors

11

Topics & keywords

Keywords
  • Robustness (evolution)
  • Fine-tuning
  • Computer science
  • Inference
  • Range (aeronautics)
  • Algorithm
  • Artificial intelligence
  • Engineering
No related works found for this paper.

Funding