Robust fine-tuning of zero-shot models

Wortsman, Mitchell; Ilharco, Gabriel; Kim, Jong Wook; Li, Mike; Kornblith, Simon; Roelofs, Rebecca; Lopes, Raphael Gontijo; Hajishirzi, Hannaneh; Farhadi, Ali; Namkoong, Hongseok; Schmidt, Ludwig

doi:10.1109/cvpr52688.2022.00780

Article · 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) · Jun 1, 2022 · Closed access

Mitchell Wortsman · Gabriel Ilharco · Jong Wook Kim · Mike Li · Simon Kornblith

University of Washington · OpenAI (United States) · +3 more institutions

Indexed in Crossref

Abstract

Large pre-trained models such as CLIP or ALIGN offer consistent accuracy across a range of data distributions when performing zero-shot inference (i.e., without fine-tuning on a specific dataset). Although existing fine-tuning methods substantially improve accuracy on a given target distribution, they often reduce robustness to distribution shifts. We address this tension by introducing a simple and effective method for improving robustness while fine-tuning: ensembling the weights of the zero-shot and fine-tuned models (WiSE-FT). Compared to standard fine-tuning, WiSE-FT provides large accuracy improvements under distribution shift, while preserving high accuracy on the target distribution. On ImageNet and…
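
The weight-space ensembling described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the two sets of weights are combined; the sketch assumes a per-parameter linear interpolation controlled by a mixing coefficient `alpha`, with `alpha = 0` recovering the zero-shot model and `alpha = 1` the fine-tuned one. The function name `interpolate_weights`, the parameter names, and the toy values are hypothetical, and plain Python lists stand in for real model tensors.

```python
from typing import Dict, List

# Assumption: each model is represented as a mapping from parameter name
# to a flat list of floats. Real models would use framework tensors.
Weights = Dict[str, List[float]]


def interpolate_weights(zero_shot: Weights, fine_tuned: Weights, alpha: float) -> Weights:
    """Return (1 - alpha) * zero_shot + alpha * fine_tuned, parameter by parameter."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    if zero_shot.keys() != fine_tuned.keys():
        raise ValueError("both models must share the same parameter names")

    merged: Weights = {}
    for name, w0 in zero_shot.items():
        w1 = fine_tuned[name]
        if len(w0) != len(w1):
            raise ValueError(f"shape mismatch for parameter {name!r}")
        # Element-wise linear interpolation between the two weight vectors.
        merged[name] = [(1.0 - alpha) * a + alpha * b for a, b in zip(w0, w1)]
    return merged


# Toy, made-up parameter values used purely for illustration.
zero_shot = {"layer.weight": [0.0, 2.0], "layer.bias": [1.0]}
fine_tuned = {"layer.weight": [1.0, 4.0], "layer.bias": [3.0]}

merged = interpolate_weights(zero_shot, fine_tuned, alpha=0.5)
```

Because the combination happens in weight space, the merged model is a single set of parameters, so inference costs the same as running either original model alone. The choice `alpha=0.5` here is arbitrary and is not taken from the paper.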

Citation impact

Total citations: 367

FWCI: 36.58
Percentile: 100%
References: 205

Authors: 11

Topics & keywords

Keywords

Robustness (evolution)
Fine-tuning
Computer science
Inference
Range (aeronautics)
Algorithm
Artificial intelligence
Engineering

Funding

National Science Foundation
Award: 1652052