Reproducible Scaling Laws for Contrastive Language-Image Learning
University of Washington · University of California, Berkeley · +1 more institution
Abstract
Scaling up neural networks has led to remarkable performance across a wide range of tasks. Moreover, performance often follows reliable scaling laws as a function of training set size, model size, and compute, which offers valuable guidance as large-scale experiments are becoming increasingly expensive. However, previous work on scaling laws has primarily used private data & models or focused on uni-modal language or vision learning. To address these limitations, we investigate scaling laws for contrastive language-image pre-training (CLIP) with the public LAION dataset and the open-source OpenCLIP repository. Our large-scale experiments involve models trained on up to two billion image-text pairs and identify…
Citation impact
- FWCI
- 54.98
- Percentile
- 100%
- References
- 119
Authors
9Topics & keywords
- Computer science
- Artificial intelligence
- Natural language processing
- Image (mathematics)
- Scaling law
- Linguistics
- Scaling
- Philosophy
- Peace, Justice and strong institutions