Parameter-efficient fine-tuning of large-scale pre-trained language models
Beijing Academy of Artificial Intelligence · Tsinghua University · +1 more institution
Abstract
Abstract With the prevalence of pre-trained language models (PLMs) and the pre-training–fine-tuning paradigm, it has been continuously shown that larger models tend to yield better performance. However, as PLMs scale up, fine-tuning and storing all the parameters is prohibitively costly and eventually becomes practically infeasible. This necessitates a new branch of research focusing on the parameter-efficient adaptation of PLMs, which optimizes a small portion of the model parameters while keeping the rest fixed, drastically cutting down computation and storage costs. In general, it demonstrates that large-scale models could be effectively stimulated by the optimization of a few parameters. Despite the…
Citation impact
- FWCI
- 145.58
- Percentile
- 100%
- References
- 38
Authors
20Topics & keywords
- Computer science
- Fine-tuning
- Scale (ratio)
- Categorization
- Computation
- Adaptation (eye)
- Language model
- Artificial intelligence
- Quality Education