Parameter-efficient fine-tuning of large-scale pre-trained language models

Ding, Ning; Qin, Yujia; Yang, Guang; Wei, Fuchao; Yang, Zonghan; Su, Yusheng; Hu, Shengding; Chen, Yulin; Chan, Chi-Min; Chen, Weize; Yi, Jing; Zhao, Weilin; Wang, Xiaozhi; Liu, Zhiyuan; Zheng, Hai-Tao; Chen, Jianfei; Liu, Yang; Tang, Jie; Li, Juanzi; Sun, Maosong

doi:10.1038/s42256-023-00626-4

articleNature Machine IntelligenceMar 2, 2023HYBRID OA

Parameter-efficient fine-tuning of large-scale pre-trained language models

NDNing Ding YQYujia Qin GYGuang Yang FWFuchao Wei ZYZonghan Yang

Beijing Academy of Artificial Intelligence · Tsinghua University · +1 more institution

Indexed incrossref

Abstract

Abstract With the prevalence of pre-trained language models (PLMs) and the pre-training–fine-tuning paradigm, it has been continuously shown that larger models tend to yield better performance. However, as PLMs scale up, fine-tuning and storing all the parameters is prohibitively costly and eventually becomes practically infeasible. This necessitates a new branch of research focusing on the parameter-efficient adaptation of PLMs, which optimizes a small portion of the model parameters while keeping the rest fixed, drastically cutting down computation and storage costs. In general, it demonstrates that large-scale models could be effectively stimulated by the optimization of a few parameters. Despite the…

Citation impact

880

total citations

FWCI: 145.58
Percentile: 100%
References: 38

Citations per year

Authors

20

Topics & keywords

Topics

Keywords

Computer science
Fine-tuning
Scale (ratio)
Categorization
Computation
Adaptation (eye)
Language model
Artificial intelligence

UN Sustainable Development Goals

Quality Education

No related works found for this paper.