PPT: Pre-trained Prompt Tuning for Few-shot Learning

Gu, Yuxian; Han, Xu; Liu, Zhiyuan; Huang, Minlie

doi:10.18653/v1/2022.acl-long.576

articleProceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)Jan 1, 2022HYBRID OA

PPT: Pre-trained Prompt Tuning for Few-shot Learning

YGYuxian Gu XHXu Han ZLZhiyuan Liu MHMinlie Huang

Center for Information Technology · Tsinghua University · +1 more institution

Indexed incrossref

Abstract

Prompts for pre-trained language models (PLMs) have shown remarkable performance by bridging the gap between pre-training tasks and various downstream tasks. Among these methods, prompt tuning, which freezes PLMs and only tunes soft prompts, provides an efficient and effective solution for adapting largescale PLMs to downstream tasks. However, prompt tuning is yet to be fully explored. In our pilot experiments, we find that prompt tuning performs comparably with conventional full-model tuning when downstream data are sufficient, whereas it is much worse under fewshot learning settings, which may hinder the application of prompt tuning. We attribute this low performance to the manner of initializing soft…

Citation impact

245

total citations

FWCI: 24.10
Percentile: 100%
References: 58

Citations per year

Authors

4

Topics & keywords

Topics

Keywords

Computer science
Initialization
Task (project management)
Bridging (networking)
Artificial intelligence
Generalization
Machine learning

No related works found for this paper.