Fantastically Ordered Prompts and Where to Find Them: Overcoming Few-Shot Prompt Order Sensitivity

University College London

Indexed incrossref

Abstract

When primed with only a handful of training samples, very large, pretrained language models such as GPT-3 have shown competitive results when compared to fully-supervised, fine-tuned, large, pretrained language models. We demonstrate that the order in which the samples are provided can make the difference between near state-of-the-art and random guess performance: essentially some permutations are “fantastic” and some not. We analyse this phenomenon in detail, establishing that: it is present across model sizes (even for the largest current models), it is not related to a specific subset of samples, and that a given good permutation for one model is not transferable to another. While one could use a…

Citation impact

436
total citations
FWCI
41.44
Percentile
100%
References
36
Citations per year

Authors

5

Topics & keywords

Keywords
  • Computer science
  • Permutation (music)
  • Set (abstract data type)
  • Language model
  • Construct (python library)
  • Generative grammar
  • Artificial intelligence
  • Entropy (arrow of time)
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.