ActiveLLM: Large Language Model-Based Active Learning for Textual Few-Shot Scenarios

MBMarkus BayerJLJustin LutzCRChristian Reuter

Technische Universität Darmstadt

Indexed inarxivcrossrefdatacitedoaj

Abstract

Abstract Active learning is designed to minimize annotation efforts by prioritizing instances that most enhance learning. However, many active learning strategies struggle with a ‘cold-start’ problem, needing substantial initial data to be effective. This limitation reduces their utility in the increasingly relevant few-shot scenarios, where the instance selection has a substantial impact. To address this, we introduce ActiveLLM, a novel active learning approach that leverages Large Language Models such as GPT-4, o1, Llama 3, or Mistral Large for selecting instances. We demonstrate that ActiveLLM significantly enhances the classification performance of BERT classifiers in few-shot scenarios, outperforming…

Citation impact

5
total citations
FWCI
23.31
Percentile
98%
References
29
Citations per year

Authors

3

Topics & keywords

Keywords
  • Shot (pellet)
  • Computer science
  • One shot
  • Artificial intelligence
  • Natural language processing
  • Engineering
  • Chemistry
No related works found for this paper.

Funding