ActiveLLM: Large Language Model-Based Active Learning for Textual Few-Shot Scenarios
Technische Universität Darmstadt
Abstract
Abstract Active learning is designed to minimize annotation efforts by prioritizing instances that most enhance learning. However, many active learning strategies struggle with a ‘cold-start’ problem, needing substantial initial data to be effective. This limitation reduces their utility in the increasingly relevant few-shot scenarios, where the instance selection has a substantial impact. To address this, we introduce ActiveLLM, a novel active learning approach that leverages Large Language Models such as GPT-4, o1, Llama 3, or Mistral Large for selecting instances. We demonstrate that ActiveLLM significantly enhances the classification performance of BERT classifiers in few-shot scenarios, outperforming…
Citation impact
- FWCI
- 23.31
- Percentile
- 98%
- References
- 29
Authors
3- MBMarkus BayerCorresponding
Technische Universität Darmstadt
- JLJustin Lutz
Technische Universität Darmstadt
- CRChristian Reuter
Technische Universität Darmstadt
Topics & keywords
- Shot (pellet)
- Computer science
- One shot
- Artificial intelligence
- Natural language processing
- Engineering
- Chemistry