Bias in Large Language Models: Origin, Evaluation, and Mitigation
George Washington University · University of Connecticut · +2 more institutions
Abstract
Large language models (LLMs) have revolutionized natural language processing, but their susceptibility to biases poses significant challenges. This comprehensive review examines the landscape of bias in LLMs, from its origins to current mitigation strategies. We categorize biases as intrinsic and extrinsic, analyzing their manifestations in various natural language processing (NLP) tasks. The review critically assesses a range of bias evaluation methods, including data-level, model-level, and output-level approaches, providing researchers with a robust toolkit for bias detection. We further explore mitigation strategies, categorizing them into pre-model, intra-model, and post-model techniques, highlighting…
Citation impact
- FWCI
- 97.40
- Percentile
- 99%
- References
- 0
Authors
8Topics & keywords
- Computer science