preprintElectronicsApr 24, 2026GOLD OA

Bias in Large Language Models: Origin, Evaluation, and Mitigation

George Washington University · University of Connecticut · +2 more institutions

Indexed inarxivcrossrefdatacite

Abstract

Large language models (LLMs) have revolutionized natural language processing, but their susceptibility to biases poses significant challenges. This comprehensive review examines the landscape of bias in LLMs, from its origins to current mitigation strategies. We categorize biases as intrinsic and extrinsic, analyzing their manifestations in various natural language processing (NLP) tasks. The review critically assesses a range of bias evaluation methods, including data-level, model-level, and output-level approaches, providing researchers with a robust toolkit for bias detection. We further explore mitigation strategies, categorizing them into pre-model, intra-model, and post-model techniques, highlighting…

No related works found for this paper.