Bias in Large Language Models: Origin, Evaluation, and Mitigation

Guo, Yufei; Guo, Muzhe; Su, Juntao; Yang, Zhou; Zhu, Mengqiu; Li, Hongfei; Qiu, Mengyang; Liu, Shuo Shuo

doi:10.3390/electronics15091824

preprintElectronicsApr 24, 2026GOLD OA

Bias in Large Language Models: Origin, Evaluation, and Mitigation

YGYufei Guo MGMuzhe Guo JSJuntao Su ZYZhou Yang MZMengqiu Zhu

George Washington University · University of Connecticut · +2 more institutions

Indexed inarxivcrossrefdatacite

Abstract

Large language models (LLMs) have revolutionized natural language processing, but their susceptibility to biases poses significant challenges. This comprehensive review examines the landscape of bias in LLMs, from its origins to current mitigation strategies. We categorize biases as intrinsic and extrinsic, analyzing their manifestations in various natural language processing (NLP) tasks. The review critically assesses a range of bias evaluation methods, including data-level, model-level, and output-level approaches, providing researchers with a robust toolkit for bias detection. We further explore mitigation strategies, categorizing them into pre-model, intra-model, and post-model techniques, highlighting…

Citation impact

16

total citations

FWCI: 97.40
Percentile: 99%
References: 0

Citations per year

Authors

8

Topics & keywords

Topics

Natural Language Processing Techniques90%

Keywords

Computer science

No related works found for this paper.