Hallucination to truth: a review of fact-checking and factuality evaluation in large language models

Rahman, S M Asif Ur; Islam, Md. Adnanul; Alam, Md. Mahbub; Zeba, Musarrat; Rahman, Md Abdur; Chowa, Sadia Sultana; Raiaan, Mohaimenul Azam Khan; Azam, Sami

doi:10.1007/s10462-025-11454-w

articleArtificial Intelligence ReviewJan 3, 2026HYBRID OA

Hallucination to truth: a review of fact-checking and factuality evaluation in large language models

SMS M Asif Ur Rahman MAMd. Adnanul Islam MMMd. Mahbub Alam MZMusarrat Zeba MAMd Abdur Rahman

Artificial Intelligence in Medicine (Canada) · United International University · +3 more institutions

Indexed inarxivcrossref

Abstract

Abstract Large language models (LLMs) are trained on vast and diverse internet corpora that often include inaccurate or misleading content. Consequently, LLMs can generate misinformation, making robust fact-checking essential. This review systematically analyzes how LLM-generated content is evaluated for factual accuracy by exploring key challenges such as hallucinations, dataset limitations, and the reliability of evaluation metrics. The review emphasizes the need for strong fact-checking frameworks that integrate advanced prompting strategies, domain-specific fine-tuning, and retrieval-augmented generation (RAG) methods. It proposes five research questions that guide the analysis of the recent literature…

Citation impact

11

total citations

FWCI: 411.94
Percentile: 100%
References: 81

Too recent for citation history.

Authors

8

Topics & keywords

Topics

Keywords

Consistency (knowledge bases)
Key (lock)
Trustworthiness
The Internet
Reliability (semiconductor)

No related works found for this paper.

Funding

CD
Charles Darwin University