articleArtificial Intelligence ReviewJan 3, 2026HYBRID OA

Hallucination to truth: a review of fact-checking and factuality evaluation in large language models

Artificial Intelligence in Medicine (Canada) · United International University · +3 more institutions

Indexed inarxivcrossref

Abstract

Abstract Large language models (LLMs) are trained on vast and diverse internet corpora that often include inaccurate or misleading content. Consequently, LLMs can generate misinformation, making robust fact-checking essential. This review systematically analyzes how LLM-generated content is evaluated for factual accuracy by exploring key challenges such as hallucinations, dataset limitations, and the reliability of evaluation metrics. The review emphasizes the need for strong fact-checking frameworks that integrate advanced prompting strategies, domain-specific fine-tuning, and retrieval-augmented generation (RAG) methods. It proposes five research questions that guide the analysis of the recent literature…

No related works found for this paper.

Funding