Evaluating AI-generated patient education materials for spinal surgeries: Comparative analysis of readability and DISCERN quality across ChatGPT and deepseek models

Zhou, Mi; Pan, Yunfeng; Zhang, Yuye; Song, Xiaomei; Zhou, Youbin

doi:10.1016/j.ijmedinf.2025.105871

articleInternational Journal of Medical InformaticsMar 13, 2025HYBRID OA

Evaluating AI-generated patient education materials for spinal surgeries: Comparative analysis of readability and DISCERN quality across ChatGPT and deepseek models

MZMi Zhou YPYunfeng Pan YZYuye Zhang XSXiaomei Song YZYoubin Zhou

University of South Australia · Second Affiliated Hospital of Soochow University · +2 more institutions

PubMed

Indexed incrossrefpubmed

Abstract

Background

Access to patient-centered health information is essential for informed decision-making. However, online medical resources vary in quality and often fail to accommodate differing degrees of health literacy. This issue is particularly evident in surgical contexts, where complex terminology obstructs patient comprehension. With the increasing reliance on AI models for supplementary medical information, the reliability and readability of AI-generated content require thorough evaluation.

Objective

This study aimed to evaluate four natural language processing models-ChatGPT-4o, ChatGPT-o3 mini, DeepSeek-V3, and DeepSeek-R1-in generating patient education materials for three common spinal surgeries: lumbar discectomy, spinal fusion, and decompressive laminectomy. Information quality was evaluated using the DISCERN score, and readability was assessed through Flesch-Kincaid indices.

Citation impact

90

total citations

FWCI: 42.84
Percentile: 100%
References: 36

Citations per year

Authors

5

Topics & keywords

Topics

Keywords

Readability
Quality (philosophy)
Computer science
Patient education
Medicine
Medical education
Medical physics
Multimedia

UN Sustainable Development Goals

Quality Education

No related works found for this paper.