Evaluating AI-generated patient education materials for spinal surgeries: Comparative analysis of readability and DISCERN quality across ChatGPT and deepseek models
University of South Australia · Second Affiliated Hospital of Soochow University · +2 more institutions
Abstract
Access to patient-centered health information is essential for informed decision-making. However, online medical resources vary in quality and often fail to accommodate differing degrees of health literacy. This issue is particularly evident in surgical contexts, where complex terminology obstructs patient comprehension. With the increasing reliance on AI models for supplementary medical information, the reliability and readability of AI-generated content require thorough evaluation.
This study aimed to evaluate four natural language processing models-ChatGPT-4o, ChatGPT-o3 mini, DeepSeek-V3, and DeepSeek-R1-in generating patient education materials for three common spinal surgeries: lumbar discectomy, spinal fusion, and decompressive laminectomy. Information quality was evaluated using the DISCERN score, and readability was assessed through Flesch-Kincaid indices.
Citation impact
- FWCI
- 42.84
- Percentile
- 100%
- References
- 36
Authors
5- MZMi ZhouCorresponding
University of South Australia
- YPYunfeng Pan
Second Affiliated Hospital of Soochow University, Soochow University
- YZYuye Zhang
Soochow University, Second Affiliated Hospital of Soochow University
- XSXiaomei SongCorresponding
Soochow University, Second Affiliated Hospital of Soochow University
- YZYoubin Zhou
Jinling Institute of Technology
Topics & keywords
- Readability
- Quality (philosophy)
- Computer science
- Patient education
- Medicine
- Medical education
- Medical physics
- Multimedia
- Quality Education