Comparative Evaluation of Advanced AI Reasoning Models in Pediatric Clinical Decision Support: ChatGPT O1 vs. DeepSeek-R1

Mondillo, Gianluca; Colosimo, Simone; Perrotta, Alessandra; Frattolillo, Vittoria; Masino, Mariapia

doi:10.1101/2025.01.27.25321169

Preprint · medRxiv · Jan 28, 2025 · Green OA

University of Campania "Luigi Vanvitelli" · Azienda Ospedaliera Universitaria Università degli Studi della Campania Luigi Vanvitelli

Indexed in Crossref

Abstract

Introduction: The adoption of advanced reasoning models, such as ChatGPT O1 and DeepSeek-R1, represents a pivotal step forward in clinical decision support, particularly in pediatrics. ChatGPT O1 employs “chain-of-thought reasoning” (CoT) to enhance structured problem-solving, while DeepSeek-R1 introduces self-reflection capabilities through reinforcement learning. This study aimed to evaluate the diagnostic accuracy and clinical utility of these models in pediatric scenarios using the MedQA dataset.

Materials and Methods: A total of 500 multiple-choice pediatric questions from the MedQA dataset were presented to ChatGPT O1 and DeepSeek-R1. Each question included four or more options, with one correct…

Citation impact

Total citations: 48
References: 4

Keywords

Computer science
Decision support system
Clinical decision support system
Artificial intelligence
