Diagnostic Accuracy of Differential-Diagnosis Lists Generated by Generative Pretrained Transformer 3 Chatbot for Clinical Vignettes with Common Chief Complaints: A Pilot Study

Dokkyo Medical University

PubMed
Indexed incrossrefpubmed

Abstract

The diagnostic accuracy of differential diagnoses generated by artificial intelligence (AI) chatbots, including the generative pretrained transformer 3 (GPT-3) chatbot (ChatGPT-3) is unknown. This study evaluated the accuracy of differential-diagnosis lists generated by ChatGPT-3 for clinical vignettes with common chief complaints. General internal medicine physicians created clinical cases, correct diagnoses, and five differential diagnoses for ten common chief complaints. The rate of correct diagnosis by ChatGPT-3 within the ten differential-diagnosis lists was 28/30 (93.3%). The rate of correct diagnosis by physicians was still superior to that by ChatGPT-3 within the five differential-diagnosis lists…

Citation impact

359
total citations
FWCI
12.91
Percentile
100%
References
41
Citations per year

Authors

6

Topics & keywords

Keywords
  • Differential diagnosis
  • Medical diagnosis
  • Medicine
  • Diagnostic accuracy
  • Pediatrics
  • Internal medicine
  • Radiology
  • Pathology
No related works found for this paper.

Funding