Unveiling the ChatGPT phenomenon: Evaluating the consistency and accuracy of endodontic question answers

Suárez, Ana; García, Víctor Díaz‐Flores; Algar, Juan; Sánchez, Margarita Gómez; Pedro, María Llorente de; Freire, Yolanda

doi:10.1111/iej.13985

articleInternational Endodontic JournalOct 9, 2023HYBRID OA

Unveiling the ChatGPT phenomenon: Evaluating the consistency and accuracy of endodontic question answers

ASAna Suárez VDVíctor Díaz‐Flores García JAJuan Algar MGMargarita Gómez Sánchez MLMaría Llorente de Pedro

Universidad Europea de Madrid

PubMed

Indexed incrossrefpubmed

Abstract

Aim

Chatbot Generative Pre-trained Transformer (ChatGPT) is a generative artificial intelligence (AI) software based on large language models (LLMs), designed to simulate human conversations and generate novel content based on the training data it has been exposed to. The aim of this study was to evaluate the consistency and accuracy of ChatGPT-generated answers to clinical questions in endodontics, compared to answers provided by human experts. METHODOLOGY: Ninety-one dichotomous (yes/no) questions were designed and categorized into three levels of difficulty. Twenty questions were randomly selected from each difficulty level. Sixty answers were generated by ChatGPT for each question. Two endodontic experts independently answered the 60 questions. Statistical analysis was performed using the SPSS program to calculate the consistency and accuracy of the answers generated by ChatGPT compared to the experts. Confidence intervals (95%) and standard deviations were used to estimate variability.

Results

The answers generated by ChatGPT showed high consistency (85.44%). No significant differences in consistency were found based on question difficulty. In terms of answer accuracy, ChatGPT achieved an average accuracy of 57.33%. However, significant differences in accuracy were observed based on question difficulty, with lower accuracy for easier questions.

Citation impact

173

total citations

FWCI: 6.27
Percentile: 100%
References: 29

Citations per year

Authors

6

Topics & keywords

Topics

Keywords

Endodontics
Consistency (knowledge bases)
Computer science
Generative grammar
Machine learning
Artificial intelligence
Reliability (semiconductor)
Natural language processing

UN Sustainable Development Goals

Peace, Justice and strong institutions

No related works found for this paper.