Large Language Models for Chatbot Health Advice Studies
McMaster University · California Miramar University · +7 more institutions
Abstract
There is much interest in the clinical integration of large language models (LLMs) in health care. Many studies have assessed the ability of LLMs to provide health advice, but the quality of their reporting is uncertain.
To perform a systematic review to examine the reporting variability among peer-reviewed studies evaluating the performance of generative artificial intelligence (AI)-driven chatbots for summarizing evidence and providing health advice to inform the development of the Chatbot Assessment Reporting Tool (CHART). Evidence Review: A search of MEDLINE via Ovid, Embase via Elsevier, and Web of Science from inception to October 27, 2023, was conducted with the help of a health sciences librarian to yield 7752 articles. Two reviewers screened articles by title and abstract followed by full-text review to identify primary studies evaluating the clinical accuracy of generative AI-driven chatbots in providing health advice (chatbot health advice studies). Two reviewers then performed data extraction for 137 eligible studies.
Citation impact
- FWCI
- 64.27
- Percentile
- 100%
- References
- 186
Authors
13Topics & keywords
- Advice (programming)
- Chatbot
- MEDLINE
- Data extraction
- Medicine
- Health care
- Medical education
- Computer science