A critical assessment of using ChatGPT for extracting structured data from clinical notes

Huang, Jingwei; Yang, Donghan M.; Rong, Ruichen; Nezafati, Kuroush; Treager, Colin; Chi, Zhikai; Wang, Shidan; Cheng, Xian; Guo, Yujia; Klesse, Laura J.; Xiao, Guanghua; Peterson, Eric D.; Zhan, Xiaowei; Xie, Yang

doi:10.1038/s41746-024-01079-8

articlenpj Digital MedicineMay 1, 2024GOLD OA

A critical assessment of using ChatGPT for extracting structured data from clinical notes

JHJingwei Huang DMDonghan M. Yang RRRuichen Rong KNKuroush Nezafati CTColin Treager

The University of Texas Southwestern Medical Center · Southwestern Medical Center

PubMed

Indexed incrossrefdoajpubmed

Abstract

Existing natural language processing (NLP) methods to convert free-text clinical notes into structured data often require problem-specific annotations and model training. This study aims to evaluate ChatGPT's capacity to extract information from free-text medical notes efficiently and comprehensively. We developed a large language model (LLM)-based workflow, utilizing systems engineering methodology and spiral "prompt engineering" process, leveraging OpenAI's API for batch querying ChatGPT. We evaluated the effectiveness of this method using a dataset of more than 1000 lung cancer pathology reports and a dataset of 191 pediatric osteosarcoma pathology reports, comparing the ChatGPT-3.5 (gpt-3.5-turbo-16k)…

Citation impact

233

total citations

FWCI: 24.84
Percentile: 100%
References: 22

Citations per year

Authors

14

Topics & keywords

Topics

Keywords

Computer science
Data science

UN Sustainable Development Goals

Peace, Justice and strong institutions

No related works found for this paper.