Comparing scientific abstracts generated by ChatGPT to original abstracts using an artificial intelligence output detector, plagiarism detector, and blinded human reviewers

Gao, Catherine A.; Howard, Frederick M.; Markov, Nikolay S.; Dyer, Emma; Ramesh, Siddhi; Luo, Yuan; Pearson, Alexander T.

doi:10.1101/2022.12.23.521610

preprintbioRxiv (Cold Spring Harbor Laboratory)Dec 27, 2022GREEN OA

Comparing scientific abstracts generated by ChatGPT to original abstracts using an artificial intelligence output detector, plagiarism detector, and blinded human reviewers

CACatherine A. Gao FMFrederick M. Howard NSNikolay S. Markov EDEmma Dyer SRSiddhi Ramesh

University of Chicago

Indexed incrossref

Abstract

Abstract Background Large language models such as ChatGPT can produce increasingly realistic text, with unknown information on the accuracy and integrity of using these models in scientific writing. Methods We gathered ten research abstracts from five high impact factor medical journals (n=50) and asked ChatGPT to generate research abstracts based on their titles and journals. We evaluated the abstracts using an artificial intelligence (AI) output detector, plagiarism detector, and had blinded human reviewers try to distinguish whether abstracts were original or generated. Results All ChatGPT-generated abstracts were written clearly but only 8% correctly followed the specific journal’s formatting requirements.…

Citation impact

412

total citations

FWCI: —
Percentile: —
References: 19

Citations per year

Authors

7

Topics & keywords

Topics

Keywords

Detector
Originality
Computer science
Natural language processing
Plagiarism detection
Interquartile range
Artificial intelligence
Information retrieval

UN Sustainable Development Goals

Quality Education

No related works found for this paper.