Comparing scientific abstracts generated by ChatGPT to original abstracts using an artificial intelligence output detector, plagiarism detector, and blinded human reviewers
Indexed incrossref
Abstract
Abstract Background Large language models such as ChatGPT can produce increasingly realistic text, with unknown information on the accuracy and integrity of using these models in scientific writing. Methods We gathered ten research abstracts from five high impact factor medical journals (n=50) and asked ChatGPT to generate research abstracts based on their titles and journals. We evaluated the abstracts using an artificial intelligence (AI) output detector, plagiarism detector, and had blinded human reviewers try to distinguish whether abstracts were original or generated. Results All ChatGPT-generated abstracts were written clearly but only 8% correctly followed the specific journal’s formatting requirements.…
Citation impact
412
total citations
- FWCI
- —
- Percentile
- —
- References
- 19
Citations per year
Authors
7Topics & keywords
Topics
Keywords
- Detector
- Originality
- Computer science
- Natural language processing
- Plagiarism detection
- Interquartile range
- Artificial intelligence
- Information retrieval
UN Sustainable Development Goals
- Quality Education
No related works found for this paper.