Machine learning based screening of potential paper mill publications in cancer research: methodological and cross sectional study
Centre National de la Recherche Scientifique · Institut de recherche mathématique de Rennes · +3 more institutions
Abstract
To train and validate a machine learning model to distinguish paper mill publications from genuine cancer research articles, and to screen the cancer research literature to assess the prevalence of papers that have textual similarities to paper mill papers.
Methodological and cross sectional study applying a BERT (bidirectional encoder representations from transformers) based, text classification model to article titles and abstracts.
Citation impact
- FWCI
- 90.91
- Percentile
- 100%
- References
- 34
Authors
4- BSBaptiste ScancarCorresponding
Centre National de la Recherche Scientifique, Institut de recherche mathématique de Rennes
- JAJennifer A. Byrne
The University of Sydney, New South Wales Department of Health
- DCDavid Causeur
Centre National de la Recherche Scientifique, Institut de recherche mathématique de Rennes
- ABAdrian Barnett
Queensland University of Technology
Topics & keywords
- Mill
- Cross-sectional study
- Action (physics)
- Cancer
- Alternative medicine