articleBMJJan 29, 2026HYBRID OA

Machine learning based screening of potential paper mill publications in cancer research: methodological and cross sectional study

Centre National de la Recherche Scientifique · Institut de recherche mathématique de Rennes · +3 more institutions

PubMed
Indexed incrossrefpubmed

Abstract

Objectives

To train and validate a machine learning model to distinguish paper mill publications from genuine cancer research articles, and to screen the cancer research literature to assess the prevalence of papers that have textual similarities to paper mill papers.

Design

Methodological and cross sectional study applying a BERT (bidirectional encoder representations from transformers) based, text classification model to article titles and abstracts.

No related works found for this paper.