A benchmark of batch-effect correction methods for single-cell RNA sequencing data

Tran, Hoa Thi; Ang, Kok Siong; Chevrier, Marion; Zhang, Xiaomeng; Lee, Nicole Yee Shin; Goh, Michelle; Chen, Jinmiao

doi:10.1186/s13059-019-1850-9

articleGenome biologyJan 16, 2020GOLD OA

A benchmark of batch-effect correction methods for single-cell RNA sequencing data

HTHoa Thi Tran KSKok Siong Ang MCMarion Chevrier XZXiaomeng Zhang NYNicole Yee Shin Lee

Agency for Science, Technology and Research · Singapore Immunology Network

PubMed

Indexed incrossrefdoajpubmed

Abstract

Background

Large-scale single-cell transcriptomic datasets generated using different technologies contain batch-specific systematic variations that present a challenge to batch-effect removal and data integration. With continued growth expected in scRNA-seq data, achieving effective batch integration with available computational resources is crucial. Here, we perform an in-depth benchmark study on available batch correction methods to determine the most suitable method for batch-effect removal.

Results

We compare 14 methods in terms of computational runtime, the ability to handle large datasets, and batch-effect correction efficacy while preserving cell type purity. Five scenarios are designed for the study: identical cell types with different technologies, non-identical cell types, multiple batches, big data, and simulated data. Performance is evaluated using four benchmarking metrics including kBET, LISI, ASW, and ARI. We also investigate the use of batch-corrected data to study differential gene expression.

Citation impact

1,196

total citations

FWCI: 63.43
Percentile: 100%
References: 45

Citations per year

Authors

7

Topics & keywords

Topics

Keywords

Benchmark (surveying)
Benchmarking
Computer science
Batch processing
Data mining
Data integration

UN Sustainable Development Goals

Industry, innovation and infrastructure

No related works found for this paper.