A benchmark of batch-effect correction methods for single-cell RNA sequencing data
Agency for Science, Technology and Research · Singapore Immunology Network
Abstract
Large-scale single-cell transcriptomic datasets generated using different technologies contain batch-specific systematic variations that present a challenge to batch-effect removal and data integration. With continued growth expected in scRNA-seq data, achieving effective batch integration with available computational resources is crucial. Here, we perform an in-depth benchmark study on available batch correction methods to determine the most suitable method for batch-effect removal.
We compare 14 methods in terms of computational runtime, the ability to handle large datasets, and batch-effect correction efficacy while preserving cell type purity. Five scenarios are designed for the study: identical cell types with different technologies, non-identical cell types, multiple batches, big data, and simulated data. Performance is evaluated using four benchmarking metrics including kBET, LISI, ASW, and ARI. We also investigate the use of batch-corrected data to study differential gene expression.
Citation impact
- FWCI
- 63.43
- Percentile
- 100%
- References
- 45
Authors
7- HTHoa Thi TranCorresponding
Agency for Science, Technology and Research, Singapore Immunology Network
- KSKok Siong Ang
Agency for Science, Technology and Research, Singapore Immunology Network
- MCMarion Chevrier
Agency for Science, Technology and Research, Singapore Immunology Network
- XZXiaomeng Zhang
Agency for Science, Technology and Research, Singapore Immunology Network
- NYNicole Yee Shin Lee
Agency for Science, Technology and Research, Singapore Immunology Network
Topics & keywords
- Benchmark (surveying)
- Benchmarking
- Computer science
- Batch processing
- Data mining
- Data integration
- Industry, innovation and infrastructure