SemiBin2: self-supervised contrastive learning leads to better MAGs for short- and long-read sequencing
Fudan University · Shanghai Institute for Science of Science · +2 more institutions
Abstract
MOTIVATION: Metagenomic binning methods to reconstruct metagenome-assembled genomes (MAGs) from environmental samples have been widely used in large-scale metagenomic studies. The recently proposed semi-supervised binning method, SemiBin, achieved state-of-the-art binning results in several environments. However, this required annotating contigs, a computationally costly and potentially biased process. RESULTS: We propose SemiBin2, which uses self-supervised learning to learn feature embeddings from the contigs. In simulated and real datasets, we show that self-supervised learning achieves better results than the semi-supervised learning used in SemiBin1 and that SemiBin2 outperforms other state-of-the-art…
Citation impact
- FWCI
- 26.15
- Percentile
- 100%
- References
- 59
Authors
3- SPShaojun PanCorresponding
Fudan University, Shanghai Institute for Science of Science, Shanghai Center for Brain Science and Brain-Inspired Technology
- XZXing‐Ming Zhao
Fudan University, Shanghai Innovative Research Center of Traditional Chinese Medicine, Shanghai Institute for Science of Science, Shanghai Center for Brain Science and Brain-Inspired Technology
- LPLuís Pedro Coelho
Fudan University, Shanghai Institute for Science of Science, Shanghai Center for Brain Science and Brain-Inspired Technology
Topics & keywords
- Computer science
- Artificial intelligence
- Natural language processing