Improving deep learning protein monomer and complex structure prediction using DeepMSA2 with huge metagenomics data
University of Michigan · Michigan State University · +2 more institutions
Abstract
Leveraging iterative alignment search through genomic and metagenome sequence databases, we report the DeepMSA2 pipeline for uniform protein single- and multichain multiple-sequence alignment (MSA) construction. Large-scale benchmarks show that DeepMSA2 MSAs can remarkably increase the accuracy of protein tertiary and quaternary structure predictions compared with current state-of-the-art methods. An integrated pipeline with DeepMSA2 participated in the most recent CASP15 experiment and created complex structural models with considerably higher quality than the AlphaFold2-Multimer server (v.2.2.0). Detailed data analyses show that the major advantage of DeepMSA2 lies in its balanced alignment search and…
Citation impact
- FWCI
- 25.75
- Percentile
- 100%
- References
- 40
Authors
6Topics & keywords
- Pipeline (software)
- Metagenomics
- Computer science
- Data mining
- Deep learning
- Sequence (biology)
- Artificial intelligence
- Machine learning
Funding
- NSNational Science FoundationAwards: IIS1901191, ACI1548562, MTM2025426, S10OD026825, GM136422, DBI2030790, AI134678
- DODivision of Intramural Research, National Institute of Allergy and Infectious DiseasesAward: AI134678
- NINational Institute of General Medical SciencesAwards: AI134678, ACI1548562, IIS1901191, S10OD026825, DBI2030790, MTM2025426, GM136422
- NINational Institute of Allergy and Infectious DiseasesAwards: MTM2025426, IIS1901191, ACI1548562, DBI2030790, S10OD026825, GM136422, AI134678