articleNature MethodsJan 2, 2024HYBRID OA

Improving deep learning protein monomer and complex structure prediction using DeepMSA2 with huge metagenomics data

University of Michigan · Michigan State University · +2 more institutions

PubMed
Indexed incrossrefpubmed

Abstract

Leveraging iterative alignment search through genomic and metagenome sequence databases, we report the DeepMSA2 pipeline for uniform protein single- and multichain multiple-sequence alignment (MSA) construction. Large-scale benchmarks show that DeepMSA2 MSAs can remarkably increase the accuracy of protein tertiary and quaternary structure predictions compared with current state-of-the-art methods. An integrated pipeline with DeepMSA2 participated in the most recent CASP15 experiment and created complex structural models with considerably higher quality than the AlphaFold2-Multimer server (v.2.2.0). Detailed data analyses show that the major advantage of DeepMSA2 lies in its balanced alignment search and…

Citation impact

122
total citations
FWCI
25.75
Percentile
100%
References
40
Citations per year

Authors

6

Topics & keywords

Keywords
  • Pipeline (software)
  • Metagenomics
  • Computer science
  • Data mining
  • Deep learning
  • Sequence (biology)
  • Artificial intelligence
  • Machine learning
No related works found for this paper.

Funding