articleScienceMar 16, 2023HYBRID OA

Evolutionary-scale prediction of atomic-level protein structure with a language model

The Metropolitan Opera (United States) · New York University · +3 more institutions

PubMed
Indexed incrossrefpubmed

Abstract

Recent advances in machine learning have leveraged evolutionary information in multiple sequence alignments to predict protein structure. We demonstrate direct inference of full atomic-level protein structure from primary sequence using a large language model. As language models of protein sequences are scaled up to 15 billion parameters, an atomic-resolution picture of protein structure emerges in the learned representations. This results in an order-of-magnitude acceleration of high-resolution structure prediction, which enables large-scale structural characterization of metagenomic proteins. We apply this capability to construct the ESM Metagenomic Atlas by predicting structures for >617 million metagenomic…

Citation impact

4,717
total citations
FWCI
684.07
Percentile
100%
References
64
Citations per year

Authors

15

Topics & keywords

Keywords
  • Metagenomics
  • Computer science
  • Inference
  • Protein structure prediction
  • Construct (python library)
  • Sequence (biology)
  • Protein structure
  • Scale (ratio)
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.