Fast and accurate protein structure search with Foldseek
Max Planck Institute for Multidisciplinary Sciences · Seoul National University · +4 more institutions
Abstract
As structure prediction methods are generating millions of publicly available protein structures, searching these databases is becoming a bottleneck. Foldseek aligns the structure of a query protein against a database by describing tertiary amino acid interactions within proteins as sequences over a structural alphabet. Foldseek decreases computation times by four to five orders of magnitude with 86%, 88% and 133% of the sensitivities of Dali, TM-align and CE, respectively.
Citation impact
- FWCI
- 328.13
- Percentile
- 100%
- References
- 49
Authors
8- MVMichel van KempenCorresponding
Max Planck Institute for Multidisciplinary Sciences
- SKStephanie Kim
Seoul National University
- CTCharlotte Tumescheit
Seoul National University
- MMMilot Mirdita
Seoul National University, Max Planck Institute for Multidisciplinary Sciences
- JLJeong-Jae Lee
Seoul National University
Topics & keywords
- Bottleneck
- Alphabet
- Computer science
- Computation
- Protein structure
- Protein tertiary structure
- Computational biology
- Information retrieval