Genome modelling and design across all domains of life with Evo 2
Palo Alto Institute · Arc Research Institute · +14 more institutions
Abstract
All of life encodes information with DNA. Although tools for genome sequencing, synthesis and editing have transformed biological research, we still lack sufficient understanding of the immense complexity encoded by genomes to predict the effects of many classes of genomic changes or to intelligently compose new biological systems. Artificial intelligence models that learn information from genomic sequences across diverse organisms have increasingly advanced prediction and design capabilities1,2. Here we introduce Evo 2, a biological foundation model trained on 9 trillion DNA base pairs from a highly curated genomic atlas spanning all domains of life to have a 1 million token context window with…
Citation impact
- FWCI
- 292.23
- Percentile
- 100%
- References
- 50
Authors
62- GBGaryk BrixiCorresponding
Palo Alto Institute, Arc Research Institute, Stanford University, Core Laboratories (United States)
- MGMatthew G. Durrant
Palo Alto Institute, Arc Research Institute, Core Laboratories (United States)
- JKJerome Ku
Palo Alto Institute, Arc Research Institute, Core Laboratories (United States)
- MNMohsen Naghipourfar
Palo Alto Institute, Arc Research Institute, University of California, Berkeley, Core Laboratories (United States)
- MPMichael Poli
BioQ Pharma (United States), Stanford University, Core Laboratories (United States)
Topics & keywords
- Genome
- Genomics
- Model organism
- Human genome