Analysis of 6.4 million SARS-CoV-2 genomes identifies mutations associated with fitness
Broad Institute · San Francisco Foundation · +7 more institutions
Abstract
Repeated emergence of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) variants with increased fitness underscores the value of rapid detection and characterization of new lineages. We have developed PyR 0 , a hierarchical Bayesian multinomial logistic regression model that infers relative prevalence of all viral lineages across geographic regions, detects lineages increasing in prevalence, and identifies mutations relevant to fitness. Applying PyR 0 to all publicly available SARS-CoV-2 genomes, we identify numerous substitutions that increase fitness, including previously identified spike mutations and many nonspike mutations within the nucleocapsid and nonstructural proteins. PyR 0 forecasts…
Citation impact
- FWCI
- 35.18
- Percentile
- 100%
- References
- 51
Authors
13- FOFritz ObermeyerCorresponding
Broad Institute, San Francisco Foundation, University of San Francisco
- MJMartin Jankowiak
Broad Institute, San Francisco Foundation, University of San Francisco
- NBNikolaos Barkas
Broad Institute
- SFS. F. Schaffner
Broad Institute, Harvard University
- JDJesse D. Pyle
Broad Institute
Topics & keywords
- Genome
- Biology
- Genetics
- Mutation
- Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2)
- Evolutionary biology
- Coronavirus disease 2019 (COVID-19)
- Genetic Fitness
- Good health and well-being