Estimating excess mortality due to the COVID-19 pandemic: a systematic analysis of COVID-19-related mortality, 2020–21
Abstract
Mortality statistics are fundamental to public health decision making. Mortality varies by time and location, and its measurement is affected by well known biases that have been exacerbated during the COVID-19 pandemic. This paper aims to estimate excess mortality from the COVID-19 pandemic in 191 countries and territories, and 252 subnational units for selected countries, from Jan 1, 2020, to Dec 31, 2021.
All-cause mortality reports were collected for 74 countries and territories and 266 subnational locations (including 31 locations in low-income and middle-income countries) that had reported either weekly or monthly deaths from all causes during the pandemic in 2020 and 2021, and for up to 11 year previously. In addition, we obtained excess mortality data for 12 states in India. Excess mortality over time was calculated as observed mortality, after excluding data from periods affected by late registration and anomalies such as heat waves, minus expected mortality. Six models were used to estimate expected mortality; final estimates of expected mortality were based on an ensemble of these models. Ensemble weights were based on root mean squared errors derived from an out-of-sample predictive validity test. As mortality records are incomplete worldwide, we built a statistical model that predicted the excess mortality rate for locations and periods where all-cause mortality data were not available. We used least absolute shrinkage and selection operator (LASSO) regression as a variable selection mechanism and selected 15 covariates, including both covariates pertaining to the COVID-19 pandemic, such as seroprevalence, and to background population health metrics, such as the Healthcare Access and Quality Index, with direction of effects on excess mortality concordant with a meta-analysis by the US Centers for Disease Control and Prevention. With the selected best model, we ran a prediction process using 100 draws for each covariate and 100 draws of estimated coefficients and residuals, estimated from the regressions run at the draw level using draw-level input data on both excess mortality and covariates. Mean values and 95% uncertainty intervals were then generated at national, regional, and global levels. Out-of-sample predictive validity testing was done on the basis of our final model specification.
Citation impact
- FWCI
- 166.61
- Percentile
- 100%
- References
- 32
Authors
96- HWHaidong WangCorresponding
- KRKatherine R Paulson
- SASpencer A Pease
- SWStefanie Watson
- HCHaley Comfort
Topics & keywords
- Excess mortality
- MEDLINE
- Population
- Estimation
- Statistical analysis
Funding
- NSNational Science FoundationAward: 2031096
- BABill and Melinda Gates Foundation
- SASouth African Medical Research Council
- MOMinistry of Education, Culture, Sports, Science and TechnologyAward: /2016
- CDCoordenação de Aperfeiçoamento de Pessoal de Nível SuperiorAwards: 88887, /2020-00, 88887.507149/2020-00, 2020-00
- CNConselho Nacional de Desenvolvimento Científico e TecnológicoAwards: 310679, 465518/2014-1, 310679/2016-8 and 465518/2014-1, 2014-1, PPM-00428-17, 2016-8, -8 and, 310679/2016-8
- MRMedical Research Council
- NHNational Health and Medical Research CouncilAwards: 310679, APP1121516, 2031096