Enhanced secondary analysis of survival data: reconstructing the data from published Kaplan-Meier survival curves
Abstract
The results of Randomized Controlled Trials (RCTs) on time-to-event outcomes that are usually reported are median time to events and Cox Hazard Ratio. These do not constitute the sufficient statistics required for meta-analysis or cost-effectiveness analysis, and their use in secondary analyses requires strong assumptions that may not have been adequately tested. In order to enhance the quality of secondary data analyses, we propose a method which derives from the published Kaplan Meier survival curves a close approximation to the original individual patient time-to-event data from which they were generated.
We develop an algorithm that maps from digitised curves back to KM data by finding numerical solutions to the inverted KM equations, using where available information on number of events and numbers at risk. The reproducibility and accuracy of survival probabilities, median survival times and hazard ratios based on reconstructed KM data was assessed by comparing published statistics (survival probabilities, medians and hazard ratios) with statistics based on repeated reconstructions by multiple observers.
Citation impact
- FWCI
- 22.03
- Percentile
- 100%
- References
- 33
Authors
4Topics & keywords
- Statistics
- Hazard ratio
- Survival analysis
- Event (particle physics)
- Proportional hazards model
- Hazard
- Medicine
- Mathematics