Valid post-selection inference
California University of Pennsylvania · University of Pennsylvania
Abstract
It is common practice in statistical data analysis to perform data-driven variable selection and derive statistical inference from the resulting model. Such inference enjoys none of the guarantees that classical statistical theory provides for tests and confidence intervals when the model has been chosen a priori. We propose to produce valid “post-selection inference” by reducing the problem to one of simultaneous inference and hence suitably widening conventional confidence and retention intervals. Simultaneity is required for all linear functions that arise as coefficient estimates in all submodels. By purchasing “simultaneity insurance” for all possible submodels, the resulting post-selection inference is…
Citation impact
- FWCI
- 34.83
- Percentile
- 100%
- References
- 96
Authors
5- RARichard A. BerkCorresponding
California University of Pennsylvania, University of Pennsylvania
- LBLawrence Brown
California University of Pennsylvania, University of Pennsylvania
- ABAndreas Buja
California University of Pennsylvania, University of Pennsylvania
- KZKai Zhang
- LZLinda Zhao
California University of Pennsylvania, University of Pennsylvania
Topics & keywords
- Inference
- Selection (genetic algorithm)
- Computer science
- Artificial intelligence