Least squares after model selection in high-dimensional sparse models
Memorial Medical Center · IQVIA (United States)
Abstract
In this article we study post-model selection estimators that apply ordinary least squares (OLS) to the model selected by first-step penalized estimators, typically Lasso. It is well known that Lasso can estimate the nonparametric regression function at nearly the oracle rate, and is thus hard to improve upon. We show that the OLS post-Lasso estimator performs at least as well as Lasso in terms of the rate of convergence, and has the advantage of a smaller bias. Remarkably, this performance occurs even if the Lasso-based model selection “fails” in the sense of missing some components of the “true” regression model. By the “true” model, we mean the best $s$-dimensional approximation to the nonparametric…
Citation impact
- FWCI
- 23.86
- Percentile
- 100%
- References
- 53
Authors
2Topics & keywords
- Lasso (programming language)
- Estimator
- Mathematics
- Model selection
- Ordinary least squares
- Rate of convergence
- Nonparametric regression
- Nonparametric statistics