On Some Aspects of Variable Selection for Partial Least Squares Regression Models
Indexed incrossref
Abstract
Abstract This paper tries to explore the optimum variable selection strategy for Partial Least Squares (PLS) regression using a model dataset of cytoprotection data. The compounds of the dataset were classified using K ‐means clustering technique applied on standardized descriptor matrix and ten combinations of training and test sets were generated based on the obtained clusters. For a particular training set, PLS models were developed with a number of components optimized by leave‐one‐out Q 2 and then the developed models were validated (externally) using the test set compounds. For each set, PLS model was initially constructed using all descriptors (variables). The variables having least standardized values…
Citation impact
802
total citations
- FWCI
- 33.56
- Percentile
- 100%
- References
- 26
Citations per year
Authors
2Topics & keywords
Topics
Keywords
- Partial least squares regression
- Mathematics
- Statistics
- Feature selection
- Regression analysis
- Set (abstract data type)
- Test set
- Linear regression
No related works found for this paper.