articleQSAR & Combinatorial ScienceJul 17, 2007BRONZE OA

On Some Aspects of Variable Selection for Partial Least Squares Regression Models

Jadavpur University

Indexed incrossref

Abstract

Abstract This paper tries to explore the optimum variable selection strategy for Partial Least Squares (PLS) regression using a model dataset of cytoprotection data. The compounds of the dataset were classified using K ‐means clustering technique applied on standardized descriptor matrix and ten combinations of training and test sets were generated based on the obtained clusters. For a particular training set, PLS models were developed with a number of components optimized by leave‐one‐out Q 2 and then the developed models were validated (externally) using the test set compounds. For each set, PLS model was initially constructed using all descriptors (variables). The variables having least standardized values…

Citation impact

802
total citations
FWCI
33.56
Percentile
100%
References
26
Citations per year

Authors

2

Topics & keywords

Keywords
  • Partial least squares regression
  • Mathematics
  • Statistics
  • Feature selection
  • Regression analysis
  • Set (abstract data type)
  • Test set
  • Linear regression
No related works found for this paper.