articleBMC BioinformaticsJun 22, 2011GOLD OA

Sparse PLS discriminant analysis: biologically relevant feature selection and graphical displays for multiclass problems

University of Queensland · Laboratoire de Génétique Cellulaire · +3 more institutions

PubMed
Indexed incrossrefdoajpubmed

Abstract

Background

Variable selection on high throughput biological data, such as gene expression or single nucleotide polymorphisms (SNPs), becomes inevitable to select relevant information and, therefore, to better characterize diseases or assess genetic structure. There are different ways to perform variable selection in large data sets. Statistical tests are commonly used to identify differentially expressed features for explanatory purposes, whereas Machine Learning wrapper approaches can be used for predictive purposes. In the case of multiple highly correlated variables, another option is to use multivariate exploratory approaches to give more insight into cell biology, biological pathways or complex traits.

Results

A simple extension of a sparse PLS exploratory approach is proposed to perform variable selection in a multiclass classification framework.

Citation impact

996
total citations
FWCI
6.52
Percentile
100%
References
62
Citations per year

Authors

3

Topics & keywords

Keywords
  • Interpretability
  • Feature selection
  • Linear discriminant analysis
  • Computer science
  • Artificial intelligence
  • Selection (genetic algorithm)
  • Machine learning
  • Data mining
UN Sustainable Development Goals
  • Reduced inequalities
No related works found for this paper.

Funding