reviewMolecular BioSystemsOct 23, 2014Closed access

PLS/OPLS models in metabolomics: the impact of permutation of dataset rows on the K-fold cross-validation quality parameters

Centre National de la Recherche Scientifique · Université Paris Cité · +18 more institutions

PubMed
Indexed incrossrefpubmed

Abstract

Among all the software packages available for discriminant analyses based on projection to latent structures (PLS-DA) or orthogonal projection to latent structures (OPLS-DA), SIMCA (Umetrics, Umeå Sweden) is the more widely used in the metabolomics field. SIMCA proposes many parameters or tests to assess the quality of the computed model (the number of significant components, R2, Q2, pCV-ANOVA, and the permutation test). Significance thresholds for these parameters are strongly application-dependent. Concerning the Q2 parameter, a significance threshold of 0.5 is generally admitted. However, during the last few years, many PLS-DA/OPLS-DA models built using SIMCA have been published with Q2 values lower than…

Citation impact

658
total citations
FWCI
12.23
Percentile
100%
References
21
Citations per year

Authors

8

Topics & keywords

Keywords
  • OPLS
  • Fold (higher-order function)
  • Row
  • Cross-validation
  • Permutation (music)
  • Metabolomics
  • Computer science
  • Computational biology
No related works found for this paper.