Double-check: validation of diagnostic statistics for PLS-DA models in metabolomics studies
University of Amsterdam · Netherlands Metabolomics Centre
Abstract
Partial Least Squares-Discriminant Analysis (PLS-DA) is a PLS regression method with a special binary 'dummy' y-variable and it is commonly used for classification purposes and biomarker selection in metabolomics studies. Several statistical approaches are currently in use to validate outcomes of PLS-DA analyses e.g. double cross validation procedures or permutation testing. However, there is a great inconsistency in the optimization and the assessment of performance of PLS-DA models due to many different diagnostic statistics currently employed in metabolomics data analyses. In this paper, properties of four diagnostic statistics of PLS-DA, namely the number of misclassifications (NMC), the Area Under the…
Citation impact
- FWCI
- 8.25
- Percentile
- 100%
- References
- 42
Authors
4- ESEwa SzymańskaCorresponding
University of Amsterdam, Netherlands Metabolomics Centre
- ESEdoardo Saccenti
Netherlands Metabolomics Centre, University of Amsterdam
- AKAge K. Smilde
Netherlands Metabolomics Centre, University of Amsterdam
- JAJohan A. Westerhuis
University of Amsterdam, Netherlands Metabolomics Centre
Topics & keywords
- Linear discriminant analysis
- Statistics
- Permutation (music)
- Receiver operating characteristic
- Partial least squares regression
- Mathematics
- Cross-validation
- Statistical hypothesis testing
- Reduced inequalities