Class imbalance should not throw you off balance: Choosing the right classifiers and performance metrics for brain decoding with imbalanced data
Osnabrück University · Université de Montréal · +11 more institutions
Abstract
Machine learning (ML) is increasingly used in cognitive, computational and clinical neuroscience. The reliable and efficient application of ML requires a sound understanding of its subtleties and limitations. Training ML models on datasets with imbalanced classes is a particularly common problem, and it can have severe consequences if not adequately addressed. With the neuroscience ML user in mind, this paper provides a didactic assessment of the class imbalance problem and illustrates its impact through systematic manipulation of data imbalance ratios in (i) simulated data and (ii) brain data recorded with electroencephalography (EEG), magnetoencephalography (MEG) and functional magnetic resonance imaging…
Citation impact
- FWCI
- 30.85
- Percentile
- 100%
- References
- 95
Authors
18- PTPhilipp ThölkeCorresponding
Osnabrück University, Université de Montréal
- YMYorguin-José Mantilla-Ramos
Universidad de Antioquia, Université de Montréal
- HAHamza Abdelhedi
Université de Montréal
- CMCharlotte Maschke
Montreal Neurological Institute and Hospital, McGill University, Université de Montréal
- ADArthur Dehgan
Centre National de la Recherche Scientifique, Aix-Marseille Université, Institut de Neurosciences de la Timone, Université de Montréal
Topics & keywords
- Magnetoencephalography
- Computer science
- Robustness (evolution)
- Metric (unit)
- Artificial intelligence
- Binary classification
- Decoding methods
- Machine learning
- Reduced inequalities
Funding
- MUMcGill University
- CRCanada Research Chairs
- MMitacs
- CFCanada First Research Excellence Fund
- IDInstitut de Valorisation des Données
- CFCourtois Foundation
- CICanadian Institutes of Health Research
- NSNatural Sciences and Engineering Research Council of Canada
- FDFonds de recherche du Québec – Nature et technologiesAwards: 265502, 2020-RS4-265502