A Review of Feature Selection Methods for Machine Learning-Based Disease Risk Prediction
University of Auckland · Maurice Wilkins Centre · +5 more institutions
Abstract
Machine learning has shown utility in detecting patterns within large, unstructured, and complex datasets. One of the promising applications of machine learning is in precision medicine, where disease risk is predicted using patient genetic data. However, creating an accurate prediction model based on genotype data remains challenging due to the so-called "curse of dimensionality" (i.e., extensively larger number of features compared to the number of samples). Therefore, the generalizability of machine learning models benefits from feature selection, which aims to extract only the most "informative" features and remove noisy "non-informative," irrelevant and redundant features. In this article, we provide a…
Citation impact
- FWCI
- 173.19
- Percentile
- 100%
- References
- 143
Authors
4- NPNicholas Pudjihartono
University of Auckland
- TFTayaza Fadason
University of Auckland, Maurice Wilkins Centre
- AWAndreas W. Kempa-LiehrCorresponding
University of Auckland
- JMJustin M. O’SullivanCorresponding
Agency for Science, Technology and Research, Garvan Institute of Medical Research, University of Auckland, Singapore Institute for Clinical Sciences, Maurice Wilkins Centre, MRC Lifecourse Epidemiology Unit, University of Southampton
Topics & keywords
- Feature selection
- Machine learning
- Artificial intelligence
- Computer science
- Selection (genetic algorithm)
- Feature (linguistics)
- Philosophy