Enhancing malware detection with feature selection and scaling techniques using machine learning models
Wycliffe College · American University · +3 more institutions
Abstract
The increasing prevalence of malware presents a critical challenge to cybersecurity, emphasizing the need for robust detection methods. This study uses a binary tabular classification dataset to evaluate the impact of feature selection, feature scaling, and machine learning (ML) models on malware detection. The methodology involves experimenting with three feature scaling techniques (no scaling, normalization, and min-max scaling), three feature selection methods (no selection, Linear Discriminant Analysis (LDA), and Principal Component Analysis (PCA)), and twelve ML models, including traditional algorithms and ensemble methods. A publicly available dataset with 11,598 samples and 139 features is utilized, and…
Citation impact
- FWCI
- 52.38
- Percentile
- 100%
- References
- 100
Authors
8Topics & keywords
- Feature selection
- Computer science
- Normalization (sociology)
- Artificial intelligence
- Preprocessor
- Linear discriminant analysis
- Machine learning
- Malware
- Reduced inequalities