articleScientific ReportsMar 17, 2025GOLD OA

Enhancing malware detection with feature selection and scaling techniques using machine learning models

Wycliffe College · American University · +3 more institutions

PubMed
Indexed incrossrefdoajpubmed

Abstract

The increasing prevalence of malware presents a critical challenge to cybersecurity, emphasizing the need for robust detection methods. This study uses a binary tabular classification dataset to evaluate the impact of feature selection, feature scaling, and machine learning (ML) models on malware detection. The methodology involves experimenting with three feature scaling techniques (no scaling, normalization, and min-max scaling), three feature selection methods (no selection, Linear Discriminant Analysis (LDA), and Principal Component Analysis (PCA)), and twelve ML models, including traditional algorithms and ensemble methods. A publicly available dataset with 11,598 samples and 139 features is utilized, and…

Citation impact

46
total citations
FWCI
52.38
Percentile
100%
References
100
Citations per year

Authors

8

Topics & keywords

Keywords
  • Feature selection
  • Computer science
  • Normalization (sociology)
  • Artificial intelligence
  • Preprocessor
  • Linear discriminant analysis
  • Machine learning
  • Malware
UN Sustainable Development Goals
  • Reduced inequalities
No related works found for this paper.

Funding