Enhancing malware detection with feature selection and scaling techniques using machine learning models

Hasan, Rakibul; Biswas, Barna; Samiun, Md; Saleh, Md. Abu; Prabha, Mani; Akter, Jahanara; Joya, Fatema Haque; Abdullah, Masuk

doi:10.1038/s41598-025-93447-x

articleScientific ReportsMar 17, 2025GOLD OA

Enhancing malware detection with feature selection and scaling techniques using machine learning models

RHRakibul Hasan BBBarna Biswas MSMd Samiun MAMd. Abu Saleh MPMani Prabha

Wycliffe College · American University · +3 more institutions

PubMed

Indexed incrossrefdoajpubmed

Abstract

The increasing prevalence of malware presents a critical challenge to cybersecurity, emphasizing the need for robust detection methods. This study uses a binary tabular classification dataset to evaluate the impact of feature selection, feature scaling, and machine learning (ML) models on malware detection. The methodology involves experimenting with three feature scaling techniques (no scaling, normalization, and min-max scaling), three feature selection methods (no selection, Linear Discriminant Analysis (LDA), and Principal Component Analysis (PCA)), and twelve ML models, including traditional algorithms and ensemble methods. A publicly available dataset with 11,598 samples and 139 features is utilized, and…

Citation impact

46

total citations

FWCI: 52.38
Percentile: 100%
References: 100

Citations per year

Authors

8

Topics & keywords

Topics

Keywords

Feature selection
Computer science
Normalization (sociology)
Artificial intelligence
Preprocessor
Linear discriminant analysis
Machine learning
Malware

UN Sustainable Development Goals

Reduced inequalities

No related works found for this paper.

Funding

DE
Debreceni Egyetem