Indonesian Journal of Electrical Engineering and Computer Science
ARAhuja, RakeshASAhuja, SachinGDGupta, DeepaliHMHaque, Mohd Junedul
Indexed incrossref
Abstract
Classifying high-dimensional data are a challenging task in data mining. Gene expression data is a type of high-dimensional data that has thousands of features. The study was proposing a method to extract knowledge from highdimensional gene expression data by selecting features and classifying. Lasso was used for selecting features and the classification and regression tree (CART) algorithm was used to construct the decision tree model. To examine the stability of the lasso decision tree, we performed bootstrap aggregating (Bagging) with 50 replications. The gene expression data used was an ovarian tumor dataset that has 1,545 observations, 10,935 gene features, and binary class. The findings of this research…
Citation impact
318
total citations
- FWCI
- —
- Percentile
- —
- References
- 23
Citations per year
Authors
4- ARAhuja, RakeshCorresponding
- ASAhuja, Sachin
- GDGupta, Deepali
- HMHaque, Mohd Junedul
Topics & keywords
Topics
Keywords
- Indonesian
- Computer science
- Science and engineering
- Engineering
- Engineering ethics
- Philosophy
UN Sustainable Development Goals
- Affordable and clean energy
No related works found for this paper.