Indonesian Journal of Electrical Engineering and Computer Science

ARAhuja, RakeshASAhuja, SachinGDGupta, DeepaliHMHaque, Mohd Junedul
Indexed incrossref

Abstract

Classifying high-dimensional data are a challenging task in data mining. Gene expression data is a type of high-dimensional data that has thousands of features. The study was proposing a method to extract knowledge from highdimensional gene expression data by selecting features and classifying. Lasso was used for selecting features and the classification and regression tree (CART) algorithm was used to construct the decision tree model. To examine the stability of the lasso decision tree, we performed bootstrap aggregating (Bagging) with 50 replications. The gene expression data used was an ovarian tumor dataset that has 1,545 observations, 10,935 gene features, and binary class. The findings of this research…

Citation impact

318
total citations
FWCI
Percentile
References
23
Citations per year

Authors

4
  • AR
    Ahuja, RakeshCorresponding
  • AS
    Ahuja, Sachin
  • GD
    Gupta, Deepali
  • HM
    Haque, Mohd Junedul

Topics & keywords

Keywords
  • Indonesian
  • Computer science
  • Science and engineering
  • Engineering
  • Engineering ethics
  • Philosophy
UN Sustainable Development Goals
  • Affordable and clean energy
No related works found for this paper.

Funding