articleBioinformaticsOct 28, 2011BRONZE OA

MissForest—non-parametric missing value imputation for mixed-type data

ETH Zurich · Life Science Zurich

PubMed
Indexed inarxivcrossrefdatacitedoajpubmed

Abstract

MOTIVATION: Modern data acquisition based on high-throughput technology is often facing the problem of missing data. Algorithms commonly used in the analysis of such large-scale data often depend on a complete set. Missing value imputation offers a solution to this problem. However, the majority of available imputation methods are restricted to one type of variable only: continuous or categorical. For mixed-type data, the different types are usually handled separately. Therefore, these methods ignore possible relations between variable types. We propose a non-parametric method which can cope with different types of variables simultaneously. RESULTS: We compare several state of the art methods for the…

Citation impact

6,010
total citations
FWCI
19.54
Percentile
100%
References
32
Citations per year

Authors

2

Topics & keywords

Keywords
  • Imputation (statistics)
  • Categorical variable
  • Missing data
  • Computer science
  • Random forest
  • Data mining
  • Parametric statistics
  • Type I and type II errors
UN Sustainable Development Goals
  • Life in Land
No related works found for this paper.