MissForest—non-parametric missing value imputation for mixed-type data
ETH Zurich · Life Science Zurich
Indexed inarxivcrossrefdatacitedoajpubmed
Abstract
MOTIVATION: Modern data acquisition based on high-throughput technology is often facing the problem of missing data. Algorithms commonly used in the analysis of such large-scale data often depend on a complete set. Missing value imputation offers a solution to this problem. However, the majority of available imputation methods are restricted to one type of variable only: continuous or categorical. For mixed-type data, the different types are usually handled separately. Therefore, these methods ignore possible relations between variable types. We propose a non-parametric method which can cope with different types of variables simultaneously. RESULTS: We compare several state of the art methods for the…
Citation impact
6,010
total citations
- FWCI
- 19.54
- Percentile
- 100%
- References
- 32
Citations per year
Authors
2Topics & keywords
Topics
Keywords
- Imputation (statistics)
- Categorical variable
- Missing data
- Computer science
- Random forest
- Data mining
- Parametric statistics
- Type I and type II errors
UN Sustainable Development Goals
- Life in Land
No related works found for this paper.