articleBioinformaticsJul 15, 2006BRONZE OA

Integrating structured biological data by Kernel Maximum Mean Discrepancy

Ludwig-Maximilians-Universität München · Max Planck Institute for Biological Cybernetics · +2 more institutions

PubMed
Indexed incrossrefdoajpubmed

Abstract

Results

We study the practical feasibility of an MMD-based test on three central data integration tasks: Testing cross-platform comparability of microarray data, cancer diagnosis, and data-content based schema matching for two different protein function classification schemas. In all of these experiments, including high-dimensional ones, MMD is very accurate in finding samples that were generated from the same distribution, and outperforms its best competitors.

Conclusions

We have defined a novel statistical test of whether two samples are from the same distribution, compatible with both multivariate and structured data, that is fast, easy to implement, and works well, as confirmed by our experiments. AVAILABILITY: http://www.dbs.ifi.lmu.de/~borgward/MMD.

No related works found for this paper.

Funding