articleBioinformaticsOct 1, 2002Closed access

The mutual information: Detecting and evaluating dependencies betweenvariables

University of Potsdam · Max Planck Society · +1 more institution

PubMed
Indexed incrossrefdoajpubmed

Abstract

MOTIVATION: Clustering co-expressed genes usually requires the definition of 'distance' or 'similarity' between measured datasets, the most common choices being Pearson correlation or Euclidean distance. With the size of available datasets steadily increasing, it has become feasible to consider other, more general, definitions as well. One alternative, based on information theory, is the mutual information, providing a general measure of dependencies between variables. While the use of mutual information in cluster analysis and visualization of large-scale gene expression data has been suggested previously, the earlier studies did not focus on comparing different algorithms to estimate the mutual information…

Citation impact

831
total citations
FWCI
2.30
Percentile
100%
References
22
Citations per year

Authors

5

Topics & keywords

Keywords
  • Mutual information
  • Computer science
  • Cluster analysis
  • Data mining
  • Euclidean distance
  • Similarity (geometry)
  • Distance measures
  • Measure (data warehouse)
No related works found for this paper.