Discovering statistically significant biclusters in gene expression data

Tanay, Amos; Sharan, Roded; Shamir, Ron

doi:10.1093/bioinformatics/18.suppl_1.s136

articleBioinformaticsJul 1, 2002Closed access

Discovering statistically significant biclusters in gene expression data

ATAmos Tanay RSRoded Sharan RSRon Shamir

Tel Aviv University

PubMed

Indexed incrossrefdoajpubmed

Abstract

In gene expression data, a bicluster is a subset of the genes exhibiting consistent patterns over a subset of the conditions. We propose a new method to detect significant biclusters in large expression datasets. Our approach is graph theoretic coupled with statistical modelling of the data. Under plausible assumptions, our algorithm is polynomial and is guaranteed to find the most significant biclusters. We tested our method on a collection of yeast expression profiles and on a human cancer dataset. Cross validation results show high specificity in assigning function to genes based on their biclusters, and we are able to annotate in this way 196 uncharacterized yeast genes. We also demonstrate how the…

Citation impact

861

total citations

FWCI: 8.86
Percentile: 100%
References: 20

Citations per year

Authors

3

Topics & keywords

Topics

Keywords

Expression (computer science)
Gene expression
Computational biology
Gene
Biology
Computer science
Genetics
Data mining

No related works found for this paper.