Wednesday, September 24, 2008

Exploiting Domain Knowledge to Improve Biological Significance of Bi-clusters with Key Missing Genes

In an era of increasingly complex biological datasets, one of the key steps in gene functional analysis comes from clustering genes based on co-expression. Biclustering algorithms can identify gene clusters with local co-expressed patterns, which are more likely to define genes functioning together than global clustering methods. However, these algorithms are not effective in uncovering gene regulatory networks because the mined biclusters lack genes that may be critical in the function but may not be co-expressed with the clustered genes. In this project, we introduce a biclustering method called SKeleton Biclustering (SKB), which builds high quality biclusters from microarray data, creates relationships among the biclustered genes based on Gene Ontology annotations, and identifies genes that are missing in the biclusters. SKB thus defines inter-bicluster and intra-bicluster functional relationships. The delineation of functional relationships and incorporation of such missing genes may help biologists to discover biological processes that are important in a given study and provides clues for how the processes may be functioning together. Experimental results on yeast cell cycles and Arabidopsis cold-response microarray datasets show that, with SKB, the biological significance of the biclusters is considerably improved. 

--by Jin Chen, Liping Ji, Wynne Hsu, Kian-Lee Tan, Seung Rhee, ICDE, 2009

1 Comments:

Anonymous Anonymous said...

Good words.

7:52 AM  

Post a Comment

Subscribe to Post Comments [Atom]

<< Home