Functional module detection by functional flow pattern mining in protein interaction networks

Cho, Young-Rae; Shi, Lei; Zhang, Aidong

doi:10.1186/1471-2105-9-S10-O1

Volume 9 Supplement 10

Highlights from the Fourth International Society for Computational Biology (ISCB) Student Council Symposium

Oral presentation
Open access
Published: 30 October 2008

Functional module detection by functional flow pattern mining in protein interaction networks

Young-Rae Cho¹,
Lei Shi¹ &
Aidong Zhang¹

BMC Bioinformatics volume 9, Article number: O1 (2008) Cite this article

3553 Accesses
8 Citations
Metrics details

Background

A functional module has been defined as a group of molecules that participate in the same functional activities. Various graph-theoretic or data-mining techniques have been applied to discover functional modules from protein interaction networks [1]. However, their performance has been compromised by false-positive and false-negative interaction data and complex connectivity of the interaction networks. In our earlier study [2], we have introduced the functional flow-based approach to efficiently identify overlapping modules, which are generally large-sized, from interaction networks. In this abstract, we extend this approach by mining functional flow patterns for the purpose of detecting small-sized modules for specific functions.

Methods

Our approach includes three steps. First, we integrate the interaction network with semantic data from Gene Ontology [3] to generate a weighted interaction network, which is functionally reliable. Next, we simulate functional flow starting from selected informative proteins and identify primary modules for general-level functions [2]. As the last step, we obtain the set of functional flow patterns for each primary module by flow simulation from all nodes within the module. A functional flow pattern is defined as a sequence of quantities of functional influence of a source protein on target proteins. The coherent patterns are then captured by a pattern-based clustering algorithm [4] as final modules for specific-level functions. The significant assumption is that if two source proteins have similar functional flow patterns across all the other targets proteins, then they are likely to have the same function.

Results

We tested our flow-pattern clustering method using a sub-network, structured by the proteins having functions on Cell Cycle and DNA Processing and the interactions between them. The output modules were compared to the functional categories and their annotations from MIPS [5] using statistical p-value analysis (see Table 1). We assessed the performance of our algorithm comparing to two competing methods: the clique percolation method [6] as a density-based approach to find densely connected sub-graphs, and the betweenness-cut method [7] as a hierarchical approach to iteratively separate a graph and find the best partition. As a result, our algorithm had higher accuracy than the others by approximately 20% (see Table 2).

Table 1

Full size table

Table 2

Full size table

Conclusion

The modules, identified from protein interaction networks, provide an understanding of functional associations among proteins. In this study, we introduced a framework to detect functional modules in protein interaction networks. We demonstrated that our approach accurately handles the erroneous and complex networks.

References

Sharan R, Ulitsky I, Shamir R: Network-based prediction of protein function. Molecular Systems Biology 2007, 3: 88. 10.1038/msb4100129
Article PubMed Central PubMed Google Scholar
Cho Y-R, Hwang W, Ramanathan M, Zhang A: Semantic integration to identify overlapping functional modules in protein interaction networks. BMC Bioinformatics 2007, 8: 265. 10.1186/1471-2105-8-265
Article PubMed Central PubMed Google Scholar
Gene Ontology Consortium: The Gene Ontology project in 2008. Nucleic Acids Res 2008, 36(Database issue):D440-D444. 10.1093/nar/gkm883
Google Scholar
Wang H, Wang W, Yang J, Yu PS: Clustering by pattern similarity in large data sets. Proceedings of ACM SIGMOD International Conference on Management of Data 2002, 394–405.
Google Scholar
Mewes HW, Dietmann S, Frishman D, Gregory R, Mannhaupt G, Mayer KF, Munsterkotter M, Ruepp A, Spannagl M, Stumpflen V, Rattei T: MIPS: analysis and annotation of genome information in 2007. Nucleic Acids Res 2008, 36(Database issue):D196-D201. 10.1093/nar/gkm980
PubMed Central CAS PubMed Google Scholar
Palla G, Derenyi I, Farkas I, Vicsek T: Uncovering the overlapping community structure of complex networks in nature and society. Nature 2005, 435: 814–818. 10.1038/nature03607
Article CAS PubMed Google Scholar
Dunn R, Dudbridge F, Sanderson CM: The use of edge-betweenness clustering to investigate biological function in protein interaction networks. BMC Bioinformatics 2005, 6: 39. 10.1186/1471-2105-6-39
Article PubMed Central PubMed Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, State University of New York, Buffalo, NY, 14260, USA
Young-Rae Cho, Lei Shi & Aidong Zhang

Authors

Young-Rae Cho
View author publications
You can also search for this author in PubMed Google Scholar
Lei Shi
View author publications
You can also search for this author in PubMed Google Scholar
Aidong Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Young-Rae Cho.

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Cho, YR., Shi, L. & Zhang, A. Functional module detection by functional flow pattern mining in protein interaction networks. BMC Bioinformatics 9 (Suppl 10), O1 (2008). https://doi.org/10.1186/1471-2105-9-S10-O1

Download citation

Published: 30 October 2008
DOI: https://doi.org/10.1186/1471-2105-9-S10-O1

Highlights from the Fourth International Society for Computational Biology (ISCB) Student Council Symposium

Functional module detection by functional flow pattern mining in protein interaction networks

Background

Methods

Results

Conclusion

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

BMC Bioinformatics

Contact us

Highlights from the Fourth International Society for Computational Biology (ISCB) Student Council Symposium

Functional module detection by functional flow pattern mining in protein interaction networks

Background

Methods

Results

Conclusion

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Bioinformatics

Contact us