- Poster presentation
- Open Access
InteroPORC: an automated tool to predict highly conserved protein interaction networks
© Michaut et al; licensee BioMed Central Ltd 2008
- Published: 30 October 2008
- Target Species
- Protein Interaction Network
- Inference Process
- Cluster Link
- Interaction Dataset
We defined a new inference process, called InteroPorc, combining source interactions with clusters of orthologous proteins. The method is indeed based on the PORC data (Putative ORthologous Cluster) provided by Integr8. The Integr8 database systematically provides all sequenced genomes and their corresponding proteomes (currently 655 organisms). Consequently, these orthologous clusters are of paramount interest since they contain all sequenced organisms. The inference process consisted of two steps. First, we abstracted protein interactions onto orthologous cluster links. For a given source interaction, if both proteins belonged to a cluster, we constructed a link between these two clusters. In the second step, we projected these cluster links onto a specific target species. Practically, for a given link, if both clusters contained a protein from the target species, we predicted an interaction between these proteins.
We applied our automated prediction tool to the cyanobacteria Synechocystis. It enabled us to predict a new network of 1,463 protein-protein interactions when less than 200 interactions were experimentally annotated in the databases. In the same way, we predicted for instance 13,469 interactions for the rat.
This open-source application can either be run online through a web interface or downloaded at http://biodev.extra.cea.fr/interoporc/. To run the tool online, we have collected source interactions from the three manually curated databases IntAct, MINT and DIP. The user just has to indicate the taxonomy identifier of the species he/she is interested in. Running online usually takes two minutes. It is also possible to download the tool for stand-alone use to get more flexibility. For example, the source interaction dataset can be changed to use only highly relevant source interactions or private datasets. Moreover, this application can be run on all platforms since it has been developed in Java.
This tool is highly interesting to quickly get a raw picture of the protein interaction network of any sequenced organism. Moreover, it should greatly facilitate comparative studies since it provides a common method to predict protein interaction networks for lots of species in an automatic way. Finally, it is noteworthy that the method has been implemented separately from the interaction data used. Since the quality of the interactions is still a problem to be addressed, it is of great importance to be able to choose which interactions one would like to transfer.
This article is published under license to BioMed Central Ltd.