Ortholog-based protein-protein interaction prediction and its application to inter-species interactions

Lee, Sheng-An; Chan, Cheng-hsiung; Tsai, Chi-Hung; Lai, Jin-Mei; Wang, Feng-Sheng; Kao, Cheng-Yan; Huang, Chi-Ying F

doi:10.1186/1471-2105-9-S12-S11

Volume 9 Supplement 12

Seventh International Conference on Bioinformatics (InCoB2008)

Research
Open access
Published: 12 December 2008

Ortholog-based protein-protein interaction prediction and its application to inter-species interactions

Sheng-An Lee^1,4,
Cheng-hsiung Chan¹,
Chi-Hung Tsai⁵,
Jin-Mei Lai⁶,
Feng-Sheng Wang⁷,
Cheng-Yan Kao^4,5 &
…
Chi-Ying F Huang^1,2,3,4

BMC Bioinformatics volume 9, Article number: S11 (2008) Cite this article

12k Accesses
68 Citations
Metrics details

Abstract

Background

The rapid growth of protein-protein interaction (PPI) data has led to the emergence of PPI network analysis. Despite advances in high-throughput techniques, the interactomes of several model organisms are still far from complete. Therefore, it is desirable to expand these interactomes with ortholog-based and other methods.

Results

Orthologous pairs of 18 eukaryotic species were expanded and merged with experimental PPI datasets. The contributions of interologs from each species were evaluated. The expanded orthologous pairs enable the inference of interologs for various species. For example, more than 32,000 human interactions can be predicted. The same dataset has also been applied to the prediction of host-pathogen interactions. PPIs between P. falciparum calmodulin and several H. sapiens proteins are predicted, and these interactions may contribute to the maintenance of host cell Ca²⁺ concentration. Using comparisons with Bayesian and structure-based approaches, interactions between putative HSP40 homologs of P. falciparum and the H. sapiens TNF receptor associated factor family are revealed, suggesting a role for these interactions in the interference of the human immune response to P. falciparum.

Conclusion

The PPI datasets are available from POINT http://point.bioinformatics.tw/ and POINeT http://poinet.bioinformatics.tw/. Further development of methods to predict host-pathogen interactions should incorporate multiple approaches in order to improve sensitivity, and should facilitate the identification of targets for drug discovery and design.

Background

Many genome-wide high throughput yeast two-hybrid analyses have generated PPI datasets for various model organisms. Moreover, systematic manual curation of human protein interactomes, including BioGRID [1], MIPS [2], IntAct [3], PINdb [4], DIP [5], HPRD [6] and MINT [7], has also generated significant, but far from complete, datasets. Therefore, in addition to an empirical screening of the interacting proteins of a given target, a comparative strategy should further facilitate functional annotation of uncharacterized proteins.

Using our knowledge of conserved interactions in other organisms (or interologs) [8] to elucidate the interacting networks of a particular target protein, we have previously established a publicly accessible and functional database, POINT (the Prediction Of INTeractome database) http://point.bioinformatics.tw/[9]. The application of a similar concept and the addition of further filtering criteria have recently been reported and, as a result, have produced many outstanding studies such as Ulysses [10], OPHID [11] and HomoMINT [12]. Recently, additional high-throughput yeast two-hybrid experiments have generated an enormous number of human PPIs [13, 14], which now require assessments of their accuracy [15] and further evaluations using the concept of interologs. Conversely, interologs may be used to estimate the reliability of high throughput observations.

It is expected that the interactions between conserved orthologs, which are conserved genes and gene products in different species, will be conserved as well. However, accurate human interolog predictions inferred from different species are much less abundant than expected [6, 12]. Additionally, some argue that interologs are less conserved than orthologs [12]. The extent to which ortholog-based PPI predictions can be applied has not been extensively analyzed.

In this work, orthologous pairs from 18 eukaryotic species have been expanded. Using experimental PPIs, interologs for these 18 species can be predicted and analyzed. This concept has been applied to host-pathogen PPI predictions. An analysis of predicted H. sapiens-P. falciparum interactions revealed PPIs that are highly related to the maintenance of Ca²⁺ levels in host cells. When comparing this method to other prediction methods, we find that this approach can complement Bayesian statistical methods [16] and structure-based methods [17].

Results and discussion

Orthologs shared by H. sapiens and other model organisms

The complete ortholog matrix from 18 eukaryotic species is shown in Additional File 1: Table S1. For brevity, only the orthologs between H. sapiens and five common model organisms are presented (Table 1). These orthologs were based on the HomoloGene database. Interologs were determined from the model organisms M. musculus (mouse), R. norvegicus (rat), D. melanogaster (fruit fly), C. elegans (worm) and S. cerevisiae (yeast).

Table 1 Numbers of ortholog shared by human and five model organisms

Full size table

Based on ortholog information, the conservation of genes and ortholog groups among 18 eukaryotic species were identified. We found 81 genes that were conserved in all 18 species presented in HomoloGene (Additional File 2: Table S2), suggesting that these genes are fundamental and/or vital to eukaryotes. Interestingly, 243 genes are missing in P. falciparum, but found in the other 17 species, including members of the proteosome, various ATP synthases and many mitochondria-related genes. While most species in the HomoloGene database share a high proportion of orthologs with other species (ranging from 48.3% in O. sativa to 87.4% in H. sapiens), less than 20% of the 5,266 genes in P. falciparum can be grouped with genes from other species. This suggests that the lifestyle and biological processes of this parasite deviate from those of other organisms.

PPIs in the POINT database

PPIs from the various model organisms were used to infer PPIs (interologs) in higher order organisms such as H. sapiens. Because experimental PPIs from the target organisms are needed to verify these inferred PPIs, collections of PPIs are essential for an ortholog-based approach. The POINT database has collected most of the available public PPI data for a range of organisms (Table 2). It contains more than 44,000 human PPIs with available ortholog information. In addition, more than 70,000 yeast interactions are available, suggesting that a considerable number of human interologs can be inferred. Most of these interactions were obtained from high-throughput techniques such as yeast two-hybrid screening, which is prone to a high rate of false positives and other errors. Within the high-confidence dataset, where only PPIs verified by two or more methods or reported in the literature two or more times are included, there are 28,559 human PPIs and 25,612 yeast PPIs with available ortholog information.

Table 2 Protein-protein interactions collected in the POINT database.

Full size table

While the use of high-confidence PPIs eliminates many potential PPIs that are present in the available datasets, this trimming process reduces the false positive rate. Among the organisms listed in Table 2, S. cerevisiae shows the most dramatic drop in the number of PPIs when only high-confidence PPIs are selected. The reason for this is obvious: this species is a single cell organism. Most of the PPI datasets were obtained using high-throughput approaches, and have not been verified by other methods or reported independently in the literature. For H. sapiens, the number of high-confidence PPIs exceeds even those of yeast. However, some species in the HomoloGene database do not have PPI data available. For example, P. troglodytes (Chimpanzee) and C. familiaris (dog) have no inferred human interologs despite the large number of orthologs they share with H. sapiens.

Interologs inferred from ortholog pairs

Given n objects in an undirected network (graph), there will be n(n-1)/2 relationships among these n objects and n*n relationships for a directed network. Since there are 19,491 human ortholog groups (Table 1), we therefore can assume that there are 19,491*(19,491-1)/2 pairwise interactions among these gene products. Certainly, a complete graph is not reasonable or biologically feasible. However, we can assume that each interaction can be associated with a probability and that the probability for a non-interacting pair will be 0. At this stage, we do not have enough information to assign a probability for each theoretical interaction. However, we can expand all 189,939,795 interactions among these 19,491 orthologous groups.

The interologs were inferred from ortholog information. Using the orthologous groups shared by humans and other species, we can obtain the maximum number of interologs from currently available interactomes. Only two orthologous groups shared by more than two species can be used to infer interologs. For example, if orthologous group A is shared by humans and mice, and orthologous group B is also shared by humans and mice, there will be a potential interolog A-B between humans and mice, although the probabilities associated with these two interactions (one in human and one in mouse) are not known.

Based on this assumption, we analyzed a number of orthologous group pairs and identified a number of species sharing these orthologous groups for H. sapiens (Additional File 3: Table S3). Among the 189,939,795 interactions, 180,191,177 interologs were inferred from ortholog information. This translates to 94.86% coverage of interologs (IC^HSA). Although the theoretical interolog coverage is high (nearly 95%), the interolog coverage on currently available PPIs is not significant. For all available human PPIs, only 3,859/44,251 interactions (8.72%) can be inferred from known interactions in other model organisms. Using the trimmed set of high-confidence PPIs, this coverage drops to 4.61% (1,316/28,559). There is an obvious gap between the theoretical upper boundary and the experimentally observed data.

To investigate the origin of this gap, we further analyzed the interolog coverage of each model organism. Five common model organisms were selected. The number of inferable interologs, experimental PPI derived interologs and their interolog coverage were calculated (Table 3 and Table S3). It is interesting that the most commonly used model organism, S. cerevisiae (yeast), has a theoretical interolog coverage of only 0.67% (interologs inferred from yeast divided by all human interactions), whereas the IC^HSAof M. musculus (mouse) and R. norvegicus (rat) are larger by two-orders of magnitude. However, for experimental human PPIs, the IC^HSAof mouse is only 2-fold higher than that of yeast, and the IC^HSAof rat is lower than that of yeast. The species contributions, C^Sp, shown in this table are also informative. While mouse contributes 43.07% of the known interologs, yeast contributes only 19.85%. This trend was mostly unchanged for high-confidence PPIs, except the contribution of yeast was boosted to 32.29%.

The mapping of all orthologous group pairs permits interolog prediction for various eukaryotic species. For example, in the POINeT web service http://poinet.bioinformatics.tw/, interologs can be inferred for seven eukaryotic species (H. sapiens, M. musculus, D. melangaster, C. elegans, S. cerevisiae, A. thaliana, and P. falciparum). Currently, more than 32,000 human interologs can be inferred. Among them, 3,859 have been confirmed by experimental evidence. The continual growth of interactomes in every eukaryotic species will continue to improve the ability to predict interologs.

Table 3 Contributions of model organisms to human theoretical and experimental interologs.

Full size table

Prediction of inter-species host-pathogen interactions

P. falciparum is a parasite with a complex life cycle, and this malarial parasite threatens millions of lives worldwide. Based on the HomoloGene database, P. falciparum has the least similar genome in comparison to other species. Only roughly 20% (990/5,266) of the genes share orthologous groups with other organisms. This suggests that many cellular processes vital to other eukaryotes may be missing or replaced in P. falciparum, and the interplay between the parasite and its two hosts may compensate for the functions missing in the P. falciparum genome. The interactome of P. falciparum has been determined experimentally [18] and modeled genome-wide [19]. This allows comparisons to be done between the genomes and interactomes of P. falciparum and its two hosts, H. sapiens and A. gambiae (the African malaria mosquito).

Using the experimental PPIs and interologs, 3,090 inter-species interactions between P. falciparum and H. sapiens (and not intra-P. falciparum interactions) were found (Additional File 4: Table S4). The Gene Ontology annotations of the P. falciparum and H. sapiens genes were identified. These inter-species PPIs have been grouped based on the ontology of their biological processes. The resulting network is illustrated in Figure 1. The nodes in Figure 1 are biological processes from P. falciparum and H. sapiens. Links between P. falciparum and H. sapiens biological processes were derived from interactions linking two genes that participate in the respective biological processes in the two species. Darker lines indicate the involvement of more interactions, allowing more interplay between the two biological processes. The P. falciparum biological processes are shaded using different levels of grey. Darker nodes indicate that more genes are involved in the process. In Figure 1, the metabolic processes and cellular processes of P. falciparum are most abundant in the host-parasite interaction network. This is understandable, since P. falciparum is a parasite and needs to acquire nutrients from the host erythrocyte. In the genomic-wide model of the P. falciparum interactome, only a small fraction of intra-P. falciparum interactions contributed to metabolic processes [19], which supports the notion that P. falciparum metabolic processeses may be dependant on human metabolic and cellular processes. There are also other interesting interactions between P. falciparum and the antimicrobial, antibacterial, cell killing and immune system processes of H. sapiens.

Filtering and analysis of predicted inter-species interactions

Although more than 3,000 H. sapiens-P. falciparum PPIs were inferred, not all of these interactions are likely to take place under physiological conditions due to spatiotemporal constraints. Filtering using gene ontology annotations resulted in 918 host-pathogen interactions. Further filtering of P. falciparum sequences using the presence/absence of translocational signals led to 95 PPIs (Figure 2). Only 15 P. falciparum proteins participate in these 95 PPIs (Table 4). One of the P. falciparum proteins, calmodulin (PF14_0323), interacts with 50 human proteins. It is well known that P. falciparum requires an environment with high Ca²⁺ levels [20], and the abundence of calmodulin-based interactions may help P. falciparum maintain this high Ca²⁺ concentration [21]. Among the 50 human proteins interacting with PF14_0323, 13 also interact with human calmodulin (CALM3). This suggests that P. falciparum calmoduin shares some of the targets of human calmodulin, and may hijack these PPIs for its own purpose. The protein with the second highest number of interactions was N-myristoyltransferase (PF14_0127). Many proteins interacting with calmodulin require myristoylation in N-terminal [22–24], further supports the functioning of the calmodulin-centric network.

Table 4 P. falciparum proteins participate in 95 PPIs filtered from 918 host-pathogen interactions.

Full size table

Previously, Dyer et al. [16] have inferred host-pathogen interactions using Bayesian statistics. H. sapiens-P. falciparum PPIs predicted by the Bayesian approach are mainly enriched in 'blood coagulation' and 'membrane integration' protein interactions. This may partly be due to the gene ontology terms used to filter the PPIs. It is difficult to compare the two works, since the datasets and methodology used are different. However, the intersection of the two datasets reveals 3 interactions between PF14_0359 and the TNF receptor associated factor family (TRAF1, TRAF2 and TRAF6). PF14_0359 is a hypothetical protein. Inspection of the HomoloGene database reveals that PF14_0359 may be a homolog of DNAJA1 (HSP40). The functional implications of these three interactions require further investigation. However, TNF associated factor family are known to be involved in host immune response, suggesting that P. falciparum may interfere with this defence mechanism in H. sapiens. All in all, the diversity of different host-pathogen interaction inference methods suggests that these and other approaches may complement each other. And further development of the ability to predict host-pathogen interactions may benefit from the combination of multiple diverse approaches.

Conclusion

The expansion of all orthologous pairs enables the inference of interologs for various eukaryotic organisms, as illustrated by POINeT http://poinet.bioinformatics.tw/. The same inference method can also be applied to the prediction of inter-species interaction, especially in the case of host-pathogen interactions. The H. sapiens-P. falciparum PPIs inferred in our work reveal that P. falciparum may utilize calcium modulating proteins in the host cell to maintain Ca²⁺ levels, and this may serve as a target for drug development strategies [25].

Methods

Ortholog information for interolog analysis

One of the limitations inherent in the analysis of interologs is the assignment of the orthologs, which is achieved using various BLAST algorithms together with several additional criteria [6, 9, 11, 26, 27] or from the NCBI HomoloGene and other protein/gene cluster databases. In this work, the ortholog information for each human gene was identified using the NCBI HomoloGene Release 54 [28]. The NCBI HomoloGene database contains homologous information for 18 eukaryotic organisms and has been augmented with homology and phenotype information drawn from various sources, e.g., MGI [29] and Fly base [30].

Collection of PPIs

The new version of POINT integrated several publicly accessible PPI datasets (Additional File 5: Table S5). These data sources have diverse entry formats, disparate ID systems and different protein symbols. The diversity of these datasets made the task of performing cross-site browsing or iterative querying very tedious and challenging. We systematically re-organized these datasets to improve and standardize the publicly accessible PPIs. High-throughput PPI datasets are prone to false positives and errors. Therefore, we also generate a relatively high-confidence PPI subset, which refers to a PPI subset where the PPIs have been verified by two or more experimental methods or published in the literature two or more times.

Evaluation of interolog coverage

The interolog coverage is quantifiable from an estimation of the ortholog-based PPI prediction power. The definition of interolog coverage is as follows:

I C^{H S A} = \frac{N}{T^{H S A}} \times 100 %

where T^HSAis the total number of human (H. sapiens) interactions (whether theoretical, experimental, or highly confident), N is the number of interologs, and IC^HSAis the interolog coverage for the human interactome. Another measure is the contribution of a given model organism to the human interologs and this is defined as

C^{S p} = \frac{I^{S p}}{T I^{H S A}} \times 100 %

where TI^HSAis the total number of human interologs, I^Spis the number of interologs inferred from species Sp, and C^Spis the contribution of species Sp to human interologs.

Inference and filtering of inter-species interactions

With the expanded orthologous pairs, intra- and inter-species PPIs can be inferred with ease. The inference of H. sapiens-P. falciparum interactions are based on orthologous pairs with one-side orthology to P. falciparum. For example, given a PPI between M_a and M_b in species M, if M_a has an ortholog in P. falciparum (P_a), and M_b has an ortholog H_b in H. sapiens (but not in P. falciparum), an interaction between P_a and H_b is inferred.

However, interologs inferred from orthologous pairs may not occur in vivo, especially in the case of inter-species interactions. P. falciparum inhabits a parasitophorous vacuole after its entry into the red blood cell. A translocational signal peptide (RELXE/Q) is required to translocate P. falciparum proteins into red blood cell cytoplasm for host-cell manipulation [31–33]. Also, proteins localized in the nucleus (both H. sapiens and P. falciparum) are not likely to participate in inter-species PPIs. Two filters have been applied to reduce such unlikely cases. The first filter utilizes gene ontology annotations. Human proteins with the following annotations were removed: mitochondria, nucleus, ribosome, cell process, helicase activity, complex, nuclease activity, nucleic acid binding, nucleotide binding or proteolysis. The second filter utilizes the translocation signal RELXE/Q, where X refers to any amino acids. P. falciparum sequences matching this pattern within the first 25% of its length are kept, since translocation signals are likely to appear at the N-terminal.

References

Stark C, Breitkreutz BJ, Reguly T, Boucher L, Breitkreutz A, Tyers M: BioGRID: a general repository for interaction datasets. Nucleic Acids Res 2006, 34: D535–539. 10.1093/nar/gkj109
Article PubMed Central CAS PubMed Google Scholar
Pagel P, Kovac S, Oesterheld M, Brauner B, Dunger-Kaltenbach I, Frishman G, Montrone C, Mark P, Stumpflen V, Mewes HW, et al.: The MIPS mammalian protein-protein interaction database. Bioinformatics 2005, 21: 832–834. 10.1093/bioinformatics/bti115
Article CAS PubMed Google Scholar
Hermjakob H, Montecchi-Palazzi L, Lewington C, Mudali S, Kerrien S, Orchard S, Vingron M, Roechert B, Roepstorff P, Valencia A, et al.: IntAct: an open source molecular interaction database. Nucleic Acids Res 2004, 32: D452–455. 10.1093/nar/gkh052
Article PubMed Central CAS PubMed Google Scholar
Luc PV, Tempst P: PINdb: a database of nuclear protein complexes from human and yeast. Bioinformatics 2004, 20: 1413–1415. 10.1093/bioinformatics/bth114
Article CAS PubMed Google Scholar
Salwinski L, Miller CS, Smith AJ, Pettit FK, Bowie JU, Eisenberg D: The Database of Interacting Proteins: 2004 update. Nucleic Acids Res 2004, 32: D449–451. 10.1093/nar/gkh086
Article PubMed Central CAS PubMed Google Scholar
Gandhi TK, Zhong J, Mathivanan S, Karthick L, Chandrika KN, Mohan SS, Sharma S, Pinkert S, Nagaraju S, Periaswamy B, et al.: Analysis of the human protein interactome and comparison with yeast, worm and fly interaction datasets. Nat Genet 2006, 38: 285–293. 10.1038/ng1747
Article CAS PubMed Google Scholar
Zanzoni A, Montecchi-Palazzi L, Quondam M, Ausiello G, Helmer-Citterich M, Cesareni G: MINT: a Molecular INTeraction database. FEBS Lett 2002, 513: 135–140. 10.1016/S0014-5793(01)03293-8
Article CAS PubMed Google Scholar
Walhout AJ, Sordella R, Lu X, Hartley JL, Temple GF, Brasch MA, Thierry-Mieg N, Vidal M: Protein interaction mapping in C. elegans using proteins involved in vulval development. Science 2000, 287: 116–122. 10.1126/science.287.5450.116
Article CAS PubMed Google Scholar
Huang TW, Tien AC, Huang WS, Lee YC, Peng CL, Tseng HH, Kao CY, Huang CY: POINT: a database for the prediction of protein-protein interactions based on the orthologous interactome. Bioinformatics 2004, 20: 3273–3276. 10.1093/bioinformatics/bth366
Article CAS PubMed Google Scholar
Kemmer D, Huang Y, Shah SP, Lim J, Brumm J, Yuen MM, Ling J, Xu T, Wasserman WW, Ouellette BF: Ulysses – an application for the projection of molecular interactions across species. Genome Biol 2005, 6: R106. 10.1186/gb-2005-6-12-r106
Article PubMed Central PubMed Google Scholar
Brown KR, Jurisica I: Online predicted human interaction database. Bioinformatics 2005, 21: 2076–2082. 10.1093/bioinformatics/bti273
Article CAS PubMed Google Scholar
Persico M, Ceol A, Gavrila C, Hoffmann R, Florio A, Cesareni G: HomoMINT: an inferred human network based on orthology mapping of protein interactions discovered in model organisms. BMC Bioinformatics 2005,6(Suppl 4):S21. 10.1186/1471-2105-6-S4-S21
Article PubMed Central PubMed Google Scholar
Stelzl U, Worm U, Lalowski M, Haenig C, Brembeck FH, Goehler H, Stroedicke M, Zenkner M, Schoenherr A, Koeppen S, et al.: A human protein-protein interaction network: a resource for annotating the proteome. Cell 2005, 122: 957–968. 10.1016/j.cell.2005.08.029
Article CAS PubMed Google Scholar
Rual JF, Venkatesan K, Hao T, Hirozane-Kishikawa T, Dricot A, Li N, Berriz GF, Gibbons FD, Dreze M, Ayivi-Guedehoussou N, et al.: Towards a proteome-scale map of the human protein-protein interaction network. Nature 2005, 437: 1173–1178. 10.1038/nature04209
Article CAS PubMed Google Scholar
Deane CM, Salwinski L, Xenarios I, Eisenberg D: Protein interactions: two methods for assessment of the reliability of high throughput observations. Mol Cell Proteomics 2002, 1: 349–356. 10.1074/mcp.M100037-MCP200
Article CAS PubMed Google Scholar
Dyer MD, Murali TM, Sobral BW: Computational prediction of host-pathogen protein-protein interactions. Bioinformatics 2007, 23: i159–166. 10.1093/bioinformatics/btm208
Article CAS PubMed Google Scholar
Davis FP, Barkan DT, Eswar N, McKerrow JH, Sali A: Host pathogen protein interactions predicted by comparative modeling. Protein Sci 2007, 16: 2585–2596. 10.1110/ps.073228407
Article PubMed Central CAS PubMed Google Scholar
LaCount DJ, Vignali M, Chettier R, Phansalkar A, Bell R, Hesselberth JR, Schoenfeld LW, Ota I, Sahasrabudhe S, Kurschner C, et al.: A protein interaction network of the malaria parasite Plasmodium falciparum. Nature 2005, 438: 103–107. 10.1038/nature04104
Article CAS PubMed Google Scholar
Date SV, Stoeckert CJ Jr: Computational modeling of the Plasmodium falciparum interactome reveals protein function on a genome-wide scale. Genome Res 2006, 16: 542–549. 10.1101/gr.4573206
Article PubMed Central CAS PubMed Google Scholar
Tromans A: Malaria: the calcium connection. Nature 2004, 429: 253. 10.1038/429253a
Article CAS PubMed Google Scholar
Gazarini ML, Thomas AP, Pozzan T, Garcia CR: Calcium signaling in a low calcium environment: how the intracellular malaria parasite solves the problem. J Cell Biol 2003, 161: 103–110. 10.1083/jcb.200212130
Article PubMed Central CAS PubMed Google Scholar
Matsubara M, Titani K, Taniguchi H, Hayashi N: Direct involvement of protein myristoylation in myristoylated alanine-rich C kinase substrate (MARCKS)-calmodulin interaction. J Biol Chem 2003, 278: 48898–48902. 10.1074/jbc.M305488200
Article CAS PubMed Google Scholar
Hayashi N, Nakagawa C, Ito Y, Takasaki A, Jinbo Y, Yamakawa Y, Titani K, Hashimoto K, Izumi Y, Matsushima N: Myristoylation-regulated direct interaction between calcium-bound calmodulin and N-terminal region of pp60v-src. J Mol Biol 2004, 338: 169–180. 10.1016/j.jmb.2004.02.041
Article CAS PubMed Google Scholar
Matsubara M, Jing T, Kawamura K, Shimojo N, Titani K, Hashimoto K, Hayashi N: Myristoyl moiety of HIV Nef is involved in regulation of the interaction with calmodulin in vivo. Protein Sci 2005, 14: 494–503. 10.1110/ps.04969605
Article PubMed Central CAS PubMed Google Scholar
Scheibel LW, Colombani PM, Hess AD, Aikawa M, Atkinson CT, Milhous WK: Calcium and calmodulin antagonists inhibit human malaria parasites (Plasmodium falciparum): implications for drug design. Proc Natl Acad Sci USA 1987, 84: 7310–7314. 10.1073/pnas.84.20.7310
Article PubMed Central CAS PubMed Google Scholar
Goffard N, Garcia V, Iragne F, Groppi A, de Daruvar A: IPPRED: server for proteins interactions inference. Bioinformatics 2003, 19: 903–904. 10.1093/bioinformatics/btg091
Article CAS PubMed Google Scholar
von Mering C, Jensen LJ, Snel B, Hooper SD, Krupp M, Foglierini M, Jouffre N, Huynen MA, Bork P: STRING: known and predicted protein-protein associations, integrated and transferred across organisms. Nucleic Acids Res 2005, 33: D433–437. 10.1093/nar/gki005
Article PubMed Central CAS PubMed Google Scholar
Wheeler DL, Barrett T, Benson DA, Bryant SH, Canese K, Chetvernin V, Church DM, DiCuccio M, Edgar R, Federhen S, et al.: Database resources of the National Center for Biotechnology Information. Nucleic Acids Res 2007, 35: D5–12. 10.1093/nar/gkl1031
Article PubMed Central CAS PubMed Google Scholar
Blake JA, Eppig JT, Bult CJ, Kadin JA, Richardson JE: The Mouse Genome Database (MGD): updates and enhancements. Nucleic Acids Res 2006, 34: D562–567. 10.1093/nar/gkj085
Article PubMed Central CAS PubMed Google Scholar
Grumbling G, Strelets V: FlyBase: anatomical data, images and queries. Nucleic Acids Res 2006, 34: D484–488. 10.1093/nar/gkj068
Article PubMed Central CAS PubMed Google Scholar
Hiller NL, Bhattacharjee S, van Ooij C, Liolios K, Harrison T, Lopez-Estrano C, Haldar K: A host-targeting signal in virulence proteins reveals a secretome in malarial infection. Science 2004, 306: 1934–1937. 10.1126/science.1102737
Article CAS PubMed Google Scholar
Marti M, Good RT, Rug M, Knuepfer E, Cowman AF: Targeting malaria virulence and remodeling proteins to the host erythrocyte. Science 2004, 306: 1930–1933. 10.1126/science.1102452
Article CAS PubMed Google Scholar
Przyborski J, Lanzer M: Parasitology. The malarial secretome. Science 2004, 306: 1897–1898. 10.1126/science.1107072
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This research was supported by grants from the Program for Promoting Academic Excellence of Universities (National Yang Ming University) to CYH and the National Science Council (Program for Interdisciplinary Research Project: NSC95-2627-B-400-002 to CYH, NSC95-2627-B-030-001 to JML, NSC96-2627-B-194-001 to FSW, and NSC95-2627-B-002-011 to CYK). This study was also partly funded by the "Gene Diagnostic Service Model Research and Niche Market Analysis" project of the Institute of Information Industry, supported by the Ministry of Economy Affairs of the Republic of China.

This article has been published as part of BMC Bioinformatics Volume 9 Supplement 12, 2008: Asia Pacific Bioinformatics Network (APBioNet) Seventh International Conference on Bioinformatics (InCoB2008). The full contents of the supplement are available online at http://www.biomedcentral.com/1471-2105/9?issue=S12.

Author information

Authors and Affiliations

Institute of Clinical Medicine, National Yang-Ming University, Taipei 112, Taiwan
Sheng-An Lee, Cheng-hsiung Chan & Chi-Ying F Huang
Institute of Bio-Pharmaceutical Sciences, National Yang-Ming University, Taipei 112, Taiwan
Chi-Ying F Huang
Institute of Biotechnology in Medicine, National Yang-Ming University, Taipei 112, Taiwan
Chi-Ying F Huang
Department of Computer Science and Information Engineering, National Taiwan University, Taipei 10617, Taiwan
Sheng-An Lee, Cheng-Yan Kao & Chi-Ying F Huang
Institute for Information Industry, Taipei, Taiwan
Chi-Hung Tsai & Cheng-Yan Kao
Department of Life Science, Fu-Jen Catholic University, Taipei Hsien 242, Taiwan
Jin-Mei Lai
Department of Chemical Engineering, National Chung Cheng University, Chia-Yi 621, Taiwan
Feng-Sheng Wang

Authors

Sheng-An Lee
View author publications
You can also search for this author in PubMed Google Scholar
Cheng-hsiung Chan
View author publications
You can also search for this author in PubMed Google Scholar
Chi-Hung Tsai
View author publications
You can also search for this author in PubMed Google Scholar
Jin-Mei Lai
View author publications
You can also search for this author in PubMed Google Scholar
Feng-Sheng Wang
View author publications
You can also search for this author in PubMed Google Scholar
Cheng-Yan Kao
View author publications
You can also search for this author in PubMed Google Scholar
Chi-Ying F Huang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Cheng-Yan Kao or Chi-Ying F Huang.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

CYH, CYK, FSW, and JML provide the concept and guidelines for the POINT/POINeT web servers. SAL collects and analyzes the protein-protein interaction and ortholog data, and predicts the inter-species interaction data. CHC proposes the inter-species and host-pathogen concept and writes the manuscript. CHT provides the literature about P. falciparum.

Sheng-An Lee, Cheng-hsiung Chan contributed equally to this work.

Electronic supplementary material

Additional file 1: Table S1. The orthologous group coverage among 18 eukaryotic species. (XLS 44 KB)

Additional file 2: Table S2. Orthologous groups conserved in multiple species. (XLS 78 KB)

Additional file 3: Table S3. Interlog coverage and contributions for each species. (XLS 31 KB)

Additional file 4: Table S4. Predicted inter-species interactions between P. falciparum and H. sapiens. (XLS 428 KB)

Additional file 5: Table S5. Protein-protein interaction data sources. (XLS 30 KB)

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Lee, SA., Chan, Ch., Tsai, CH. et al. Ortholog-based protein-protein interaction prediction and its application to inter-species interactions. BMC Bioinformatics 9 (Suppl 12), S11 (2008). https://doi.org/10.1186/1471-2105-9-S12-S11

Download citation

Published: 12 December 2008
DOI: https://doi.org/10.1186/1471-2105-9-S12-S11

Seventh International Conference on Bioinformatics (InCoB2008)

Ortholog-based protein-protein interaction prediction and its application to inter-species interactions