Open Access

Predicting the Interactome of Xanthomonas oryzae pathovar oryzae for target selection and DB service

  • Jeong-Gu Kim1,
  • Daeui Park2,
  • Byoung-Chul Kim2,
  • Seong-Woong Cho2,
  • Yeong Tae Kim1,
  • Young-Jin Park1,
  • Hee Jung Cho1,
  • Hyunseok Park3,
  • Ki-Bong Kim4,
  • Kyong-Oh Yoon5,
  • Soo-Jun Park6,
  • Byoung-Moo Lee1 (corresponding author) and
  • Jong Bhak2 (corresponding author)
Contributed equally
BMC Bioinformatics 2008, 9:41

DOI: 10.1186/1471-2105-9-41

Received: 18 July 2007

Accepted: 24 January 2008

Published: 24 January 2008



Protein-protein interactions (PPIs) play key roles in various cellular functions. In addition, some critical inter-species interactions, such as host-pathogen interactions and pathogenicity, occur through PPIs. Phytopathogenic bacteria infect hosts through attachment to host tissue, secretion of enzymes, production of exopolysaccharides, release of toxins, acquisition of iron, and secretion of effector proteins. Many of these mechanisms involve some kind of protein-protein interaction in the host. Our first aim was to predict the full set of protein interaction pairs (the interactome) of Xanthomonas oryzae pathovar oryzae (Xoo), an important pathogenic bacterium that causes bacterial blight (BB) in rice. Using whole-genome PPI prediction algorithms, we developed a detection protocol to find host proteins that possibly interact with Xoo proteins. The second aim was to build a DB server and a bioinformatic procedure for finding target proteins in Xoo, in order to develop pesticides that block host-pathogen protein interactions within critical biochemical pathways.


A PPI network of the Xoo proteome was predicted with three bioinformatics algorithms: PSIMAP, PEIMAP, and iPfam. We present the resulting species-specific interaction network and host-pathogen interactions as XooNET. It is a comprehensive initial set of predicted PPI data for Xoo. Experimentalists can use XooNET to pick protein targets for blocking pathological interactions. XooNET uses most of the major types of PPI prediction algorithms: 1) Protein Structural Interactome MAP (PSIMAP), a method using the structural domains of SCOP; 2) Protein Experimental Interactome MAP (PEIMAP), a common method using public resources of experimental protein interaction information such as HPRD, BIND, DIP, MINT, IntAct, and BioGrid; and 3) domain-domain interactions, a method using Pfam domains such as iPfam. Additionally, XooNET provides information on network properties of the Xoo interactome.


XooNET is an open and free public database server of protein interaction information for Xoo. It contains 4,538 proteins and 26,932 possible interactions, consisting of 18,503 (PSIMAP), 3,118 (PEIMAP), and 8,938 (iPfam) pairs. In addition, XooNET provides 3,407 possible interaction pairs between 141 Xoo proteins that are predicted to be membrane proteins and the rice proteome. The interacting partners of a query protein, as well as the interaction networks, can be easily retrieved through graphical web interfaces. XooNET is freely available from


Proteins constitute 50 percent or more of the dry weight of living organisms. They have the most diverse biological roles, and they function by interacting with other molecules, including other proteins. Protein-protein interactions are usually the key mechanisms of normal and pathological functions of living cells. Recently, genome-scale identifications of PPIs in model organisms such as Saccharomyces cerevisiae [1–3] and Escherichia coli [4] have been reported, mapping their protein-protein interaction networks. However, little is known about the PPIs of phytopathogens and their molecular interactions with hosts. Generally, a phytopathogenic bacterium invades hosts in the following steps: attachment to the host tissue, secretion of degradation enzymes, production of exopolysaccharides, release of toxins, acquisition of iron, and secretion of effector proteins [5]. Flor [6] proposed the gene-for-gene theory, in which a PPI between an effector protein from the pathogen and a specific receptor in the plant host results in the hypersensitive response and resistance. Rossier et al. [7] proposed a model for the role of Xanthomonas campestris pv. vesicatoria Hrp proteins in type III secretion and interaction with its plant hosts. Later, Alegria et al. [8, 9] showed by yeast two-hybrid experiments that PPIs are critical in the Hrp type III and type IV secretion systems of Xanthomonas axonopodis pv. citri. There are a few reports on PPIs involving the effector protein AvrBs3 of Xanthomonas campestris pv. vesicatoria [10, 11].

Rice (Oryza sativa) is one of the major crops in the world, and bacterial blight (BB) causes huge yield losses (as high as 50% in severely infested fields [12]). The genome of Xoo, the rice pathogen causing BB, has been completely sequenced [GenBank: AE013598] [13]. The first report on Xoo PPIs, based on glutathione-bead binding experiments, included PPIs of several Hrp proteins [14].

Although one report showed that some Xoo insertion mutants of unknown or hypothetical protein genes had altered pathogenicity [15], finding all the proteins and interactions involved in Xoo's pathogenicity remains a distant goal. It is also expensive and time-consuming to carry out interaction experiments for the whole organism. This led us to develop XooNET, which provides guidance in targeting pathogenic proteins and their interactions.

In XooNET, predicted PPI information involving Hrp proteins can provide additional functional information. For example, Xa21, a resistance gene of rice, has been reported [16], but its corresponding Avr protein is yet to be reported. In such a case, the predicted PPIs of Xoo can lead users to the function of the effector proteins and finally to the target Avr protein(s). Some pesticides are registered and used against Xoo. However, they were not developed for specific targets and hence are not very effective. The PPI network information of Xoo can help researchers detect more specific drug targets and increase pesticide potency.

Construction and Content

PSIMAP-based interactions

4,538 proteins of Xoo were retrieved from NCBI and were aligned with SCOP [17] domains using the PSI-BLAST [18] algorithm with a common expect value (E-value) cut-off of 0.001 [19]. By applying SCOP domain interaction pairs obtained from the PSIMAP [20] based interaction information database, PSIbase [21], 18,503 predicted PPIs were obtained for 1,862 Xoo proteins. This was around 41% of the total Xoo proteins.
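The inference step described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the PSIMAP or PSIbase implementation: the function name predict_ppis, the protein identifiers, and the SCOP family labels are hypothetical, and the domain assignment (PSI-BLAST with an E-value cut-off of 0.001) is assumed to have been done already.

```python
from itertools import combinations_with_replacement

def predict_ppis(protein_domains, domain_pairs):
    """Predict protein pairs whose assigned domains form a known interacting domain pair."""
    known = {frozenset(pair) for pair in domain_pairs}
    predicted = set()
    # combinations_with_replacement also yields (p, p), so self-interactions can be predicted.
    for a, b in combinations_with_replacement(sorted(protein_domains), 2):
        for dom_a in protein_domains[a]:
            for dom_b in protein_domains[b]:
                if frozenset((dom_a, dom_b)) in known:
                    predicted.add((a, b))
    return predicted

# Hypothetical toy input: protein -> set of SCOP families assigned to it.
protein_domains = {
    "XooA": {"c.37.1"},
    "XooB": {"a.4.5"},
    "XooC": {"b.40.4"},
}
# Hypothetical interacting domain pairs, standing in for the PSIbase pair list.
domain_pairs = [("c.37.1", "a.4.5"), ("b.40.4", "b.40.4")]

ppis = predict_ppis(protein_domains, domain_pairs)
print(sorted(ppis))
```

Because a domain family can be listed as interacting with itself, a sketch of this kind predicts self-interactions for any protein carrying such a family, whether or not the protein actually self-associates.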

PEIMAP-based interactions

The same 4,538 proteins of Xoo were aligned with the proteins in PEIMAP using the BLASTP [18] algorithm with cut-offs of 40% sequence identity and 80% length coverage. PEIMAP includes PPI information from six popular source databases: DIP (Database of Interacting Proteins) [22], BIND (Biomolecular Interaction Network Database) [23], IntAct (Database system and analysis tools for protein interaction data) [24], MINT (Molecular Interactions Database) [25], HPRD (Human Protein Reference Database) [26], and BioGrid (A general repository for interaction datasets) [27]. By applying PEIMAP interaction pairs, 3,118 predicted PPIs were obtained for 629 Xoo proteins. These proteins represent around 14% of the total Xoo proteins.
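The homology-transfer idea in this step can be sketched as follows. This is an illustrative sketch only, not the PEIMAP code: the helper names, the toy hits, and the identifiers are hypothetical, and "length coverage" is assumed here to mean the aligned length divided by the query length, which the text does not specify.

```python
def passes_cutoff(identity_pct, align_len, query_len,
                  min_identity=40.0, min_coverage=80.0):
    """Apply the identity and coverage cut-offs to one BLASTP hit.

    Assumption: coverage = aligned length / query length, in percent.
    """
    coverage_pct = 100.0 * align_len / query_len
    return identity_pct >= min_identity and coverage_pct >= min_coverage

def transfer_interactions(hits, known_pairs):
    """Map experimental pairs onto Xoo proteins through accepted homologs."""
    homologs = {}  # reference protein -> Xoo proteins accepted as its homologs
    for xoo_id, ref_id, identity, align_len, query_len in hits:
        if passes_cutoff(identity, align_len, query_len):
            homologs.setdefault(ref_id, set()).add(xoo_id)
    predicted = set()
    for ref_a, ref_b in known_pairs:
        for xoo_a in homologs.get(ref_a, ()):
            for xoo_b in homologs.get(ref_b, ()):
                predicted.add(tuple(sorted((xoo_a, xoo_b))))
    return predicted

# Hypothetical hits: (Xoo protein, reference protein, % identity, aligned length, query length)
hits = [
    ("Xoo1", "P1", 55.0, 190, 200),  # accepted: 55% identity, 95% coverage
    ("Xoo2", "P2", 42.0, 170, 200),  # accepted: 42% identity, 85% coverage
    ("Xoo3", "P2", 38.0, 200, 200),  # rejected: identity below 40%
    ("Xoo4", "P3", 60.0, 100, 200),  # rejected: coverage below 80%
]
known_pairs = [("P1", "P2"), ("P2", "P3")]  # hypothetical experimental interactions

predicted = transfer_interactions(hits, known_pairs)
print(predicted)
```

Only the pair supported by two accepted homologs survives, which illustrates why coverage depends on how well the experimental databases represent proteins similar to those of Xoo.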

Calculating Interactions based on iPfam

Pfam [28] domains were assigned to all the Xoo proteins with hmmpfam, using an E-value cut-off of 0.01. By integrating these assignments with Pfam domain interaction pairs from iPfam [29], 8,938 predicted protein-protein interactions were constructed for 1,362 proteins, comprising approximately 30% of the Xoo proteins.
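As a quick arithmetic check, the coverage percentages quoted for the three methods (about 41%, 14%, and 30%) follow from the protein counts given in this section. The variable names below are only illustrative.

```python
# Counts quoted in the text: total Xoo proteins and proteins with at least
# one predicted interaction per method.
TOTAL_XOO_PROTEINS = 4538
proteins_with_predictions = {"PSIMAP": 1862, "PEIMAP": 629, "iPfam": 1362}

coverage_pct = {
    method: round(100 * count / TOTAL_XOO_PROTEINS, 1)
    for method, count in proteins_with_predictions.items()
}

for method, pct in coverage_pct.items():
    print(f"{method}: {pct}% of Xoo proteins")
```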

Selecting High-confidence interactions

As a filter, we used the 'combined score' between any pair of proteins predicted by the PEIMAP, PSIMAP, and iPfam algorithms. As a result, we selected 684 Xoo proteins participating in 2,494 high-confidence PPIs (combined score > 0.6) that were found by all three methods: PSIMAP, PEIMAP, and iPfam. These were further rescaled into the confidence range from 0.0 to 1.0 by combining all the scores (the results were visualized in the Java applet viewer of a modified Integrator program).
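One possible reading of this filter is sketched below. The text does not define how the combined score is computed or how the rescaling is done, so both are assumptions here: the combined score is taken as a given input, and a simple min-max rescaling stands in for the unspecified rescaling step. The function name and toy data are hypothetical.

```python
def high_confidence(scores, support, threshold=0.6,
                    methods=("PSIMAP", "PEIMAP", "iPfam")):
    """Keep pairs above the score threshold that are supported by every method,
    then min-max rescale the kept scores to the range 0.0 to 1.0 (an assumption)."""
    required = set(methods)
    kept = {
        pair: score
        for pair, score in scores.items()
        if score > threshold and required <= support.get(pair, set())
    }
    if not kept:
        return {}
    low, high = min(kept.values()), max(kept.values())
    if high == low:
        return {pair: 1.0 for pair in kept}
    return {pair: (score - low) / (high - low) for pair, score in kept.items()}

# Hypothetical combined scores and per-pair method support.
all_methods = {"PSIMAP", "PEIMAP", "iPfam"}
scores = {("A", "B"): 0.9, ("A", "C"): 0.7, ("B", "C"): 0.8, ("C", "D"): 0.5}
support = {
    ("A", "B"): all_methods,
    ("A", "C"): all_methods,
    ("B", "C"): {"PSIMAP", "iPfam"},  # dropped: not found by PEIMAP
    ("C", "D"): all_methods,          # dropped: score not above 0.6
}

selected = high_confidence(scores, support)
print(selected)
```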

Predicting PPIs between Xoo and Rice

Oryza sativa is known as the sole host of Xoo. Therefore, we added 3,407 PPI predictions between Xoo and rice (Oryza sativa japonica and Oryza sativa indica). We chose 354 Xoo proteins expected to be membrane proteins or extracellular proteins using GO-Slim [30]. With these data and the PSIMAP, PEIMAP, and iPfam algorithms, we predicted interactions between Xoo and Oryza sativa japonica (1,269/26,887) or Oryza sativa indica (18/118). As a result, we predicted that 141 Xoo proteins have 3,407 interaction pairs with rice (PEIMAP: 25; PSIMAP: 2,266; iPfam: 2,124). We evaluated many different thresholds of PSI-BLAST and hmmpfam for domain assignment, and the most adequate ones were 10e-4 for PSIMAP, 40% identity and 70% coverage for PEIMAP, and 10e-2 for iPfam.


XooNET can be accessed by gene symbols, gene descriptions, locus tags, and NCBI gi numbers to find gene information and interacting partners not only of Xoo but also of Oryza sativa. Users can also input amino acid sequences. In addition to giving users the functional category of gene sets, XooNET provides the tree of gene ontology annotation using GO-Slim. Figure 1 shows the search interface and the result.
Figure 1

XooNET system and interfaces. (a) XooNET integrates four complementary protein-protein interaction databases including PSIMAP, PEIMAP, and iPfam. It shows three search interfaces: (1) search in high-confidence PPI network, (2) keyword and sequence search, and (3) functionally categorized tree navigation of gene ontology annotation. (b) A search result showing the list of predicted interacting proteins, supporting databases, and their synonymous IDs.


Public interaction databases such as BIND and DIP currently contain limited information on the PPIs of Xoo. Therefore, PEIMAP, which is an integrated resource of experimental PPI data, covers only about 14% of the total Xoo proteins. We found that some PPI pairs reported in experiments (Jang et al., 2007; Kim et al., unpublished) were not predicted by XooNET using the updated PEIMAP algorithm. These cases include interactions between HrpB1 and RecA, HrpB2 and RecA, HrpB5 and XorII, Hpa2 and RecA, AvrBs2 and HpaP, and AvrBs2 and HrcQ. This shows that the prediction capability of XooNET is still limited for newly discovered protein interactions. Conversely, XooNET predicted that AvrBs2 interacts with itself, whereas a yeast two-hybrid assay showed no self-interaction (Kim et al., unpublished). Thus, to extend the coverage of XooNET, we added a field where users can enter newly confirmed experimental interaction information.

Avr proteins are known to be crucial effectors that make many bacterial species pathogenic. We found 15 annotated AvrBs3 homologues in Xoo that fall into three groups according to their interaction promiscuity, that is, their number of predicted interaction partners: group 1, zero or one partner; group 2, more than 60 partners; and group 3, around 10 partners. The numerous partners of the highly interactive group are functionally related to pathogenicity and can be subdivided further. This shows that PPI analysis can assist researchers in discovering new targets and in designing more systematic experiments. One such highly interacting protein is Xoo1125, a hypothetical protein with over 60 interaction partners including the Avr proteins. In a separate experiment, a transposon insertion mutation in Xoo1125 caused loss of pathogenicity. This suggests that the XooNET approach is useful for investigating the functions of unknown or hypothetical proteins in Xanthomonas oryzae pathovar oryzae.
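The grouping by interaction promiscuity can be reproduced from any list of predicted pairs by counting distinct partners per protein. The sketch below is illustrative only: the function names and toy network are hypothetical, self-interactions are assumed not to count as partners, and because the text gives no exact boundaries for "around 10", every count between 2 and 60 is placed in group 3 here.

```python
from collections import defaultdict

def partner_counts(pairs):
    """Count distinct interaction partners per protein (self-pairs ignored)."""
    partners = defaultdict(set)
    for a, b in pairs:
        if a != b:
            partners[a].add(b)
            partners[b].add(a)
    return {protein: len(found) for protein, found in partners.items()}

def promiscuity_group(n_partners):
    """Group 1: zero or one partner; group 2: more than 60; group 3: the rest."""
    if n_partners <= 1:
        return 1
    if n_partners > 60:
        return 2
    return 3

# Hypothetical toy network: one hub with 61 partners, one protein with 10 partners.
pairs = [("hub", f"p{i}") for i in range(61)] + [("mid", f"p{i}") for i in range(10)]
counts = partner_counts(pairs)

for protein in ("hub", "mid", "p60", "absent"):
    n = counts.get(protein, 0)  # proteins with no predicted pair have zero partners
    print(protein, n, "-> group", promiscuity_group(n))
```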


XooNET is an integrated database of mutually complementary protein-protein interaction databases: PSIMAP, PEIMAP, and iPfam. The XooNET server is the first specialized Xoo PPI database that provides information on possible interacting partners of query proteins. In particular, as only one third of the Xoo proteome is fully annotated, there are still many hypothetical and unknown proteins. XooNET provides a platform for biologists to annotate them by predicting their interaction partners and looking into their pathways.


PSIMAP Algorithm

The basic procedure of PSIMAP is to infer interactions between proteins by using their homologs. Interactions among domains or proteins in known PDB (Protein Data Bank) structures are the basis for the prediction. If an unknown query protein is homologous to a domain with known interactions, PSIMAP assumes that the query tends to interact with its homolog's partners. This concept is called 'homologous interaction'. The original interaction between two proteins or domains is determined from the Euclidean distance between them in the structure. Therefore, PSIMAP gives a structure-based interaction prediction [20].
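A distance-based interaction test of this kind can be sketched as below. The text says only that the original interactions are based on Euclidean distance, so the specific rule used here (at least 5 atom pairs within 5 Å) is an illustrative assumption, as are the function names and the toy coordinates.

```python
import math

def count_contacts(coords_a, coords_b, cutoff=5.0):
    """Count atom pairs from two domains that lie within the distance cut-off."""
    return sum(
        1
        for atom_a in coords_a
        for atom_b in coords_b
        if math.dist(atom_a, atom_b) <= cutoff
    )

def domains_interact(coords_a, coords_b, cutoff=5.0, min_contacts=5):
    """Call two domains interacting when enough atom pairs are close together.

    Both thresholds are assumed values, not taken from the text.
    """
    return count_contacts(coords_a, coords_b, cutoff) >= min_contacts

# Hypothetical coordinates in angstroms.
domain_a = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.0)]
domain_b = [(0.0, 3.0, 0.0), (1.0, 3.0, 0.0), (50.0, 0.0, 0.0)]
domain_c = [(50.0, 0.0, 0.0)]

print(count_contacts(domain_a, domain_b))
print(domains_interact(domain_a, domain_b), domains_interact(domain_a, domain_c))
```

Pairs of domains that pass such a test in known structures form the pair list that is then transferred to homologous query proteins.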

PEIMAP Algorithm

PEIMAP (Protein Experimental Interactome MAP) was constructed by combining several experimental protein-protein interaction databases. We carried out a redundancy check to remove identical protein sequences from the source interaction databases. At present, it contains 116,773 proteins and 229,799 interactions.
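The redundancy check can be pictured as collapsing entries with identical sequences onto one representative and remapping the interactions accordingly. The sketch below assumes exact string identity of sequences and uses hypothetical identifiers; how PEIMAP chooses representatives is not described in the text.

```python
def merge_identical(sequences, interactions):
    """Collapse proteins with identical sequences and remap their interactions.

    Returns (representative per protein, set of non-redundant pairs).
    The representative is the alphabetically first ID sharing the sequence.
    """
    rep_by_sequence = {}
    rep_of = {}
    for protein_id in sorted(sequences):
        rep_of[protein_id] = rep_by_sequence.setdefault(sequences[protein_id], protein_id)
    merged = {tuple(sorted((rep_of[a], rep_of[b]))) for a, b in interactions}
    return rep_of, merged

# Hypothetical entries from different source databases; two share one sequence.
sequences = {"DIP:1": "MKT", "MINT:9": "MKT", "BIND:4": "MAS"}
interactions = [("DIP:1", "BIND:4"), ("BIND:4", "MINT:9")]

rep_of, merged = merge_identical(sequences, interactions)
print(rep_of)
print(merged)
```

In this toy case the two source interactions collapse into a single non-redundant pair.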




We thank our colleagues at KOBIC, especially, Woo-yeon Kim and SungHun Lee. This project was supported by a grant from the KRIBB Research Initiative Program of Korea and R01-2004-000-10172-0 grant of KOSEF, by the grant from the NIAB 05-4-12-4-2 and by NIAB 07-4-21-22-1 (BioGreen21 20070501034003 from RDA).

Authors’ Affiliations

  1. Microbial Genetics Division, National Institute of Agricultural Biotechnology (NIAB), Rural Development Administration (RDA)
  2. Korean BioInformation Center (KOBIC), KRIBB
  3. Department of Computer Science and Engineering, Ewha Womans University
  4. Department of Biotechnology and Informatics, Sang Myung University
  5. Macrogen Inc.
  6. Bioinformatics Team, Electronics and Telecommunications Research Institute (ETRI)


  1. Uetz P, Giot L, Cagney G, Mansfield TA, Judson RS, Knight JR, Lockshon D, Narayan V, Srinivasan M, Pochart P, Qureshi-Emili A, Li Y, Godwin B, Conover D, Kalbfleisch T, Vijayadamodar G, Yang M, Johnston M, Fields S, Rothberg JM: A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae. Nature 2000, 403(6770):623–627. 10.1038/35001009
  2. Ito T, Tashiro K, Muta S, Ozawa R, Chiba T, Nishizawa M, Yamamoto K, Kuhara S, Sakaki Y: Toward a protein-protein interaction map of the budding yeast: A comprehensive system to examine two-hybrid interactions in all possible combinations between the yeast proteins. Proc Natl Acad Sci USA 2000, 97(3):1143–1147. 10.1073/pnas.97.3.1143
  3. Ito T, Chiba T, Ozawa R, Yoshida M, Hattori M, Sakaki Y: A comprehensive two-hybrid analysis to explore the yeast protein interactome. Proc Natl Acad Sci USA 2001, 98(8):4569–4574. 10.1073/pnas.061034498
  4. Arifuzzaman M, Maeda M, Itoh A, Nishikata K, Takita C, Saito R, Ara T, Nakahigashi K, Huang HC, Hirai A, Tsuzuki K, Nakamura S, Altaf-Ul-Amin M, Oshima T, Baba T, Yamamoto N, Kawamura T, Ioka-Nakamichi T, Kitagawa M, Tomita M, Kanaya S, Wada C, Mori H: Large-scale identification of protein-protein interaction of Escherichia coli K-12. Genome Res 2006, 16(5):686–691. 10.1101/gr.4527806
  5. Boucher C, Genin S, Arlat M: Concepts actuels sur la pathogénie chez les bactéries phytopathogènes. C R Acad Sci III 2001, 324(10):915–922.
  6. Flor HH: Current status of the gene-for-gene concept. Annu Rev Phytopathol 1971, 9: 275–296. 10.1146/
  7. Rossier O, Van den Ackerveken G, Bonas U: HrpB2 and HrpF from Xanthomonas are type III-secreted proteins and essential for pathogenicity and recognition by the host plant. Mol Microbiol 2000, 38(4):828–838. 10.1046/j.1365-2958.2000.02173.x
  8. Alegria MC, Docena C, Khater L, Ramos CH, da Silva AC, Farah CS: New protein-protein interactions identified for the regulatory and structural components and substrates of the type III Secretion system of the phytopathogen Xanthomonas axonopodis Pathovar citri. J Bacteriol 2004, 186(18):6186–6197. 10.1128/JB.186.18.6186-6197.2004
  9. Alegria MC, Souza DP, Andrade MO, Docena C, Khater L, Ramos CH, da Silva AC, Farah CS: Identification of new protein-protein interactions involving the products of the chromosome- and plasmid-encoded type IV secretion loci of the phytopathogen Xanthomonas axonopodis pv. citri. J Bacteriol 2005, 187(7):2315–2325. 10.1128/JB.187.7.2315-2325.2005
  10. Büttner D, Lorenz C, Weber E, Bonas U: Targeting of two effector protein classes to the type III secretion system by a HpaC- and HpaB-dependent protein complex from Xanthomonas campestris pv. vesicatoria. Mol Microbiol 2006, 59(2):513–527. 10.1111/j.1365-2958.2005.04924.x
  11. Gurlebeck D, Szurek B, Bonas U: Dimerization of the bacterial effector protein AvrBs3 in the plant cell cytoplasm prior to nuclear import. Plant J 2005, 42(2):175–187. 10.1111/j.1365-313X.2005.02370.x
  12. Ezuka A, Kaku H: A historical review of bacterial blight of rice. Bull Natl Inst Agrobiol Resour (Japan) 2000, 15: 53–54.
  13. Lee BM, Park YJ, Park DS, Kang HW, Kim JG, Song ES, Park IC, Yoon UH, Hahn JH, Koo BS, Lee GB, Kim H, Park HS, Yoon KO, Kim JH, Jung CH, Koh NH, Seo JS, Go SJ: The genome sequence of Xanthomonas oryzae pathovar oryzae KACC10331 the bacterial blight pathogen of rice. Nucleic Acids Res 2005, 33(2):577–586. 10.1093/nar/gki206
  14. Jang M, Park BC, Lee DH, Bae K-H, Cho S, Park HS, Lee BR, Park SG: Interaction proteome analysis of Xanthomonas Hrp proteins. J Microbiol Biotechnol 2007, 17: 359–363.
  15. Lee BM, Park YJ, Kim JG, Kang HW: Genomic study of Xanthomonas oryzae pv. oryzae KACC10331. Proceedings of the International Workshop Xanthomonas genome research: 27–28 October 2005; Bielefeld Germany
  16. Song WY, Wang GL, Chen LL, Kim HS, Pi LY, Holsten T, Gardner J, Wang B, Zhai WX, Zhu LH, Fauquet C, Ronald P: A receptor kinase-like protein encoded by the rice disease resistance gene, Xa21. Science 1995, 270(5243):1804–1806. 10.1126/science.270.5243.1804
  17. Hubbard TJ, Murzin AG, Brenner SE, Chothia C: SCOP: a structural classification of proteins database. Nucleic acids research 1997, 25(1):236–239. 10.1093/nar/25.1.236
  18. Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic acids research 1997, 25(17):3389–3402. 10.1093/nar/25.17.3389
  19. Park D, Lee S, Bolser D, Schroeder M, Lappe M, Oh D, Bhak J: Comparative interactomics analysis of protein family interaction networks using PSIMAP (protein structural interactome map). Bioinformatics (Oxford, England) 2005, 21(15):3234–3240. 10.1093/bioinformatics/bti512
  20. Park J, Lappe M, Teichmann SA: Mapping protein family interactions: intramolecular and intermolecular protein family interaction repertoires in the PDB and yeast. Journal of molecular biology 2001, 307(3):929–938. 10.1006/jmbi.2001.4526
  21. Gong S, Yoon G, Jang I, Bolser D, Dafas P, Schroeder M, Choi H, Cho Y, Han K, Lee S, Choi H, Lappe M, Holm L, Kim S, Oh D, Bhak J: PSIbase: a database of Protein Structural Interactome map (PSIMAP). Bioinformatics (Oxford, England) 2005, 21(10):2541–2543. 10.1093/bioinformatics/bti366
  22. Xenarios I, Rice DW, Salwinski L, Baron MK, Marcotte EM, Eisenberg D: DIP: the database of interacting proteins. Nucleic acids research 2000, 28(1):289–291. 10.1093/nar/28.1.289
  23. Bader GD, Hogue CW: BIND – a data specification for storing and describing biomolecular interactions, molecular complexes and pathways. Bioinformatics (Oxford, England) 2000, 16(5):465–477. 10.1093/bioinformatics/16.5.465
  24. Hermjakob H, Montecchi-Palazzi L, Lewington C, Mudali S, Kerrien S, Orchard S, Vingron M, Roechert B, Roepstorff P, Valencia A, Margalit H, Armstrong J, Bairoch A, Cesareni G, Sherman D, Apweiler R: IntAct: an open source molecular interaction database. Nucleic acids research 2004, (32 Database):D452–455. 10.1093/nar/gkh052
  25. Zanzoni A, Montecchi-Palazzi L, Quondam M, Ausiello G, Helmer-Citterich M, Cesareni G: MINT: a Molecular INTeraction database. FEBS letters 2002, 513(1):135–140. 10.1016/S0014-5793(01)03293-8
  26. Peri S, Navarro JD, Kristiansen TZ, Amanchy R, Surendranath V, Muthusamy B, Gandhi TK, Chandrika KN, Deshpande N, Suresh S, Rashmi BP, Shanker K, Padma N, Niranjan V, Harsha HC, Talreja N, Vrushabendra BM, Ramya MA, Yatish AJ, Joy M, Shivashankar HN, Kavitha MP, Menezes M, Choudhury DR, Ghosh N, Saravana R, Chandran S, Mohan S, Jonnalagadda CK, Prasad CK, Kumar-Sinha C, Deshpande KS, Pandey A: Human protein reference database as a discovery resource for proteomics. Nucleic acids research 2004, (32 Database):D497–501. 10.1093/nar/gkh070
  27. Stark C, Breitkreutz BJ, Reguly T, Boucher L, Breitkreutz A, Tyers M: BioGRID: a general repository for interaction datasets. Nucleic acids research 2006, (34 Database):D535–539. 10.1093/nar/gkj109
  28. Sonnhammer EL, Eddy SR, Birney E, Bateman A, Durbin R: Pfam: multiple sequence alignments and HMM-profiles of protein domains. Nucleic acids research 1998, 26(1):320–322. 10.1093/nar/26.1.320
  29. Finn RD, Marshall M, Bateman A: iPfam: visualization of protein-protein interactions in PDB at domain and amino acid resolutions. Bioinformatics (Oxford, England) 2005, 21(3):410–412. 10.1093/bioinformatics/bti011
  30. Camon E, Magrane M, Barrell D, Lee V, Dimmer E, Maslen J, Binns D, Harte N, Lopez R, Apweiler R: The Gene Ontology Annotation (GOA) Database : sharing knowledge in Uniprot with Gene Ontology. Nucleic Acids Research 2004, (32 Database):D262-D266. 10.1093/nar/gkh021


© Kim et al; licensee BioMed Central Ltd. 2008

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.