- Research article
- Open Access
A computational approach to discovering the functions of bacterial phytochromes by analysis of homolog distributions
BMC Bioinformatics volume 7, Article number: 141 (2006)
Phytochromes are photoreceptors, discovered in plants, that control a wide variety of developmental processes. They have also been found in bacteria and fungi, but for many species their biological role remains obscure. This work concentrates on the phytochrome system of Agrobacterium tumefaciens, a non-photosynthetic soil bacterium with two phytochromes. To identify proteins that might share common functions with phytochromes, a co-distribution analysis was performed on the basis of protein sequences from 138 bacteria.
A database of protein sequences from 138 bacteria was generated. Each sequence was BLASTed against the entire database. The homolog distribution of each query protein was then compared with the homolog distribution of every other protein (target protein) of the same species, and the target proteins were sorted according to their probability of co-distribution under random conditions. As query proteins, phytochromes from Agrobacterium tumefaciens, Pseudomonas aeruginosa, Deinococcus radiodurans and Synechocystis PCC 6803 were chosen along with several phytochrome-related proteins from A. tumefaciens. The Synechocystis photosynthesis protein D1 was selected as a control. In the D1 analyses, the ratio between photosynthesis-related proteins and those not related to photosynthesis among the top 150 in the co-distribution tables was > 3:1, showing that the method is appropriate for finding partner proteins with common functions. The co-distribution of phytochromes with other histidine kinases was remarkably high, although most co-distributed histidine kinases were not direct BLAST homologs of the query protein. This finding implies that phytochromes and other histidine kinases share common functions as parts of signalling networks. All phytochromes tested, with one exception, also revealed a remarkably high co-distribution with glutamate synthase and methionine synthase. This result implies a general role of bacterial phytochromes in ammonium assimilation and amino acid metabolism.
It was possible to identify several proteins that might share common functions with bacterial phytochromes by the co-distribution approach. This computational approach might also be helpful in other cases.
Many photoreceptors such as rhodopsins, phytochromes, cryptochromes and phototropins have been discovered in eukaryotic organisms by a combination of biochemical and physiological assays , whereas prokaryotic versions of these proteins have often been identified during genome projects. Phytochromes, which are photoreceptors with a bilin chromophore, control a broad range of developmental processes in plants . The discovery of plant phytochrome in the late 1950s was the starting point for biochemical, molecular and physiological characterisations. Since the late 1990s, phytochromes have also been found in many bacteria and fungi [3–8], in most cases after the phytochrome gene has been identified during genome sequencing . Prototypical bacterial phytochromes are light-regulated histidine kinases, which trans-phosphorylate cognate response regulator proteins . In cyanobacteria, a number of light effects such as phototaxis , control of the circadian clock , chromatic adaptation  and adaptation to blue light conditions , are controlled by proteins that contain domains with rather weak homology to the so-called GAF-domain of phytochrome. The biological function of prototypical phytochrome is known for only a few bacteria. The cyanobacterial phytochrome Cph1 is important for adaptation to strong light conditions  and is involved in regulating several genes, including gifA, which encodes a regulator of glutamine synthase . However, the signal transduction link between Cph1 and the observed light effects is still obscure and the biological role of the prototypical phytochromes in other cyanobacteria is unknown.
The most obvious effects controlled by bacterial phytochromes have been found for Bradyrhizobium spp., a photosynthetic plant symbiont, and the purple bacterium Rhodopseudomonas palustris. In both species, the synthesis of bacteriochlorophyll and carotenoid pigments is under phytochrome control [16, 17].
Many non-photosynthetic bacteria, including the gamma ray resistant Deinococcus radiodurans, the soil bacterium Agrobacterium tumefaciens and the pathogen Pseudomonas aeruginosa also contain phytochromes [5, 6, 18]. For D. radiodurans, it has been reported that the phytochrome BphP controls the regulation of carotenoid biosynthesis , but as in Synechocystis, the signal transduction link between input and output is unknown. The biological role of phytochromes in other non-photosynthetic bacteria is unknown.
The present work concentrates on the phytochrome system of A. tumefaciens. This bacterium is known for its ability to induce plant tumours by gene transformation, a mechanism that is widely used for plant transformation . The genome of this bacterium was sequenced by two groups in 2001 [20, 21], the sequencing data revealed two phytochrome genes. Since their discovery, both A. tumefaciens phytochromes have been analysed biochemically as recombinant proteins. These studies provided deeper insight into general phytochrome functions and phytochrome diversity [6, 22–28]. Both phytochromes are expressed in A. tumefaciens cells, as revealed by UV/vis spectroscopy .
The computational approach presented here is aimed at identifying proteins that might share common features with bacterial phytochromes. The analysis is based on the distribution of protein homologs among bacteria according to the following concept: if proteins share common functions in one organism, they might have evolved together, with similar mutation rates, and their homologs should be abundant in similar subsets of species.
The principle of the investigation is outlined in Figure 1. In this example, six species were used for an imaginary global homology analysis. Each species has between eight and ten proteins, which are abbreviated by a letter for the species and a digit. Protein A1 of species A is the query protein of the imaginary co-distribution analysis. This protein has homologs in four other species and one homolog, A2, in species A itself. In Table 1, which gives the result of the co-distribution analysis, the proteins of species A are sorted according to the co-distribution probability p, which is calculated as described in the Methods section. Proteins that have homologs in a similar subset as the query protein A1 are listed at the top. Protein A6 has the best co-distribution match, with a probability of co-distribution under random conditions of p = 0.2. A2 is a direct homolog of A1; this is indicated by "1" in column e. The header of Table 1 contains further comments.
The co-distribution analyses in this study are based on protein sequences from 138 bacteria, listed in Additional file 21. The tabulated results of two global BLAST  searches with E-values of 0.000001 and 10 were stored in files that were used to compare the homolog distribution of protein couples. Two different BLAST analyses were undertaken in order to determine the dependence of the results on the E-value. The number of homologs usually differs according to the chosen E-value and lies within a reasonable range for the query proteins (see e.g. Table 2). Each query protein was probed against all the other proteins of the same species to produce sorted co-distribution tables, as outlined in the example of Figure 1 and Table 1 (see Additional files 1 to 20).
The photosynthetic protein D1 from the cyanobacterium Synechocystis PCC 6803 was chosen as a control query protein to see whether the co-distribution approach identifies related proteins (see Additional files 17 and 18). In addition, five phytochromes from four different species were selected as query proteins: Agp1 (= BphP1) (see Additional files 1 and 2) and Agp2 (= BphP2) (see Additional files 3 and 4) from A. tumefaciens, BphP from P. aeruginosa (see Additional files 13 and 14), BphP from D. radiodurans (see Additional files 11 and 12) and Cph1 from Synechocystis (see Additional files 15 and 16). The agp1 gene of A. tumefaciens is arranged in a gene cluster as depicted in Fig. 2. A response regulator protein termed AgR and a histidine kinase termed ExsG are encoded in the same operon. It has been shown that AgR is phosphorylated by Agp1 in a light-dependent manner . ExsG is homologous to the histidine kinase module of Agp2. A response regulator termed ExsF is encoded in the other DNA strand next to the exsG gene. In prokaryotes, such an arrangement points to possible common functions among the encoded proteins. Since the principal goal of this study was to gain information about the phytochrome system of A. tumefaciens, AgR (see Additional files 5 and 7), ExsF (see Additional files 7 and 8) and ExsG (see Additional files 9 and 10) were also selected as query proteins. The response regulator of Synechocystis phytochrome, Rcp1, which is phosphorylated by Cph1 , was also included (see Additional files 19 and 20).
For each query protein, the results of both global BLAST analyses were taken as sources for the co-distribution analyses. Thus, 20 co-distribution lists were generated. The target proteins in these lists were sorted as outlined in Table 1. All co-distribution lists are presented as additional files on the BMC web server.
The BLAST analysis with an E-value of 0.000001 revealed D1 homologs in 6 species. These species are identical to the 6 cyanobacteria that were selected for global analysis. In Synechocystis, there are 142 proteins with exactly the same distribution. Among these are 37 other proteins with photosynthesis-related functions, such as phycocyanin, allophycocyanin, ferredoxin and photosystem subunit proteins. A green background marks the corresponding field in the co-distribution table (see Additional file 17). Twenty-four of the 142 proteins have been annotated as "hypothetical" and 70 as "unknown protein". Eleven proteins that are clearly not related to photosynthesis, such as ribosomal proteins or tRNA synthetase, also have the same distribution as D1. A yellow background marks these fields in the co-distribution table.
When the D1 co-distribution analysis was based on a BLAST E-value of 10 (see Additional file 18), D1 homologs were found in 16 species. Among these are the six cyanobacteria and one other photosynthetic bacterium, R. palustris. The other 9 species are non-photosynthetic. Owing to the higher number of species with D1 homologs, the co-distribution results are more differentiated. For comparison with the previous survey, the top 149 proteins were inspected in more detail. Since the proteins placed between positions 141 and 149 have the same distribution, it was not possible to use exactly the same number of proteins as in the previous case. Among the top 149 proteins there are 42 photosynthesis-related proteins, 10 proteins with functions not related to photosynthesis, 17 "hypothetical proteins", 77 "unknown proteins", and 3 proteins for which the function could not clearly be assigned to photosynthesis. The number of photosynthesis-related proteins is comparable with the first analysis, but the selection is slightly different. For example, a ribulose-bisphosphate carboxylase subunit and a carbon dioxide concentrating protein subunit are among the top 149 in the second analysis, whereas the same proteins are placed at positions 184 and 378, respectively, in the first analysis.
The D1 analysis showed that proteins with related functions can be identified by the present approach, since among the top ca. 150 proteins there are three to four times more photosynthesis-related proteins than proteins with other known functions. It seems likely that among the hypothetical and unknown proteins listed at the top of the co-distribution tables, there are many other proteins with functions related to photosynthesis.
In cases where the phytochromes or phytochrome-related proteins mentioned above were chosen as query (see Additional files 1, 2, 3, 4 and 11, 12, 13, 14, 15, 16), the proteins placed within the top 100 in each co-distribution list were compared with all proteins from the same species. Text-based searches were performed to count the number of proteins belonging to particular groups of proteins such as histidine kinases, response regulators and transcription factors. These results are summarized in Table 2. It is remarkable that in all cases the frequency of histidine kinases (or "two-component sensors") among the top 100 co-distributed proteins is much higher than among all proteins of the species. For example, Agp1 has 18% to 20% co-distributed "two-component sensors" (= histidine kinases) among the top 100, whereas only 1.08% of all A. tumefaciens proteins belong to this group. Since bacterial phytochromes are also histidine kinases, the co-distribution with other histidine kinases might simply be based on direct homology, as between A1 and A2 in the example of Fig. 1. However, most co-distributed histidine kinases are not direct BLAST homologs, as indicated in column "e" of the co-distribution tables. The results for the co-distribution of (two-component-) response regulators, which are substrates of histidine kinases, are qualitatively comparable with those for the histidine kinases. There are, however, two exceptions: in the Agp2 and Rcp1 analyses, which were based on the low E-value BLAST search, the frequency of response regulators among the top 100 in the lists was comparable with that in the entire protein population.
In the case of the A. tumefaciens query proteins, the possibility was tested that proteins designated "transcriptional regulators" are enriched within the top 100 in the lists. For all five query proteins (see Additional files 1 to 10), the frequency of co-distributed transcriptional regulators increases when the BLAST analysis E-value is changed from low to high. The latter but not the former values are above average.
The five A. tumefaciens query proteins were also tested for co-distribution among each other. Table 3 gives the positions in the co-distribution tables for each possible combination. This table shows that AgR and Agp1 match quite well: in three out of four combinations, the target protein was among the first 100 in the co-distribution list. Similarly, there is also a rather good match between Cph1 and Rcp1 of Synechocystis (Table 4). The putative response regulator of D. radiodurans BphP (gi number: 15807719) appears at positions 5 and 433 in the BphP co-distribution tables. For the combinations ExsG/Agp2, ExsF/Agp2 and ExsF/ExsG, the target protein is placed among the first 100 in the co-distribution lists, indicating a rather high co-distribution. Agp2 and ExsG are direct BLAST homologs, as indicated in column "e" of the result tables. The similarity between these two proteins has been noted previously . As mentioned above, the exsF and exsG genes are located close together (Fig. 2). This arrangement suggests that ExsF is a substrate of ExsG, which in turn could explain the good match between these proteins.
Besides phytochromes, A. tumefaciens contains another putative photoreceptor, a flavoprotein that belongs of the cryptochrome/photolyase group. Cryptochromes and photolyases are homologous proteins that serve as photoreceptors and catalyze light-dependent DNA repair mechanisms, respectively. In A. tumefaciens, the protein annotated as DNA photolyase (gi: 17935123) has also been classified as cryptochrome , but functional details are as yet unclear. In plants, the signal transduction pathways of cryptochromes and phytochromes are intertwined . For the plant Arabidopsis thaliana it has been reported that a cryptochrome interacts directly with a phytochrome . For these reasons, the co-distribution of phytochromes and cryptochromes/photolyases was of particular interest. In A. tumefaciens, there is a good match between Agp1 as query and the DNA photolyase (cryptochrome) as target; the latter is placed at positions 156 and 17 in the co-distribution tables (see Additional files 1 and 2). There is no significant co-distribution between Agp2 and cryptochrome/photolyase (see Additional files 3 and 4). In Synechocystis and P. aeruginosa, the co-distribution between phytochromes and cryptochromes/photolyases is rather poor (see Additional files 13, 14, 15 and 16), and in D. radiodurans, cryptochromes/photolyases seem to be absent (see Additional files 11 and 12).
The Agp1 lists were inspected for further candidates that might share common functions with this phytochrome. One striking observation was that either two or three glutamate synthase large subunits are among the first 10 proteins in the co-distribution tables (see Additional files 1 and 2). There are three large and one small subunits of this enzyme in A. tumefaciens. With Agp2 as query, none of the glutamate synthase subunits appeared among the first proteins in the co-distribution table (see Additional file 2). However, in the case of D. radiodurans, Synechocystis and P. aeruginosa phytochromes, cross-correlation with glutamate synthase subunits is also obvious (see Additional files 11, 12, 13, 14, 15, 16). In both Synechocystis Cph1 tables, two ferredoxin-dependent glutamate synthases are found among the top 60; in the D. radiodurans tables, the large subunit of glutamate synthase is placed among the top 50; and in the P. aeruginosa tables, the large subunit is found at positions 74 and 123.
Another enzyme of amino acid metabolism, methionine synthase, is also located at the top of the Agp1 lists, namely at positions 3 and 87 (see Additional files 1 and 2). Again, there is no significant co-distribution between Agp2 and this protein (see Additional files 3 and 4), but with the other phytochromes (see Additional files 11 to 16) the co-distribution is in the range of glutamate synthase (Table 5).
For a given target protein, the co-distribution table shows which proteins have similar or equal BLAST-homolog distributions among a set of species. These tables can be used to test for the co-distribution of protein couples with known functions or to find protein partners with a yet unknown mutuality, given that the relationship has led to co-evolution of these proteins. When the photosynthetic protein D1 was chosen as query protein, many other photosynthesis-related proteins appeared among the first ca. 150 proteins of the co-distribution tables. The ratio between these proteins and those that are not related to photosynthesis is > 3:1. This result implies that the chances of finding protein couples with related functions by the method presented here are rather high.
Two classes of proteins are known to act together with bacterial phytochromes: response regulators, which are trans-phosphorylated by the phytochrome histidine kinase subunit, and heme oxygenases, the enzymes that catalyse the last or the second last step in phytochrome chromophore biosynthesis. Depending on the species, genes for either protein may be found next to the phytochrome genes . Heme oxygenases appear in rather low positions in all the phytochrome co-distribution tables. A co-evolutionary relationship between these proteins is thus not supported by the present study. Phylogenetic analyses imply that cyanobacterial heme oxygenases are of different origin from proteobacterial homologs, whereas bacterial phytochromes seem to share one common origin [35, 36]. This might explain the rather large distance between the two proteins in the co-distribution analysis. The co-distribution between Agp1 (Table 3)/Cph1 (Table 4) and their cognate response regulators is in general rather good. In the phytochrome co-distribution tables, other response regulators are found higher in the list. An unambiguous identification of the cognate response regulator by the present approach is thus not expected. However, this approach could reduce the number of proteins to be analysed for those species where the response regulator is yet to be identified. In P. aeruginosa, the cognate phytochrome response regulator cannot be deduced from the gene arrangement. According to the list of P. aeruginosa proteins, there are 56 response regulators in this species; an initial biochemical screen could focus on those placed at the tops of the co-distribution lists.
The rather high frequency of histidine kinases and response regulators among those proteins listed at the tops of the phytochrome co-distribution tables suggests that bacterial phytochromes and other histidine kinases act together in a complex intracellular network. The common model of two-component signalling predicts that histidine kinases act as homodimers and that they specifically transfer phosphate to one cognate response regulator . However, more complex interactions might exist in the natural host. (i) Two different histidine kinase monomers could form heterodimers. For the plant A. thaliana it has been shown that four of the five phytochromes can form heterodimers, most likely by their histidine-kinase-like subunits . (ii) Different response regulators might function as substrates of the histidine kinase. (iii) The downstream signalling pathways of different input histidine kinases could merge. For signal transduction in which bacterial phytochromes are involved, none of these possibilities has yet been tested.
In addition to proteins with regulatory functions, the enzyme glutamate synthase is of particular interest. In A. tumefaciens, there are three large and one small glutamate synthase subunits. Depending on the E-value of the global BLAST analysis, either two or three of these proteins were among the top 10 in the co-distribution tables. With the exception of Agp2, all other phytochromes in the present study have co-distributed glutamate synthases. At least one glutamate synthase (subunit) is placed among the top 100 in the co-distribution lists. In the cyanobacterium Synechocystis, the ferredoxin-dependent enzyme matches better with the phytochrome Cph1 query protein than the NADH-dependent enzyme. In plants, where phytochrome action has been analysed over decades, there are also ferredoxin-dependent and NADH-dependent glutamate synthases. The ferredoxin-dependent enzyme is located in the plastid, where it acts together with glutamine synthase to incorporate ammonium (NH4+) into glutamine and glutamate. Ammonium is formed from nitrite (NO2-) by nitrite reductase, which like glutamate synthase is directly coupled to the photosynthetic electron cascade in the chloroplast via ferredoxin. The expression of all three enzymes and the cytosolic nitrate reductase, which catalyses the conversion of nitrate into nitrite, is light-regulated by phytochrome [39–43].
In cyanobacteria, enzymes of ammonium assimilation seem to be regulated by ammonium, but not by light . Light control of gene transcription was analysed by RNA profiling in wild type and Cph1 and Cph2 mutants of Synechocystis . In these studies, no influence of Cph1 on the abundance of glutamate synthase mRNA was found (T. Börner and T. Hübschmann, personal communication). However, the expression of GifA, a regulatory protein of glutamine synthase, which acts in cooperation with glutamate synthase in the "GOGAT cycle", seems be under the control of Cph1 and Cph2, as deduced from expression profiling results on the double mutant . It could therefore be that the GOGAT cycle is indirectly under the control of phytochromes in Synechocystis.
The present data imply that bacterial phytochromes might contribute to the regulation of glutamate synthase in other prokaryotic species as well. The fact that phytochrome homologs and glutamate synthase homologs are found in similar sub-sets of species points to a common and ancient link between these two groups of proteins.
How can these proteins be connected? Glutamate is a key molecule of nitrogen metabolism. Glutamate and glutamine are the first amino acids in which ammonium is fixed into organic matter. Glutamate serves as nitrogen source and in most species also as a carbon source for porphyrins. In the "tRNA pathway", realized in the majority of bacteria and plants, glutamate-tRNA is used as substrate for the synthesis of δ aminolevulinic acid, which is the key molecule in porphyrin synthesis. In α proteobacteria (including A. tumefaciens), yeast and mammals, δ aminolevulinic acid is formed from glycine and succinate . In this case, their porphyrins obtain only the amino group of glutamate, which is transferred to serine, the substrate of glycine synthesis . Owing to their extended π-electron systems, porphyrins absorb visible light, predominantly in the longer wavelength regions. The biological functions of many porphyrins, for example chlorophylls in photosynthetic organisms, are directly related to light absorption; other porphyrins such as heme are involved in electron transport or redox reactions. Light absorbed by free porphyrins, leading to photosensitization, can have damaging effects by generating reactive oxygen species. Thus, the synthesis of porphyrins by light-exposed cells must be tightly regulated. Not only photosynthetic organisms but also other organisms that are exposed to sunlight might benefit from light regulation of porphyrin synthesis. The histidine kinase activity of bacterial phytochromes depends on light and the presence of the bilin chromophore. Therefore, phytochromes may also be regarded as sensors for the end product of porphyrin biosynthesis. It therefore seems plausible that phytochromes might have evolved as regulators of porphyrin synthesis.
A possible connection between phytochrome and methionine synthase is less evident. A literature survey gave no indication of phytochrome-mediated regulation of methionine synthase expression. If glutamate synthase catalyses an early step in the amino acid metabolism network, methionine synthase catalyses a late step [46, 47]. Methionine, like glutamate, serves as substrate for other enzymatic reactions besides protein synthesis: the activated form, S-adenosylmethionine, is used for methylation reactions including DNA  and protein methylation, and for the synthesis of the gaseous hormone ethylene in land plants . Methionine synthase is also important for regenerating methionine from S-adenosyl-homocysteine, the breakdown product of the methylation reaction. It could be that DNA methylation protects DNA from UV damage  and that the turnover of methionine is therefore higher in the light than in the dark. In this way, phytochrome could have come into play.
Although such scenarios on co-evolution are speculative, the present co-distribution data might help to gain a better understanding of phytochrome function in bacteria. The co-distribution lists contain other proteins besides those discussed in this article that might share common functions with phytochromes. In combination with genome, proteome and mutant studies, the method presented here can give clues to the evolution of signal transduction, metabolism and other cellular functions. In the present approach, only one digit was used to express protein homology (homologous or not homologous). This decision was based on the E-values of the BLAST analyses. Co-distribution analyses based on graded information about protein homology might give even more precise results. In addition, information about the length and position of homologous sequences could be included. BLAST is a heuristic algorithm, designed for fast database searches. If the number of protein sequences is not too great, accurate methods for sequence comparison can also be chosen . The time required for the global BLAST analyses was approximately 2 weeks on a standard desktop computer with 2 GByte RAM. With faster machines, is seems realistic to test each possible combination of two protein sequences in a database containing 400000 sequences by non-heuristic alignment algorithms.
The co-distribution analysis has allowed a deeper insight to be gained into possible evolutionary relationships. Controls with D1 as query protein show that the method identifies other proteins with related functions. The present studies have allowed the possible relationships between bacterial phytochromes and other specific proteins, such as response regulators and histidine kinases, to be tested. With glutamate synthase, a protein was identified that might be evolutionarily linked to Agrobacterium phytochrome Agp1 and phytochromes of other species. The method presented thus helps to guide the design of molecular studies.
Annotated 389373 protein sequences from 138 bacteria with sequenced genomes were downloaded from the FASTA databases given under the NCBI web site . A list of all species and their NCBI tax-id is given in Additional file 21. The header of each protein sequence has several fields in which the gi number, the protein function and the species name are given. Using the Linux "sed" program, these headers were modified so that the species TaxID  was included in each header. These modified FASTA files were concatenated and converted in one BLAST database using the formatdb program (see  for documentation of BLAST). Then two global BLAST analyses were performed on the local computer; the protein sequences from each modified FASTA file were used as inputs. In the first global BLAST analysis, the E-value was set to 0.000001; in the second analysis the default E-value 10 was used. The tabulated output data of the global analyses (using the BLAST-m 8 parameter) were stored in files that were named according to the input files. In this way, the results of the global BLAST analyses could be used for species-wide comparisons. In the next step, superfluous information was deleted using a PERL script. The stripped files contain the gi number and TaxID of the query proteins, and for each query protein a list of hits with the gi number, TaxID and bitscore value of each hit. The bitscore information was not used during subsequent analyses but might help for later refinements.
For co-distribution analyses, the name and gi number of the query protein (most often a phytochrome) and its hit-list were stored in a separate file, which was used as input for comparison with all other proteins of the same species. For these comparisons, a PERL script was written. This script lists the gi numbers of the target proteins, the number of species in which homologs were found and the number of hit species that were identical with the query protein. Column "e" of the list indicates whether the target protein is a "direct homolog" of the query protein. Another PERL script was used that adds the annotated protein name of each protein according to its gi number.
The probability p of the co-distribution found was calculated by the formula
where k is the number of species in which homologs of the query protein were found, l is the number of species with homologs of the target protein, m is the number of hit species with homologs to either protein, and n is the total number of species (= 138). If a random sample from a population of n members is taken l times without replacement, the probability that m samples belong to a sub-set that has k members is p.
Finally, the results were sorted according to this probability. The parameter abbreviations k, l, m, n and p are also used in the co-distribution tables, which are given as additional information. Since the query protein is also addressed as target protein and has the lowest p value, it always appears at the top of the list – unless there are other proteins that are found in exactly the same species; in this case the query protein is placed manually at the top of the list.
Co-distribution tables sorted according to the co-distribution probability p.
- Agp1 and 2:
phytochromes from Agrobacterium tumefaciens
cyanobacterial phytochrome 1
response regulator of Agrobacterium phytochrome
Briggs WR, Spudich JL: Handbook of Photosensory Receptors. Weinheim, Germany: Wiley Verlag; 2005.
Kendrick RE, Kronenberg GHM: Photomorphogenesis in Plants. 2nd edition. Edited by: Kendrick RE, Kronenberg GHM. Dordrecht/Boston/London: Kluwer Academic Publishers; 1994.
Hughes J, Lamparter T, Mittmann F, Hartmann E, Gärtner W, Wilde A, Börner T: A prokaryotic phytochrome. Nature 1997, 386: 663. 10.1038/386663a0
Yeh KC, Wu SH, Murphy JT, Lagarias JC: A cyanobacterial phytochrome two-component light sensory system. Science 1997, 277: 1505–1508. 10.1126/science.277.5331.1505
Davis SJ, Vener AV, Vierstra RD: Bacteriophytochromes: Phytochrome-like photoreceptors from nonphotosynthetic eubacteria. Science 1999, 286: 2517–2520. 10.1126/science.286.5449.2517
Lamparter T, Michael N, Mittmann F, Esteban B: Phytochrome from Agrobacterium tumefaciens has unusual spectral properties and reveals an N-terminal chromophore attachment site. Proc Natl Acad Sci USA 2002, 99: 11628–11633. 10.1073/pnas.152263999
Idnurm A, Heitman J: Light controls growth and development via a conserved pathway in the fungal kingdom. PLoS Biol 2005, 3: e95. 10.1371/journal.pbio.0030095
Lamparter T: Evolution of cyanobacterial and plant phytochromes. FEBS Lett 2004, 573: 1–5. 10.1016/j.febslet.2004.07.050
Kaneko T, Sato S, Kotani H, Tanaka A, Asamizu E, Nakamura Y, Miyajima N, Hirosawa M, Sugiura M, Sasamoto S, Kimura T, Hosouchi T, Matsuno A, Muraki A, Nakazaki N, Naruo K, Okumura S, Shimpo S, Takeuchi C, Wada T, Watanabe A, Yamada M, Yasuda M, Taba S: Sequence analysis of the genome of the unicellular cyanobacterium Synechocystis sp. strain PCC6803. II. Sequence determination of the entire genome and assignment of potential protein-coding regions. DNA Res 1996, 3: 109–136. 10.1093/dnares/3.3.109
Yoshihara S, Ikeuchi M: Phototactic motility in the unicellular cyanobacterium Synechocystis sp. PCC 6803. Photochem Photobiol Sci 2004, 3: 512–518. 10.1039/b402320j
Schmitz O, Katayama M, Williams SB, Kondo T, Golden SS: CikA, a bacteriophytochrome that resets the cyanobacterial circadian clock. Science 2000, 289: 765–768. 10.1126/science.289.5480.765
Kehoe DM, Grossman R: Similarity of a chromatic adaptation sensor to phytochrome and ethylene receptors. Science 1996, 273: 1409–1412.
Wilde A, Churin Y, Schubert H, Börner T: Disruption of a Synechocystis sp. PCC 6803 gene with partial similarity to phytochrome genes alters growth under changing light qualities. FEBS Lett 1997, 406: 89–92. 10.1016/S0014-5793(97)00248-2
Fiedler B, Broc D, Schubert H, Rediger A, Börner T, Wilde A: Involvement of cyanobacterial phytochromes in growth under different light qualities and quantities. Photochem Photobiol 2004, 79: 551–555. 10.1562/RN-013R.1
Hübschmann T, Yamamoto H, Gieler T, Murata N, Börner T: Red and far-red light alter the transcript profile in the cyanobacterium Synechocystis sp. PCC 6803: impact of cyanobacterial phytochromes. FEBS Lett 2005, 579: 1613–1618. 10.1016/j.febslet.2005.01.075
Giraud E, Fardoux J, Fourrier N, Hannibal L, Genty B, Bouyer P, Dreyfus B, Vermeglio A: Bacteriophytochrome controls photosystem synthesis in anoxygenic bacteria. Nature 2002, 417: 202–205. 10.1038/417202a
Giraud E, Vuillet L, Hannibal L, Fardoux J, Zappa S, Adriano JM, Berthomieu C, Bouyer P, Pignol D, Verméglio A: A new type of bacteriophytochrome acts in tandem with a classical bacteriophytochrome to control the antennae synthesis in Rhodopseudomonas palustris . J Biol Chem 2005, 280: 32389–32397. 10.1074/jbc.M506890200
Tasler R, Moises T, Frankenberg-Dinkel N: Biochemical and spectroscopic characterization of the bacterial phytochrome of Pseudomonas aeruginosa . FEBS J 2005, 272: 1927–1936. 10.1111/j.1742-4658.2005.04623.x
Gelvin SB: Agrobacterium -mediated plant transformation: The biology behind the "gene-jockeying" tool. Microbiol Mol Biol Rev 2003, 67: 16–37. 10.1128/MMBR.67.1.16-37.2003
Goodner B, Hinkle G, Gattung S, Miller N, Blanchard M, Qurollo B, Goldman BS, Cao Y, Askenazi M, Halling C, Mullin L, Houmiel K, Gordon J, Vaudin M, Iartchouk O, Epp A, Liu F, Wollam C, Allinger M, Doughty D, Scott C, Lappas C, Markelz B, Flanagan C, Crowell C, Gurson J, Lomo C, Sear C, Strub G, Cielo C, Slater S: Genome sequence of the plant pathogen and biotechnology agent Agrobacterium tumefaciens C58. Science 2001, 294: 2323–2328. 10.1126/science.1066803
Wood DW, Setubal JC, Kaul R, Monks DE, Kitajima JP, Okura VK, Zhou Y, Chen L, Wood GE, Almeida NFJ, Woo L, Chen Y, Paulsen IT, Eisen JA, Karp PD, Bovee DS, Chapman P, Clendenning J, Deatherage G, Gillet W, Grant C, Kutyavin T, Levy R, Li MJ, McClelland E, Palmieri A, Raymond C, Rouse G, Saenphimmachak C, Wu Z, Romero P, Gordon D, Zhang S, Yoo H, Tao Y, Biddle P, Jung M, Krespan W, Perry M, Gordon-Kamm B, Liao L, Kim S, Hendrick C, Zhao ZY, Dolan M, Chumley F, Tingey SV, Tomb JF, Gordon MP, Olson MV, Nester EW: The genome of the natural genetic engineer Agrobacterium tumefaciens C58. Science 2001, 294: 2317–2323. 10.1126/science.1066804
Karniol B, Vierstra RD: The HWE histidine kinases, a new family of bacterial two-component sensor kinases with potentially diverse roles in environmental signaling. J Bacteriol 2004, 186: 445–453. 10.1128/JB.186.2.445-453.2004
Karniol B, Vierstra RD: The pair of bacteriophytochromes from Agrobacterium tumefaciens are histidine kinases with opposing photobiological properties. Proc Natl Acad Sci USA 2003, 100: 2807–2812. 10.1073/pnas.0437914100
Borucki B, von Stetten D, Seibeck S, Lamparter T, Michael N, Mroginski MA, Otto H, Murgida DH, Heyn MP, Hildebrandt P: Light-induced proton release of phytochrome is coupled to the transient deprotonation of the tetrapyrrole chromophore. J Biol Chem 2005, 280: 34358–34364. 10.1074/jbc.M505493200
Inomata K, Hammam MAS, Kinoshita H, Murata Y, Khawn H, Noack S, Michael N, Lamparter T: Sterically Locked Synthetic Bilin Derivatives and Phytochrome Agp1 from Agrobacterium tumefaciens Form Photoinsensitive Pr- and Pfr-like Adducts. J Biol Chem 2005, 280: 24491–24497. 10.1074/jbc.M504710200
Lamparter T, Michael N: Agrobacterium Phytochrome as an Enzyme for the Production of ZZE Bilins. Biochemistry 2005, 44: 8461–8469. 10.1021/bi047510g
Lamparter T, Michael N, Caspani O, Miyata T, Shirai K, Inomata K: Biliverdin binds covalently to Agrobacterium phytochrome Agp1 via its ring A vinyl side chain. J Biol Chem 2003, 278: 33786–33792. 10.1074/jbc.M305563200
Lamparter T, Carrascal M, Michael N, Martinez E, Rottwinkel G, Abian J: The biliverdin chromophore binds covalently to a conserved cysteine residue in the N-terminus of Agrobacterium phytochrome Agp1. Biochemistry 2004, 43: 3659–3669. 10.1021/bi035693l
Oberpichler I, Molina I, Neubauer O, Lamparter T: Phytochromes from Agrobacterium tumefaciens: difference spectroscopy with extracts of wild type and knockout mutants. FEBS Lett 2005, in press.
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol 1990, 215: 403–410. 10.1006/jmbi.1990.9999
Kleine T, Lockhart P, Batschauer A: An Arabidopsis protein closely related to Synechocystis cryptochrome is targeted to organelles. Plant J 2003, 35: 93–103. 10.1046/j.1365-313X.2003.01787.x
Ahmad M, Cashmore AR: The blue-light receptor cryptochrome 1 shows functional dependence on phytochrome A or phytochrome B in Arabidopsis thaliana. Plant J 1997, 11: 421–427. 10.1046/j.1365-313X.1997.11030421.x
Ahmad M, Jarillo JA, Smirnova O, Cashmore AR: The CRY1 blue light photoreceptor of Arabidopsis interacts with phytochrome A in vitro. Mol Cell 1998, 1: 939–948. 10.1016/S1097-2765(00)80094-5
Bhoo SH, Davis SJ, Walker J, Karniol B, Vierstra RD: Bacteriophytochromes are photochromic histidine kinases using a biliverdin chromophore. Nature 2001, 414: 776–779. 10.1038/414776a
Terry MJ, Linley PJ, Kohchi T: Making light of it: the role of plant haem oxygenases in phytochrome chromophore synthesis. Biochem Soc Trans 2002, 30: 604–609. 10.1042/BST0300604.
Brücker G, Mittmann F, Hartmann E, Lamparter T: Targeted site-directed mutagenesis of a heme oxygenase locus by gene replacement in the moss Ceratodon purpureus. Planta 2005, 220: 864–874. 10.1007/s00425-004-1411-6
Stock AM, Robinson VL, Goudreau PN: Two-component signal transduction. Annu Rev Biochem 2000, 69: 183–215. 10.1146/annurev.biochem.69.1.183
Sharrock RA, Clack T: Heterodimerization of type II phytochromes in Arabidopsis. Proc Natl Acad Sci USA 2004, 101: 11500–11505. 10.1073/pnas.0404286101
Appenroth KJ, Oelmueller R: Regulation of transcript level and nitrite reductase activity by phytochrome and nitrate in turions of Spirodela polyrhiza. Physiol Plant 1995, 93: 272–278. 10.1034/j.1399-3054.1995.930210.x
Lam HM, Coschigano KT, Oliveira IC, Melo-Oliveira R, Coruzzi GM: The molecular-genetics of nitrogen assimilation into amino acids in higher plants. Annual Review of Plant Physiology 1996, 47: 569–593. 10.1146/annurev.arplant.47.1.569
Teller S, Schmidt KH, Appenroth KJ: Ferredoxin-dependent but not NADH-dependent glutamate synthase is regulated by photochrome and a blue/UV-A light receptor in turions of Spirodela polyrhiza . Plant Physiology and Biochemistry 1996, 34: 713–719.
Lillo C, Provan F, Appenroth KJ: Photoreceptors involved in the regulation of nitrate reductase in Spirodela polyrhiza . Journal of Photochemistry and Photobiology B Biology 1998, 44: 205–210. 10.1016/S1011-1344(98)00144-4
Suzuki A, Rioual S, Lemarchand S, Godfroy N, Roux Y, Boutin JP, Rothstein S: Regulation by light and metabolites of ferredoxin-dependent glutamate synthase in maize. Physiol Plant 2001, 112: 524–530. 10.1034/j.1399-3054.2001.1120409.x
Muro-Pastor MI, Reyes JC, Florencio FJ: Ammonium assimilation in cyanobacteria. Photosynthesis Research 2005, 83: 135–150. 10.1007/s11120-004-2082-7
Jahn D, Hungerer C, Troup B: Unusual pathways and environmentally regulated genes in bacterial heme biosynthesis [Ungewöhnliche Wege und umweltregulierte Gene der bakteriellen Hämbiosynthese]. Naturwissenschaften 1996, 83: 389–400.
Umbarger HE: Amino acid biosynthesis and its regulation. Annual Review of Biochemistry 1978, 47: 533–606. 10.1146/annurev.bi.47.070178.002533
Azevedo RA, Lea PJ: Lysine metabolism in higher plants. Amino Acids 2001, 20: 261–279. 10.1007/s007260170043
Jeltsch A: Beyond Watson and Crick: DNA methylation and molecular enzymology of DNA methyltransferases. Chembiochem 2002, 3: 275–293.
Buchanan BB, Gruissem W, Jones RL: Biochemistry and molecular biology of plants. West Sussex, UK: John Wiley and Sons; 2002.
Tuteja N, Tuteja R, Singh MB, Bhalla PL, Misra MK: Molecular mechanisms of DNA damage and repair: Progress in plants. Critical Reviews in Biochemistry and Molecular Biology 2001, 36: 337–397. 10.1080/20014091074219
Smith TF, Waterman MS: Identification of common molecular subsequences. J Mol Biol 1981, 147: 195–197. 10.1016/0022-2836(81)90087-5
I thank the NCBI help desk for valuable information on BLAST formats as well as Thomas Börner and Thomas Hübschmann for information on Synechcoystis RNA profiling. This work was financially supported by the Deutsche Forschungsgemeinschaft (DFG) through Sfb 498, TP B2 and La 799/7-2.
TL developed the method, loaded and modified the protein database files, performed local BLAST runs, wrote the PERL scripts, analysed the results and wrote the manuscript.
Electronic supplementary material
Additional File 1: Sorted list of co-distribution results based on a low E-value global BLAST analysis, A. tumefaciens phytochrome Agp1 as query and all A. tumefaciens proteins as target. (XLS 712 KB)
Additional File 2: Sorted list of co-distribution results based on a high E-value global BLAST analysis, A. tumefaciens phytochrome Agp1 as query and all A. tumefaciens proteins as target. (XLS 714 KB)
Additional File 3: Sorted list of co-distribution results based on a low E-value global BLAST analysis, A. tumefaciens phytochrome Agp2 as query and all A. tumefaciens proteins as target. (XLS 715 KB)
Additional File 4: Sorted list of co-distribution results based on a high E-value global BLAST analysis, A. tumefaciens phytochrome Agp2 as query and all A. tumefaciens proteins as target. (XLS 714 KB)
Additional File 5: Sorted list of co-distribution results based on a low E-value global BLAST analysis, A. tumefaciens response regulator AgR as query and all A. tumefaciens proteins as target. (XLS 710 KB)
Additional File 6: Sorted list of co-distribution results based on a high E-value global BLAST analysis, A. tumefaciens response regulator AgR as query and all A. tumefaciens proteins as target. (XLS 716 KB)
Additional File 7: Sorted list of co-distribution results based on a low E-value global BLAST analysis, A. tumefaciens response regulator ExsF as query and all A. tumefaciens proteins as target. (XLS 715 KB)
Additional File 8: Sorted list of co-distribution results based on a high E-value global BLAST analysis, A. tumefaciens response regulator ExsF as query and all A. tumefaciens proteins as target. (XLS 716 KB)
Additional File 9: Sorted list of co-distribution results based on a low E-value global BLAST analysis, A. tumefaciens histidine kinase ExsG as query and all A. tumefaciens proteins as target. (XLS 718 KB)
Additional File 10: Sorted list of co-distribution results based on a high E-value global BLAST analysis, A. tumefaciens histidine kinase ExsG as query and all A. tumefaciens proteins as target. (XLS 718 KB)
Additional File 11: Sorted list of co-distribution results based on a low E-value global BLAST analysis, D. radiodurans phytochrome BphP as query and all D. radiodurans proteins as target. (XLS 460 KB)
Additional File 12: Sorted list of co-distribution results based on a high E-value global BLAST analysis, D. radiodurans phytochrome BphP as query and all D. radiodurans proteins as target. (XLS 460 KB)
Additional File 13: Sorted list of co-distribution results based on a low E-value global BLAST analysis, P. aeruginosa phytochrome BphP as query and all P. aeruginosa proteins as target. (XLS 754 KB)
Additional File 14: Sorted list of co-distribution results based on a high E-value global BLAST analysis, P. aeruginosa phytochrome BphP as query and all P. aeruginosa proteins as target. (XLS 756 KB)
Additional File 15: Sorted list of co-distribution results based on a low E-value global BLAST analysis, Synechocystis PCC 6803 phytochrome Cph1 as query and all Synechocystis proteins as target. (XLS 436 KB)
Additional File 16: Sorted list of co-distribution results based on a high E-value global BLAST analysis, Synechocystis PCC 6803 phytochrome Cph1 as query and all Synechocystis proteins as target. (XLS 438 KB)
Additional File 17: Sorted list of co-distribution results based on a low E-value global BLAST analysis, the Synechocystis PCC 6803 photosynthesis protein D1 as query and all Synechocystis proteins as target. (XLS 436 KB)
Additional File 18: Sorted list of co-distribution results based on a high E-value global BLAST analysis, the Synechocystis PCC 6803 photosynthesis protein D1 as query and all Synechocystis proteins as target. (XLS 438 KB)
Additional File 19: Sorted list of co-distribution results based on a low E-value global BLAST analysis, Synechocystis PCC 6803 response regulator Rcp1 as query and all Synechocystis proteins as target. (XLS 436 KB)
Additional File 20: Sorted list of co-distribution results based on a high E-value global BLAST analysis, Synechocystis PCC 6803 response regulator Rcp1 as query and all Synechocystis proteins as target. (XLS 438 KB)
About this article
Cite this article
Lamparter, T. A computational approach to discovering the functions of bacterial phytochromes by analysis of homolog distributions. BMC Bioinformatics 7, 141 (2006). https://doi.org/10.1186/1471-2105-7-141
- Response Regulator
- Histidine Kinase
- Query Protein
- Homolog Distribution