Skip to main content

Data recovery and integration from public databases uncovers transformation-specific transcriptional downregulation of cAMP-PKA pathway-encoding genes



The integration of data from multiple genome-wide assays is essential for understanding dynamic spatio-temporal interactions within cells. Such integration, which leads to a more complete view of cellular processes, offers the opportunity to rationalize better the high amount of "omics" data freely available in several public databases.

In particular, integration of microarray-derived transcriptome data with other high-throughput analyses (genomic and mutational analysis, promoter analysis) may allow us to unravel transcriptional regulatory networks under a variety of physio-pathological situations, such as the alteration in the cross-talk between signal transduction pathways in transformed cells.


Here we sequentially apply web-based and statistical tools to a case study: the role of oncogenic activation of different signal transduction pathways in the transcriptional regulation of genes encoding proteins involved in the cAMP-PKA pathway. To this end, we first re-analyzed available genome-wide expression data for genes encoding proteins of the downstream branch of the PKA pathway in normal tissues and human tumor cell lines. Then, in order to identify mutation-dependent transcriptional signatures, we classified cancer cells as a function of their mutational state. The results of such procedure were used as a starting point to analyze the structure of PKA pathway-encoding genes promoters, leading to identification of specific combinations of transcription factor binding sites, which are neatly consistent with available experimental data and help to clarify the relation between gene expression, transcriptional factors and oncogenes in our case study.


Genome-wide, large-scale "omics" experimental technologies give different, complementary perspectives on the structure and regulatory properties of complex systems. Even the relatively simple, integrated workflow presented here offers opportunities not only for filtering data noise intrinsic in high throughput data, but also to progressively extract novel information that would have remained hidden otherwise. In fact we have been able to detect a strong transcriptional repression of genes encoding proteins of cAMP/PKA pathway in cancer cells of different genetic origins. The basic workflow presented herein may be easily extended by incorporating other tools and can be applied even by researchers with poor bioinformatics skills.


Integration achieves one of the most important imperatives of systems biology, namely it reduces the dimensionality of global data needed to deliver useful information about the networks active in the system of interest. The integration of data from different sources provides an effective means to deal with this issue by reinforcing bona fide observations and reducing false negatives. Moreover, because different experimental technologies provide different insights into a system, the integration of multiple data types offers the greatest information about a particular cellular process [13]. For example, gene perturbation experiments (e.g., knockouts or RNA interference) and microarrays analysis can reveal relationships between genes that may imply direct physical interactions or indirect logical interactions. Indeed, microarray experiments permit us to look at overall patterns of gene expression in order to understand the architecture of genetic regulatory networks, a global approach that could ultimately lead to complete description of the transcription-control mechanisms in a cell. In contrast, chromatin immunoprecipitation (ChIp) data can reveal direct protein-DNA interactions or cofactor associations with bound transcription factors. Combined together, these technologies can provide a much more detailed view of a transcriptional regulatory network than either alone.

Several recent methods have addressed the problem of heterogeneous data integration and network prediction by modeling the noise inherent in high-throughput genomic datasets, especially by using statistical methods, which can significantly improve specificity and sensitivity and allow the robust integration of datasets with heterogeneous properties [4, 5]. However, many of these methods recently developed to implement our ability to integrate and compare heterogeneous data, are often not easy to use and/or not freely accessible [6, 7]. Taking into consideration that the development of efficient methods that facilitate the biological interpretation of these data is crucial, in the present work we focus on efficient identification of regulatory mechanisms, and propose an approach for analysis and interpretation of gene expression data based on the integration of various types of related biological information.

The cAMP-PKA signalling pathway is an important regulator of cell fate that controls the activity of metabolic enzymes, transcription factors and cytoskeletal proteins and is strongly associated with the onset of several endocrine and non-endocrine tumors. A fundamental characteristic of cAMP is its ability to stimulate cell proliferation in many cell types while inhibiting in others. Such ability has been related to the fact that cAMP regulates the Ras/Raf/ERK pathway, whose role in cancer onset is well known (about 25% of human cancers have a Ras mutation). Indeed the cAMP pathway is able to suppress ERK signaling through its ability to target C-Raf and conversely, to activate ERK signaling through its ability to target B-Raf [8]. The underlying inhibitory mechanism is reasonably well characterized and involves the uncoupling of Ras signaling to C-Raf. On the other hand, models to explain ERK activation by cAMP are incomplete and in addition to B-Raf the involvement of other proteins has been suggested [9, 10]. Many observations regarding the cAMP ability to inhibit/stimulate proliferation by interfering with ERKs have been collected in normal or immortalized cell lines. However, recognizing the important role of both pathways in the development of cancer, is relevant to a more specific analysis of their crosstalk network also in cancer cells [1114].

As previously described, the Ras pathway is able to crosstalk with the cAMP-PKA pathway by some typical signal transduction mechanisms (i.e. protein-protein interaction, protein phosphorylation). Moreover, through its ability to regulate the activity of a large number of transcription factors [15, 16], the Ras pathway is able to control several transcriptional programs leading to proliferation, differentiation, metabolism, cytoskeletal reorganization and immune response. Such transcriptional programs are the result of ras-specific effectors stimulation [13, 17]. Until now more than ten distinct functional classes of proteins have been involved as effectors of the small GTPase Ras, but the best studied are Raf kinases, type I phosphoinositide (PI) 3-kinases, Ral-guanine nucleotide exchange factors (Ral-GEFs), the Rac exchange factor Tiam1, and phospholipase C [18].

Raf and phosphatidylinositol 3-kinase (PI3K) were the first two identified Ras effectors and the main focus of research investigating Ras functions [19, 20]. Raf promotes cell proliferation and differentiation through the MAP kinase (MAPK) pathway [21], at the same time as PI3K generates anti-apoptotic signalling, directly or through Akt pathway activation [22, 23]. Both signalling pathways can activate two different signals distinct for their response timing. Indeed both MAPK and PI3K are able to activate phosphorylation cascades that lead, as primary effect, to post-translational modification of several substrates (membrane targets, cytosolic targets, cytoskeletal targets and nuclear targets), which rapidly activate functional processes. Early response to Ras signalling is quite fast: for instance in resting cells stimulated with mitogens, Ras-GTP level increases within 2 minutes from stimulation with serum [19]. Raf-1 undergoes transient activation within 2-3 minutes, and rapidly activates the mitogen-activated protein kinase (MAPK) cascade whose most downstream component, ERK, rapidly moves into the nucleus. Here it phosphorylates nuclear proteins, notably transcription factors [24, 25] whose activity can be controlled by regulating their sub-cellular localization, expression, stability, ability to bind to other components of transcriptional complexes and to DNA, and their ability to remodel chromatin structure [26]. Transcription factors are under the control of MAPK pathway, including members of the ETS family (i.e. Ets-1, Ets-2, PU-1), MADS box family (i.e. MEF2A, MEF2C, Sp1), Zinc Finger family (i.e. GATA-2 and GATA-4), bZip family (i.e. Fra-1, c-Jun, JunB, JunD, ATF-2, c-Fos and CREB), bHLH family (i.e. c-Myc, MITF), Nuclear Hormone Receptor (i.e. PR, GR and ER) as well as other transcription factors (i.e., SMAD1, STAT1) and coregulatory proteins (i.e., CBP, p300) [15, 24, 25].

Like ERKs, Akt and other targets of PI3K signalling can phosphorylate and activate transcription factors [27]. Akt protein can control several transcription factors directly or indirectly. Direct targets are the forkhead box proteins, FOXO, and the cell cycle inhibitor, MIZ1, which are both inhibited upon AKT-mediated phosphorylation. AKT-dependent regulation of p53, nuclear factor B (NFkB), c-MYC, activator protein 1 (AP1) and beta-catenin is indirect [16].

Such an observation led us to re-analyze, by using a generalized workflow for data recovery and integration, available data from multiple global assays and several databases (genomics, transcriptomics, promoter analysis and literature). In particular we searched for information for genes encoding proteins of the downstream branch of the PKA pathway (starting from adenylyl cyclase and downstream) in tumor cell lines (NCI60 cells) as a function of mutational activation of different pathways (notably the Ras and PI3K pathway) in comparison with the corresponding normal tissues, with the aim to define better the connection between these pathways in cancer cells [28].

Results and discussion

Gene-expression profiling has been applied extensively in cancer research. As a first step to identify regulatory mechanisms underlining gene-expression profiles it is necessary to extract, filter, cross-reference and structure information from cancer-related data sets [29]. The aim of this work has been the identification of cancer-specific specific gene expression signatures in genes encoding proteins involved in the cAMP-PKA pathway. In particular we wished to identify, if present, differences between primary normal tissues and cancer cells and search for correlation with the pathway mutationally activated in any given transformed cell line by integrating an accurate analysis of recovered data from several databases with the application of different statistical tests.

Transformation-dependent, transcriptional remodelling of the PKA pathway encoding genes in 60 human cancer cell lines (NCI60)

The NCI60 cell collection includes cell lines derived from colorectal, renal, ovarian, breast, prostate, lung and central nervous system cancers, as well as leukaemias and melanomas (Table 1), that are most commonly used in cancer research and drug screening [30, 31]. A good correlation between transcriptional profiles of the cell lines and their tumor cancer of origin [31, 32] has been found for 51 out of 59 cell lines. NCI60 transcriptional profiles are available in public databases.

Table 1 The cancer cell lines in the NCI60 collection sorted by tissue of origin

Since the stabilized cell lines within the NCI60 collection represent a physiological model to study gene profiles in cancer cells, with features strongly similar to cancer tissues, we reviewed information present in public databases about the 60 cell lines and 21 normal tissues, in order to identify transformation-dependent transcriptional signatures for PKA pathway-encoding genes (Table 2).

Table 2 Gene expression profiling datasets of NCI60 cell lines and normal tissues analyzed in this study

We identified and gathered the transcriptional profile for 41 genes encoding proteins involved in the PKA pathway (adenylyl cyclases -ADCY-, phosphodiesterases -PDE-, A-kinase anchor proteins -AKAP-, cAMP-dependent transcriptional factors -TF-, PKA catalytic subunits -PRKAC- and PKA regulatory subunits -PRKACR-, Table 3) and compared expression profiles of cancer cell lines with those of primary normal tissues, collected from different datasets (Table 3). To identify differences between normal and cancer samples, we performed an ANOVA analysis on the entire data set. As shown in Figure 1, distributions of expression values of genes encoding proteins of the cAMP/PKA pathway were statistically different between normal and transformed cells (p-value <0.0001), indicating that in transformed cells the PKA pathway-related genes are differentially expressed as compared to normal cells. Namely, the box plot indicates that, overall, the distribution of expression of values of transformed cells is shifted towards lower expression values. Dispersion of the distribution in transformed cells is much reduced compared to that observed in normal tissues, as if transformation events superimpose a negative regulation that largely abrogates tissue-specific regulation (i.e., the major factor responsible for dispersion of expression in normal tissues, see next paragraph).

Table 3 PKA related genes identified in all the datasets shown in Table 2 and used in this study
Figure 1
figure 1

Statistical analysis of the 41 PKA pathway-encoding genes expression in normal and transformed samples. 81 transcriptional profiles from normal tissues and from the NCI60 cancer cell line collection, were recovered from the GEO database. After normalization (see Methods), the expression values of 41 PKA pathway-encoding genes were used to perform an ANOVA analysis (p-value 0.0001) to evaluate the statistical significance of the differences between normal and transformed samples. IQR: Interquartile Range. Outliers are also shown.

The same data-set was then analyzed through unsupervised hierarchical clustering (as implemented in the GeneSpring platform) that organizes genes according to the similarity or dissimilarity in expression profile, placing the cases with similar expression profiles together as neighbouring columns in the dendrogram (Figure 2). Six different classes corresponding to the main arms of the dendrogram derived from clustering according to Tissue and cell lines (classes I to III correspond to the left main branch of dendrogram, IV to VI to the right branch) were identified. Each cell line is color-coded at the bottom according to its condition (i.e., normal, blue, or transformed, red) or the tissue of origin. Notably, classes II and V contain only transformed cells, while only one transformed cell line clusters in class VI. In most cases clustering effectively separates normal and transformed cell lines of the same histological origin: for instance, normal and transformed cell lines derived from kidney cluster to class I and III, hemopoietic normal and transformed cell lines to IV and II, colon cancer cells are in class II while normal colon in class IV, respectively (Table 4). Class I and IV contained cancer lines of several histological origin, while class II was enriched for cancer cells from colon and blood, class III for ovary and kidney and class V for lung, respectively (Table 4). These results indicate that regulation of the PKA pathway is tissue-dependent, in keeping with the pleiotropic and tissue-specific phenotypes regulated by intracellular cAMP. They also suggest that transformation transcriptionally remodulates the PKA pathway, so that in most cases expression profiling of genes encoding proteins of the cAMP-PKA pathway is quite different in cancer cells as compared to their normal counterparts. Interestingly in class IV, which comprises all the colon and hemopoietic normal samples, we observe strong expression of few genes (AKAP9-11; PDE4D; PRKCB and PRKAR2B; CREB1- colon sample- and AKAP9-11; PDE4B and PDE8A; PRKCB and PRKAR1A; CREB1 and CREM- hemopoietic sample-) as compared to their transformed counterparts, in which the same genes appeared poorly expressed (class II). In human colon carcinoma cells it has been reported that PRKAR2B overexpression suppresses neoplastic cell growth [33], consistently with the notion that abnormal expression of isoforms of PKA regulatory subunits may be involved in neoplastic transformation. Moreover in several models of hemopoietic malignancies, it has been shown that induction of cAMP/PKA pathway stimulates leukemia cell differentiation (event associated to the relapse of the disease) or lymphoma cells apoptosis [34, 35].

Figure 2
figure 2

Hierarchical clustering of the 41 PKA pathway-encoding genes analyzed in this paper. Two-way (gene, column and cell line, row) hierarchical clustering (see Methods) of the same profiles analyzed in Figure 1. Normalized expression is colour-coded from green (poor expression) to red (strong expression). The name of each gene is colour-coded according to family to which it belongs.

The 6 main classes described in the text (red lines on the top of the dendrogram and roman number bottom of the dendrogram) are shown. The distance function is based on Pearson correlation and complete linkage clustering. Legends for expression, condition, gene family and tissue of origin are shown on the right of the dendrogram.

Table 4 Correlation between PKA related gene patterns and tissues

Analysis of mutational status of the NCI60 cell lines and correlation with tissue-specific PKA pathway gene regulation

In the previous paragraph we have shown that a different and a tissue-specific pattern of expression of the PKA pathway encoding genes between normal and transformed samples does exist. Moreover, we observed that a similar pattern is common to different tissues, both in normal and transformed samples. While in normal tissues such a finding may be justified by a common histological origin or by the PKA pathway regulating a common intracellular process (i.e. differentiation, metabolism), in transformed samples, in which the correct regulation of the PKA pathway is lost, such similar gene regulation can suggest a transformation or a mutation-dependent gene regulation.

For this aim, we determined the mutation status of the NCI-60 panel of human cancer cell lines, identified the pathway in which such mutations were involved and correlated the mutation status and pathway altered in the transformed cells with transcriptional profiling data. The 60 cell lines were sorted according to mutational status, using the information provided by Catalogue Of Somatic Mutations In Cancer (, and divided into four groups based on the carried mutation as follows (Table 5):

  1. 1)

    Cell lines carrying mutations able to interfere with the Ras pathway (i.e., mutations in genes encoding Ras, B-Raf, ERBB2, PDGFRA, referred to as Ras), 29 cell lines;

  2. 2)

    Cell lines carrying mutations able to interfere with PI3K-Akt pathway (i.e., mutations in genes encoding PI3KCA, PTEN and Lkb1, referred to as PI3K), 13 cell lines.

  3. 3)

    Cell lines carrying no somatic mutations interfering with the two above pathways (i.e., mutations in genes encoding CDKN2A, p53, referred to as Other Mutation), 14 cell lines;

  4. 4)

    Cell lines for which the presence of somatic mutations interfering with the above pathways has not been searched, referred to as Not Tested), 4 cell lines.

Table 5 NCI60 cell lines with predicted active pathways by mutational analysis

To assess overall data quality and visualize relations and differences between the aforementioned transformed and normal samples, we applied dimensional reduction through principal component analysis (PCA). A three-dimensional PCA plot of all expression data (accounting for 91% of variance) is shown in Figure 3A. PC1 (x axis) effectively separates the normal group from the four groups of transformed cells. PC2 (y axis) effectively separates the Ras group from the others, while PC3 (z axis) best separates the Other Mutation group from the others. Overall, the Ras group appeared to segregate the most from the other groups.

Figure 3
figure 3

Identification of differentially regulated genes in normal and transformed samples. (A) Samples were sorted in five groups according to mutational activation: green, normal; yellow, Ras; red, PI3K; blue, Other Mutation; cyan, Not Tested. Principal Component Analysis (PCA) performed on 41 PKA pathway-encoding genes for normal samples and the four classes of mutation-dependent samples. Each sphere represents the comparative averaging of the 41 genes for each pathway identified by mutational analysis. (B) For each of the 5 groups described in (A), the 41 PKA-encoding genes were clustered, relative to their level of expression, in three subgroups: Strong (>1, red), Average (=1, black) and Low (<1, green). (C) Gene list according to expression level and mutational group of the three subgroups previously indicated, divided for each sample. Color-coding is as follows: blue, common between normal and at least one transformed sample; yellow, specific for normal samples; grey, specific for transformed samples. Percentage of regulated genes for each subgroup is shown at the bottom. (D) ANOVA analysis to evaluate the statistical significance of the differences between the five classes of samples described in (A). The right inset shows p-value of the pair-wise comparisons. Statistically significant differences are indicated in red. IQR: Interquartile Range. Outliers are also shown.

In Figure 3B, the 41 genes encoding proteins involved in the PKA pathway were sorted according to their relative level of expression and color-coded in the graph according to expression: strong (red, value >1), average (black, value=1) and low (green, level <1). These three series were crossed with the groups described above, namely Normal, Ras, PI3K, Other Mutation and Not Tested. In Normal tissues, expression of 83% of the genes was classified as Strong, a value 2-3 fold higher than those observed in the different transformed groups (27-41%). Overall, in the transformed groups, expression of most PKA pathway-encoding genes was classified as Average or Low, with the exception of the Ras group, in which only one gene was scored as low.

Expression of PKA pathway-encoding genes was further classified as follows (Figure 3C): genes with similar level of expression between normal and at least one transformed group (blue color), genes whose expression level is different between the normal and transformed groups (yellow color) and genes with similar expression level among the different transformed groups (grey color). Such a classification allowed us to pinpoint genes, such as ADCY2 and AKAP13 whose expression is strong only in the Normal group. More interestingly, expression of a few genes, such as ADCY3 and AKAP8 was strong only in members of the transformed groups, despite overall reduction in expression of the PKA pathway-encoding genes observed in transformed samples.

These results were further confirmed by pair-wise ANOVA analyses (Figure 3D), in which the distribution of expression values of genes encoding proteins of the cAMP/PKA pathway were found to be statistically different between normal and each group of transformed cells (p-value between 0.0001 and 0.0003). Notably, the difference in distribution between the Ras group and the PI3K and Other Mutation groups was also statistically significant, unlike the difference with the Not Tested group. This suggests that cells in this group may be biased for mutations within genes encoding proteins of the Ras pathway.

To reveal gene expression changes relate to mutation status of the 60 cell lines, and better interpret the results of PCA and ANOVA, a hierarchical clustering was performed. The resulting dendrogram is shown in Figure 4, in which each cell line is color-coded at the bottom according to its tissue of origin -row labeled tissue-, mutated gene -row labeled mutation-, inferred pathway activated by mutation -row labeled pathway-. A robust association between the transcriptional profiles and mutations in the Ras pathway was observed (indicated as Ras, red color). Two cell lines of the Not Tested group were interdispersed within the Ras group, indicating that these two lines are most likely responsible for the lack of statistical difference between the Ras and the Not Tested group (see above). Comparison of the Tissue and Pathway categories indicated that within the two Ras sub-clusters, some tissue-specificity is conserved. Indeed, the left cluster, comprising a total of 18 cell lines, was characterized by 6 colon cancers and 6 leukemias of which 5 on 6 were mutated in Ras pathway. Similarly the right cluster, comprising a total of 19 cell lines, was characterized by 8 melanomas and 5 lung cancers of which 7 on 8 were mutated in Ras pathway for melanoma and 4 on 5 for lung cancer. The other sub-clusters, comprising all the remaining cell lines and the other three groups of mutations and consequently of pathways, were more dispersed along the clustergram. Together, these results indicate that transformation events modulate transcriptional regulation of genes encoding proteins of the PKA pathway and that mutational activation of the Ras pathway originates a distinguishable signature, in comparison with mutational activation of the other genes studied in this report. Such a distinguishable signature is particularly noticeable in melanoma cells, in which strong expression of a gene set encoding a complete functional PKA pathway module (ADCY3; PDE4B, PDE4D and PDE8A; AKAP12; PRKAR1A and PRKAR2B; PRKACB; CREM) is observed, suggesting a deregulated cAMP signaling. Moreover, analysis of expression values for PRKAR1A and PRKAR2B genes indicated the presence in melanoma cells of a high R1/R2 ratio, that has been associated to melanocyte proliferation [36].

Figure 4
figure 4

Hierarchical clustering of the 41 PKA pathway-encoding genes in transformed samples. Two-way (gene, column and cell line, row) hierarchical clustering (see Methods) of the profiles from the NCI60 collection only. Normalized expression is colour-coded from green (poor expression) to red (strong expression). The distance function is based on Pearson correlation and complete linkage clustering. The name of each gene is colour-coded according to the family to which it belongs. Legends for expression, condition, gene family and tissue of origin are shown on the right of the dendrogram. The data have been organized on the basis of the tissue of origin of the cancer (Tissue), the specific oncogenic mutations identified in each cell line (Mutation), the putative altered pathway by the specific mutations (Pathway) and the gene family.

Promoter analysis: finding correlation between oncogenic pathway, transcriptional profiles and promoter regulation

Genes involved in the same pathway or transcriptionally co-regulated are likely to share similar promoter features. To test this hypothesis in our model, the 15 groups previously established (see Figure 3), containing coregulated genes for each group, were used for promoter identification and analysis. Using a series of biocomputing procedures and statistical processes (see Methods and the Figure), we identified Transcription Factor Binding Sites (TFBSs) conserved within the promoters (operationally defined as regions spanning 500 nt upstream and 100 nt downstream from the transcription start site) of the 41 PKA pathway-encoding genes. Genes were sorted in the 15 groups indicated in Figure 3B and 3C, and each group separately analyzed. In this first analysis (Fig. 5A), each TFBS was scored as either absent or present, regardless of the number of copies present within a given promoter. This analysis permitted the identification of 30 TFBSs enriched in the promoters of the 41 PKA pathway-encoding genes whose frequency of occurrence i.e., the ratio between the promoters that contained the specified motif (S) and the 41 promoters in our collection (T) was compared with the frequency of occurrence within vertebrate genomes (computed using the promoter Library Matrix Family of vertebrates that comprises 260.000 vertebrate promoters). Statistical analysis indicated that of these 30 TFBSs, 7 were over-represented (red color) and 9 under-represented (green color). The remaining showed the same frequency of occurrence found in the whole vertebrate genome collection.

Figure 5
figure 5

TFBS identification by using the enrichment as parameter. (A) The panel shows for each TFBS, recognized as relevant (present in ≥ 70% of the promoters of 41 PKA pathway-encoding genes) the percentage of promoters in our collection that contain the motif as compared to Matrix Family Library on vertebrates. This percentage has been calculated by dividing the total number of promoters containing the motif (S) by the total number of promoters (T). Color-coding scheme on the right of the panel. (B) Schematic representation of the TFBSs (color-coded as shown on the right of the panel) identified in the promoters of the 15 subgroups described in the text and in Figure 3. Each cartoon represents the promoter structure resulting from the average of the TFBS identified in ≥ 70% of the gene promoters for each subgroup. The asterisks on the bottom of the cartoon indicate the over-represented TFBS, as scored in panel A, for all the 41 PKA pathway-encoding genes.

A consensus representation for the promoter structure of each subgroup of coregulated genes was drawn by taking into account the 30 TFBSs present in at least 70% of the genes within each subgroup (Figure 5B). Surprisingly, the vast majority of these consensus promoters (13 out of 15) showed a common module (upper part, module), comprising 4 TFBSs: ETSF, MAZF, ZBPF and EGRF, 3 of which are over-represented in our collection (over-represented motifs are indicated by an asterisk at the bottom of the figure). This strongly suggests a functional implication of these TFBSs in expression of PKA pathway-encoding genes. Other interesting features indicated by this analysis include the identification of binding sites for PAX6 (indicated by red P) and ZF5F and NKXH (indicated by red Z and N respectively) only in consensus promoters of some genes within the normal or transformed group, respectively.

Another feature that may be critical in the identification of enriched elements is the number of copies of a given TFBS within a promoter. In fact, it has been documented that the presence of multiple copies of cis-elements in promoters, particularly when clustered, makes transcriptional activation stronger [31, 32]. For this reason, total number and frequency (number of each TFBS/promoter) of the 30 TFBSs previously identified, was scored within each of the 15 subgroups and classified by hierarchical clustering (Figure 6A and 6B, respectively). Analysis using both criteria confirmed the results reported in Figure 5: the presence in promoters of all subgroups of a TFBS module comprising ETSF, MAZF, ZBPF and EGRF. Clustering according to Regulation in Figure 6A show that all promoters of genes characterized by low expression transformed samples cluster together (class II). Promoters belonging to genes with strong expression in the Ras group cluster in a completely independent arm (lower part of the dendrogram), opposite to where cluster promoters belonging to genes with strong expression in the Normal group (class I). Additionally, clustering by frequency highlighted the specific enrichment of EKLF in genes with low expression. Clustering according to both criteria indicated that Normal samples clustered in a different way as compared to transformed samples (upper part of the dendrogram) and that the PI3K, Other Mutation and Not Tested samples were more interspersed along the dendrogram and confirmed that the Ras category showed a different promoter composition as compared to other categories, in keeping with the PCA analysis presented in Figure 3.

Figure 6
figure 6

Hierarchical clustering of TFBSs present in the promoters of the 41 PKA pathway-encoding genes, according to total number and frequency. Two-way (TBFS, column and expression subgroup, see Figure 3, row) hierarchical clustering of the TFBS present within the promoters of the 41 PKA pathway-encoding genes. Clustering was run according to the total number of TBFS present in each group (panel A) or to the frequency, i.e. the total number of a given TBFS divided by the number of promoters (panel (B). The color-coding scale is shown at the top of each panel. The distance function is based on Pearson correlation and complete linkage clustering. The two classes, corresponding to the main arms of the dendrogram, derived from clustering according to "Condition" are shown on the right of each dendrogram.

Data mining for PKA pathway-related gene promoters

As previously described, computational analysis of our promoter collection, permitted the identification of some TFBS that are able to characterize in a specific manner normal and transformed samples. To confirm some of our computational results, we interrogated several databases and searched in the literature for studies on promoter structure of PKA pathway-encoding genes. Experimental studies, using one or more molecular approaches including EMSA, Chromatin Immunoprecipitation and transactivation assay, have been found for 16 PKA pathway-encoding genes: PRKAR1A, PRKAR1B, PRKAR2B, PRKACA, AKAP1, AKAP8, AKAP9, AKAP10, AKAP12, ADCY8, ADCY9, PDE4B, PDE4C, PDE4D, CREB and CREM. This subset of genes was re-analyzed as described above and the obtained results were compared with literature data (Table 6). In total, 36 TFBSs have been experimentally identified: 20 of these (i.e. 55%) have been predicted by our computational approach and for two genes alone (AKAP9 and PRKAKA), none of the experimentally identified sites was identified by the computational approach that overall identified a much higher number of sites compared to those retrieved from literature. The biological significance of the presence of the identified TFBS and of their relationship with oncogenic mutations, notably in the Ras pathway, is proposed below.

Table 6 Comparison between computational data and literature data

PKA type I regulatory subunit A (PRKAR1A) expression has been studied in different cellular models by analyzing its mRNA expression and by using its putative promoter region. In its promoter, binding sites for activator protein-1 and 2 (AP-1 and AP-2) and Sp1 [37] have been identified. Moreover, a more recent work showed a direct activity of FOX family (FOXC2, D1 and D2) transcriptional factors members in the regulation of PRKAR1A expression both at transcriptional and at post-transcriptional levels [38, 39].

The promoter of PRKAR1B has been identified and studied in human and mouse: binding sites for Jun and p53 (human) and Oct-1, Egr1 and Pax1 (mouse) have been found. These binding sites have been experimentally verified by Electrophoretic Mobility Shift Assay, functional analysis and Northern blot [40, 41].

PRKAR2B promoter has been studied in particular in Sertoli cells (human). Some reports identified binding sites for Sp1, NF-1, Myc, C/EBPbeta, able to induce the PRKAR2B promoter, USF1 and USF2. Interestingly, overexpression of USF2, but not USF1, led to inhibition of both cAMP- and C/EBPbeta-mediated induction of PRKAR2B [4244].

The promoter of Protein kinase, cAMP-dependent, catalytic, alpha (PRKACA) has been identified both in humans and mouse, but little information has been produced for human promoter. Indeed, one paper describes the presence of binding sites for USF1 and USF2 transcription factors [45].

AKAP1, AKAP9 and AKAP10 promoters contain binding sites for c-Myc as shown by computational analysis and ChIP experiments in several human cell lines [46, 47]. Moreover, a single study indicates the presence in the promoter of AKAP12 of binding sites for Serum Response Factor transcriptional factors [48] and more recently for Myc.

ADCY9 promoter contains binding sites for c-Myc as shown by an experimental approach [49].

Several promoters of genes encoding phosphodiesterase proteins have been isolated and to some extent studied. All the studies have been performed on sequences of human promoters and in particular the PDE4B, PDE4C (both present in our collection of PKA pathway related genes), and PDE5A, PDE6A, PDE6B and PDE7A promoters (not present in our gene list) have been better characterized. In the PDE4B promoter, binding sites for CREB have been found [50]. In PDE4C promoter, binding sites for Myc have been found [47]. In the PDE5A promoter, binding sites for Jun and AP-2 have been found [51, 52]; in PDE6A and PDE6B promoters, binding sites for Sp1 [53] and Sp4 [54, 55] respectively and in PDE7A promoter, Ets2 and NFkB1 binding sites [56].

The cyclic AMP response element (CRE)-binding protein CREB promoter has been identified in human, mouse and rat. Analysis done on human promoter, experimentally confirmed, identified binding site for c-Myc [57] and Sp1 [58]. Further information about such promoter has been produced in mouse and rat cells which allowed the identification of binding site for NfkB [57].

An important regulative mechanism of the PKA pathway is feedback control. Indeed as well as the cAMP produced by Adenylyl Cyclases, activate PKA kinase activity, PKA is able to inhibit the pathway, activating by phosphorylation the Phosphodiesterases, which ultimately induce hydrolysis of cAMP switching off the pathway. Moreover a huge amount of data has been published regarding the ability of PKA to activate specific transcription factors by phosphorylation: cyclic AMP response element (CRE)-binding protein CREB, the cAMP response element modulator (CREM), the activating transcription factor 1 (ATF-1) and a repressor, ICER (inducible cAMP early repressor) [59] that, to a certain extent, has been shown to regulate PKA pathway-related genes transcription. Some of the promoters, already discussed above, have been shown to have CRE binding sites. Moreover, two interesting recent publications, have identified and characterized in different cellular contexts and by several approaches, through a genome-wide approach, target genes that are regulated by CREB [60, 61]. The authors have identified and proved by ChiP analysis (PRKAR1A, PDE7B) the presence of CRE site in PRKAR1A, in PDE7B, AKAP8, PDE4C and ADCY8. In the latter case they did not observe binding by Chip analysis, but another report has shown that its activation is mediated specifically via the canonical CRE site [62]. Binding sites for CREB1 have been found in PDE7A [56], PDE4D [63], CREM [64] and experimentally confirmed. Moreover analysis of the promoter of CREB gene showed the presence of several CRE binding sites [65, 66].

Most of AP-1 (i.e. Jun), AP-2 and Sp1 transcription factors are involved in growth-related signal transduction pathways, among which Ras is a main actor, and their over-expression can have positive or negative effects on proliferation [6771]. Indeed Sp family has been shown to be regulated by post-translational mechanisms by Ras pathway [72, 73] as well as Ets1 and Ets2 [74, 75] and NFkB [7678].

Egr-1 is an early responsive gene linked to mitogenic stimulation directly regulated by MAPK pathway [7982]. Moreover for Myc [83, 84], C/EBPbeta [85] and NF-1 [86] a large amount of data about their correlation with Ras pathway has been reported. Each of these transcriptional factors has been associated with several cellular responses (proliferation, survival, apoptosis) and transformation as is the case of the PKA pathway as well. Therefore it is possible that mitogenic signal through Ras and the regulation of such transcription factors, modulates the expression of PKA pathway related genes.

An important role, in the activation of the CREB family transcription factors, is played by stimuli which are able to induce their phosphorylation and consequently their activation. In fact as reviewed in [65] not only the protein kinase A is involved in this function but also several growth factors (NGF, FGF, IGF-I, PDGF, EGF), survival signals and hypoxia that often activate the Ras pathway, pointing to an essential role of the latter pathway also in gene transcriptional regulation of PKA pathway-encoding genes by transcription factors of the CREB family.


By using a generalized workflow for data recovery and integration that combines accurate analysis of recovered data from several databases with the application of different statistical tests we have been able to correlate strong transcriptional repression of genes encoding proteins of the cAMP/PKA pathway in transformed samples of different genetic origin (i.e., bearing mutations in different pathways). This finding prompted us to compute consensus promoters, whose composition was specifically enriched for different transcription factor binding sites (TBFS). Comparison of TFBS computationally identified in the consensus promoters with TBFS experimentally identified by a variety of techniques, shows a good agreement. Indeed, by lowering the stringency used in the workflow, some of the TFBS missed by higher stringency analysis (false negatives) were recovered, in keeping with the notion that intersection of different data sets and/or techniques decreases both noise and the number of hits.

The workflow we have followed is summarized in Figure 7 and detailed in Methods section. As the number of sites hosting curated transcriptional profiles increases, more and more data to be used as starting point become available. We used the GEO database to recover data from the NCI60 cell collection (cancer samples) and matching normal tissues and to which specific statistical tests (i.e. ANOVA, Hierarchical clustering) were applied. By using the COSMIC database, which gives information about the mutational status of the NCI60 collection, we could sort the NCI60 cell lines in 4 subgroups with mutational activation of genes encoding components of the Ras pathway, of the PI3K pathway, of other pathways or for which no information was available. Such a sorting allowed us to uncover an hitherto unrecognized oncogene-dependent pattern of regulation of 41 genes encoding components of the cAMP/PKA pathway (Figure 7B and 7C). The transcriptional profiles for transformed cells within one of the identified subgroups may then be used as a new query to GEO database (green arrow), in order to correlate and confirm, i.e. in cancer tissues, the oncogene-dependent pattern identified.

Figure 7
figure 7

Flowchart of our web-based and statistical strategy used to elucidate the relation between PKA encoding genes transcriptional profiles and oncogenic mutations. (A) Flow chart of our web-based and statistical strategy with indication of some of the databases (Source) used, the type of data analyzed (Input), the specific program and statistical test (Tool) used and the result obtained (Output). (B) Graphical representation of the block diagram summarizing functional interconnections within the PKA pathway module with indication of the expression level (geometric mean) of each gene belonging to the network -Strong (red), Average (black) and Low (green)- as identified by our analysis both in normal (B, left) and transformed samples (B, right). (C) Boxplots of the expression of PKA pathway-encoding genes in normal (C, left) and transformed (C, right) samples, grouped for functional classes (ADCY: adenylyl cyclase; AKAP: A-kinase anchor protein; PDE: phosphodiesterase; PRKACR: PKA regulatory subunit; PRKAC: PKA catalytic subunit). The represented value is the median. (D) Schematic representation of the TFBSs (color-coded) identified in the promoters of PKA pathway-encoding genes of normal and transformed samples. Each cartoon represents the promoter structure resulting from the merge of the TFBS identified in ≥ 70% of the gene promoters of all normal samples and transformed samples.

Deregulation of transcriptional programs, such as that identified for PKA pathway-encoding genes, may be considered a direct consequence of a deregulated activity of transcription factors. The TRANSFAC database was used with a high stringency threshold, to identify the regulatory sequence in co-regulated genes with high confidence, improving the deduced linkages between transcription factors and the regulated genes. Using this approach, we demonstrated that in all PKA encoding genes TFBSs for ETS, MAZ, ZBP and EGR transcription factors are present (Figure 7D) and that specific subsets of TFBS are present in the normal and transformed samples. The number of TFBS identified by computational analysis was higher than those that could be retrieved from literature as experimentally determined. This observation was to some extent expected because of limited literature reference availability, complexity to retrieve data, difficulty to analyze data from several origins, and the lack of powerful data analysis and integration tools. Under these less-than-ideal conditions, a dedicated tool such as the TFBS database, can be extremely powerful, allowing predictions that are amenable to experimental verification, should this be necessary. As discussed above, most of the false-negatives that failed to be detected by our computational approaches could be recovered by appropriately lowering the stringency of analysis.

In Figure 7B transcriptional expression of PKA pathway encoding genes is color-mapped (geometric mean, Strong expression, red, Average expression, black and Low expression green) on a block diagram summarizing functional interconnections within the PKA pathway module. A general and balanced co-regulation of both positive and negative regulators of the cAMP/PKA pathway is apparent in both normal and transformed samples. Notably, in normal cells variability in expression is maximal for genes encoding the catalytic subunit of PKA. Because of the pleiotropic role of the PKA pathway (including stimulation of growth and differentiation in many cell types, such as somatotrophs, thyrocytes, melanocytes, ovarian follicular granulosa cells, keratinocytes, nervous, muscle and blood cells and adipocyte and the important role of such pathway in the regulation of the function of tissues as kidney, ovary, brain, and prostate), strong expression in normal tissues is expected [8, 87, 88]. It should also be remembered that cross-talk between the PKA pathway and oncogene-mediated pathways can also take place at post-transcriptional levels. For example, several Authors reported the ability of oncogenic and viral Ras proteins to either stimulate [8991] or inhibit [9294] ADCY activity in different cell lines (thyroid, epithelial, kidney, fibroblast). Moreover an involvement of MAPK or PI3K pathways in the regulation of PDE activity has been reported, suggesting that mitogenic stimulation may positively regulate PDE4 expression directly [95], confirming our transcriptional results, or by post-translational mechanisms in which p42(MAPK) phosphorylation activity has a relevant role in their regulation [96]. Another important post-transcriptional mechanism that links Ras or PI3K pathways to cAMP/PKA pathway is the positive and negative control of CREB activity by a phosphorylation [97, 98]. Moreover, it has been reported that cAMP is able to induce proliferation rather than growth inhibition, in several tumors where oncogenic activation of B-Raf has been identified (i.e., melanoma and thyroid cancer) [8]. Nevertheless, the general and coordinated down-regulation of essentially all genes of the pathway in transformed cells (as compared with normal tissues) suggests that at least one PKA-mediated function needs to be reduced substantially in order to express the transformed phenotype. Although at this stage it is too early to propose specific hypotheses, it is intriguing to remember that PKA has been ascribed a role in activating mitochondrial respiration and decreasing ROS production [99, 100], thus effectively counteracting mitochondria dysfunction that is found associated with increased glycolysis (Warburg effect, [101103]) in many cancer cells. On the other hand, a reduction in oxidative phosphorylation that will decrease ATP supply, as substrate of adenylate cyclase, may result in a decreased cAMP production without relevant changes in the level of the enzymes (and possibly therefore of their gene expression).

It is expected that deeper computational integration of transcriptional data with other genome-wide findings, including -but not limited to- proteomics, interactomics and metabolomics, will allow a better extraction of hidden information. We propose that such data integration can be further applied to examine the topology of biological networks, to provide information on directionality of interactions, and create wiring diagrams that better depict the functional outcome of component-component relationships. Together, these strategies should facilitate a systems approach to modular biology.

Thus systems biology can be approached by perturbing the suspected components of a given cellular process, monitoring the responses, integrating the data and modeling the biological process in question [104]. By applying a single "omic" approach, able to sample a "horizontal" slice (i.e. across all genes or gene products) of a multidimensional space, the knowledge of a system can be expanded from a single gene to a network of genes, which can be regarded as a basic model for the system. When genes or proteins in this network are systematically disrupted, responses from other parts of the network can be recorded and the data obtained can be incorporated into the basic model. However sampling a single dimension of a complex space will undoubtedly provide relevant information, but may not highlight the major regulatory features. Therefore, a wiring diagram that depicts the direction of interactions in the network and the behavior of each of these components can be constructed to better represent the relationships between the components [104]. The example shown in Figure 7 illustrates how our current knowledge of a biological system can be expanded and a model built based on integrated "omic" information. Ultimately, development of such computational methods and their recursive integration with genome-wide and hypothesis-driven experimental investigations that also take into account post-translational and substrate-dependent mechanisms controlling the cAMP/PKA pathway activity, should reconcile experiments from different experimental systems (cell cultures, animal models and human tumor samples) and contribute to explain at an integrative, systems level how the cAMP-PKA pathway is affected by oncogenic processes originated by mutational activation of signal transduction.


Data recovery and normalization

Gene expression data of NCI60 cell lines and normal tissues samples were downloaded from the Gene Expression Omnibus (GEO) at the National Center for Biotechnology Information (NCBI) website ( [105]. In particular, gene expression profiles of NCI60 cell collection (cancer samples) were recovered from GEO database (GSE5949, [31]) in which the experimental data were obtained by using the Affymetrix HG-U95Av2 oligonucleotide array platform. For the analysis only results obtained by oligonucleotide arrays were considered, because this platform uses a different method to evaluate mRNA expression as compared to cDNA array platform. Therefore, also for normal tissue samples, the data used for the comparative analysis, were recovered from transcriptional profiles produced by using U95Av2 oligonucleotide array (GSE96 [106], GSE6731 [107] and GSE1402 [108]).

A total of 81 transcriptional profiles encompassing cancer cell lines with nine histological origins and samples from six normal tissues were recovered. Further details can be found in the legends of Tables 1 and 2. All datasets were generated by downloading and processing CEL files. They were preprocessed using Robust Multichip Average (RMA) [109, 110] and then transformed from log2 values to linear scale values, and normalized per gene to the median value of its level of expression across 81 samples, as implemented in GeneSpring GX 7.3.1 (Silicon Genetics - The RMA preprocessing algorithm includes background and quantile normalization steps [109, 111]. Although background correction, as first step analysis, has been computed separately for each array, all the other procedures performed by using RMA (normalization and summarization), have been performed across all the arrays (RMA is a multiple-array method). Normalization is necessary so that multiple chips can be compared to each other, and analyzed together. The normalization procedure is aimed at making the distributions identical across arrays. RMA usually gives very accurate normalizations. Note that, RMA implemented in GeneSpring GX 7.3.1, all the arrays are used and no chip is discarded. We used RMA analysis as compared to other tools of analysis, because, as described in several papers [112, 113], it successfully reduces the variance of low abundance transcripts and better distinguishes differentially expressed transcripts from those that are unchanging, by using controlled datasets in which known quantities of specific mRNAs have been added to a common reference pool, [109, 110, 114].

Transformation-dependent, transcriptional remodeling of the PKA pathway-encoding genes in 60 human cancer cell lines (NCI60) and 21 human normal tissues

We identified and gathered the transcriptional profile for 41 genes encoding proteins involved in the PKA pathway (adenylyl cyclases -ADCY-, phosphodiesterases -PDE-, A-kinase anchor proteins -AKAP-, cAMP-dependent transcriptional factors -TF-, PKA catalytic subunits -PRKAC- and PKA regulatory subunits -PRKACR-, Table 3).

In order to identify specific variations in the expression pattern of the selected PKA pathway-related genes both in normal and transformed samples, different tools of analysis were used.

Initially, the PKA pathway related genes expression profiles, observed in transformed samples as compared to normal samples, were evaluated by analysis of variance (ANOVA). Such statistical linear modeling procedure, that partitions the total variance into parts corresponding to various sources in the model [115, 116] have been successfully used to analyze microarray data [117120]. In order to model and test the hypothesis that the expression of genes of PKA pathway was different between normal tissues and transformed cell lines, the following comparisons were used: Expression of gene i (where i=i-esimo) of Normal Tissues vs. Transformed cell lines (Figure 1), and a p-value < 0.05.

The same data-set was then analyzed through unsupervised hierarchical clustering [121] (as implemented in the GeneSpring platform). Two-way hierarchical clustering was performed on RMA-generated linear scale expression levels using the Pearson correlation coefficient as the measure of similarity and complete linkage clustering [122]. The results of this process are dendrograms, in which short branches connect very similar elements, and longer branches join elements with diminishing degrees of similarity. The vectors used were sample - normal tissues and transformed cells- and expression of genes of PKA pathway-related genes and the arms were classified by different variables: Conditions and Tissues, (Figure 2).

Analysis of mutational status of the NCI60 cell lines and correlation with tissue-specific PKA pathway gene regulation

The 60 cell lines were sorted according to mutational status, using the information provided by Catalogue Of Somatic Mutations In Cancer ( [123]. This database holds somatic mutation data and other information related to human cancer cell lines and tissues, and can be interrogated through a series of web pages to provide a graphical or tabular view of the data along with various export options. We could sort the NCI60 cell lines in 4 subgroups presenting mutational activation of genes encoding components of the Ras pathway, of the PI3K pathway, of other pathways or for which no information was available, (Table 5).

In order to identify specific variations in the expression pattern of the PKA pathway-related genes in these 4 subgroups, different tools of analysis were used.

We applied unsupervised Principal Component Analysis (PCA) [124, 125] to establish the interrelationships among the samples used in our study. PCA is a statistical method that can be used to reduce complex data sets with multiple variables into significantly smaller numbers of variables (known as components), which retain the relevant variance information used to distinguish the sample groups from another. By visualizing projections of these components in low-dimensional spaces, we were able to observe the grouping of samples reflecting underlying patterns in their gene expression profiles. PCA on the mean centered and scaling data was used to model the effects of oncogene-dependent transformation on the gene expression. The following comparisons were performed: Expression of gene i of Normal Tissues vs. PI3K mutation cell lines; vs. Ras mutation cell lines; vs. Not Tested mutation cell lines; vs. Other Mutation cell lines.

Also in this case, in order to model and test the hypothesis that the expression of genes of PKA pathway was different between normal tissues and the four subgroups previously identified, we applied one-way ANOVA by using the following comparisons: Expression of gene i (where i=i-esimo) of Normal Tissues vs. PI3K mutation cell lines; vs. Ras mutation cell lines; vs. Not Tested mutation cell lines; vs. Other Mutation cell lines, (Figure 3D), and a p-value < 0.05.

The data-set of 41 genes was then analyzed through unsupervised hierarchical clustering (Pearson correlation coefficient and complete linkage clustering). The vectors used were sample - oncogene-dependent transformed cells - and expression of genes of PKA pathway-related genes. The results of this process are dendrograms, in which the arms were classified by different variables: Tissue, mutation and Pathway (Figure 4).

Computational analysis of promoters of differentially regulated PKA pathway-encoding genes and identification of transcriptional factor binding sites

In order to identify Transcriptional Factor Binding Sites (TFBS) present in promoters of co-regulated genes, the 41 PKA pathway-encoding genes were sorted, relative to their level of expression, in three groups: Strong (>1), Average (=1) and Low (<1), where 1 is the expression value calculated by RMA. Each groups was identified in each sample group, i.e., Normal Tissues, cell lines carrying mutation(s) in Ras pathway-encoding genes, cell lines carrying mutation(s) in PI3K pathway-encoding genes, cell lines carrying mutation(s) in other pathways, cell lines Not Tested for mutation, thus generating 15 subgroups. A TFBS was called present only when present in more than 70 % of promoters within each group.

Proximal promoter regions - defined as 500 nt upstream and 100 nt downstream from the transcription start site (TSS), automatically assigned to genes on the basis of 5' cap-site databases integrated into promoter identification program - were identified using Eldorado (gems launcher, Genomatix [126]) and the Genomatix Promoter Database.

TFBS in the promoter regions were identified by using ModelInspector and the Genomatix Promoter Database, comprising a total of 519 matrices from 154 families (Matrix Family Library, on Vertebrates, Version 7.1, June 2008). The Matrix Family Library is based on 260,000 human, mouse, and rat promoter sequences, with an average length of 650bp. Analysis on the 41 PKA pathway-encoding genes was performed with a threshold of 1.0 for the core similarity -that is reached only when the highest conserved bases of a matrix match exactly in the sequence- and a value of 0.85 for the Optimized matrix threshold [127]. Optimized matrix threshold is the optimized value defined in a way that a minimum number of matches is found in non-regulatory test sequences. This value, when is higher than 0.80, permits the reduction of false positive matches.

The total number and frequency (i.e., the ratio between the total number of TBFS and the number of promoters present within each subgroup) of each TFBS within each subgroup were calculated. The frequency of each TFBS called present in each of the 15 subgroups of PKA pathway-encoding genes was compared with the frequency of the same TFBS within the Matrix Family Library on Vertebrates. TFBS enrichment was scored based on p-value generated by hypergeometric distribution and calculated with the 2-tailed Fisher's exact test, implemented through the use of a 2 x 2 contingency table (Figure 5).

In order to identify differences between the 15 groups a two-way hierarchical clustering (by using as vectors sample and TFBS) was applied by using the total number values and the frequency values of each TFBS identified in ≥ 70% of the promoters in each group. The total number value was transformed in the log2 and used in the hierarchical clustering by using the Pearson correlation coefficient as the measure of similarity and complete linkage clustering (Figure 6).

Promoter data mining

To identify known transcription factor binding sites in the promoter sequences of PKA pathway-encoding genes, the annotated promoter and associated information have been retrieved from Transcriptional Regulatory Element Database (TRED) [128] ( and from NCBI ( Both web sites are freely accessible. The results have been shown in the Table 6.


  1. 1.

    Alberghina L, Chiaradonna F, Vanoni M: Systems biology and the molecular circuits of cancer. Chembiochem 2004, 5(10):1322–1333. 10.1002/cbic.200400170

    CAS  PubMed  Google Scholar 

  2. 2.

    Li H, Xuan J, Wang Y, Zhan M: Inferring regulatory networks. Front Biosci 2008, 13: 263–275. 10.2741/2677

    CAS  PubMed  Google Scholar 

  3. 3.

    Zhu X, Gerstein M, Snyder M: Getting connected: analysis and principles of biological networks. Genes Dev 2007, 21(9):1010–1024. 10.1101/gad.1528707

    CAS  PubMed  Google Scholar 

  4. 4.

    Srinivasan BS, Shah NH, Flannick JA, Abeliuk E, Novak AF, Batzoglou S: Current progress in network research: toward reference networks for key model organisms. Briefings in bioinformatics 2007, 8(5):318–332. 10.1093/bib/bbm038

    CAS  PubMed  Google Scholar 

  5. 5.

    Kwoh CK, Ng PY: Network analysis approach for biology. Cell Mol Life Sci 2007, 64(14):1739–1751. 10.1007/s00018-007-7053-7

    CAS  PubMed  Google Scholar 

  6. 6.

    Philippi S, Kohler J: Addressing the problems with life-science databases for traditional uses and systems biology. Nature reviews 2006, 7(6):482–488. 10.1038/nrg1872

    CAS  PubMed  Google Scholar 

  7. 7.

    Romano P, Kracht M, Manniello MA, Stegehuis G, Fritze D: The role of informatics in the coordinated management of biological resources collections. Applied bioinformatics 2005, 4(3):175–186.

    CAS  PubMed  Google Scholar 

  8. 8.

    Dumaz N, Marais R: Integrating signals between cAMP and the RAS/RAF/MEK/ERK signalling pathways. Based on the anniversary prize of the Gesellschaft fur Biochemie und Molekularbiologie Lecture delivered on 5 July 2003 at the Special FEBS Meeting in Brussels. Febs J 2005, 272(14):3491–3504. 10.1111/j.1742-4658.2005.04763.x

    CAS  PubMed  Google Scholar 

  9. 9.

    Mei FC, Qiao J, Tsygankova OM, Meinkoth JL, Quilliam LA, Cheng X: Differential signaling of cyclic AMP: opposing effects of exchange protein directly activated by cyclic AMP and cAMP-dependent protein kinase on protein kinase B activation. J Biol Chem 2002, 277(13):11497–11504. 10.1074/jbc.M110856200

    CAS  PubMed  Google Scholar 

  10. 10.

    Stork PJ, Schmitt JM: Crosstalk between cAMP and MAP kinase signaling in the regulation of cell proliferation. Trends Cell Biol 2002, 12(6):258–266. 10.1016/S0962-8924(02)02294-8

    CAS  PubMed  Google Scholar 

  11. 11.

    Bos JL: The ras gene family and human carcinogenesis. Mutat Res 1988, 195(3):255–271.

    CAS  PubMed  Google Scholar 

  12. 12.

    Cho-Chung YS, Nesterova M, Becker KG, Srivastava R, Park YG, Lee YN, Cho YS, Kim MK, Neary C, Cheadle C: Dissecting the circuitry of protein kinase A and cAMP signaling in cancer genesis: antisense, microarray, gene overexpression, and transcription factor decoy. Ann N Y Acad Sci 2002, 968: 22–36. 10.1111/j.1749-6632.2002.tb04324.x

    CAS  PubMed  Google Scholar 

  13. 13.

    Malumbres M, Barbacid M: RAS oncogenes: the first 30 years. Nat Rev Cancer 2003, 3(6):459–465. 10.1038/nrc1097

    CAS  PubMed  Google Scholar 

  14. 14.

    Siu YT, Jin DY: CREB--a real culprit in oncogenesis. Febs J 2007, 274(13):3224–3232. 10.1111/j.1742-4658.2007.05884.x

    CAS  PubMed  Google Scholar 

  15. 15.

    Treisman R: Regulation of transcription by MAP kinase cascades. Curr Opin Cell Biol 1996, 8(2):205–215. 10.1016/S0955-0674(96)80067-6

    CAS  PubMed  Google Scholar 

  16. 16.

    Bader AG, Kang S, Zhao L, Vogt PK: Oncogenic PI3K deregulates transcription and translation. Nat Rev Cancer 2005, 5(12):921–929. 10.1038/nrc1753

    CAS  PubMed  Google Scholar 

  17. 17.

    Downward J: Signal transduction. Prelude to an anniversary for the RAS oncogene. Science 2006, 314(5798):433–434. 10.1126/science.1134727

    CAS  PubMed  Google Scholar 

  18. 18.

    Downward J: Targeting RAS signalling pathways in cancer therapy. Nat Rev Cancer 2003, 3(1):11–22. 10.1038/nrc969

    CAS  PubMed  Google Scholar 

  19. 19.

    Marais R, Light Y, Mason C, Paterson H, Olson MF, Marshall CJ: Requirement of Ras-GTP-Raf complexes for activation of Raf-1 by protein kinase C. Science 1998, 280(5360):109–112. 10.1126/science.280.5360.109

    CAS  PubMed  Google Scholar 

  20. 20.

    Rodriguez-Viciana P, Marte BM, Warne PH, Downward J: Phosphatidylinositol 3' kinase: one of the effectors of Ras. Philos Trans R Soc Lond B Biol Sci 1996, 351(1336):225–231. discussion 231–222. discussion 231-222. 10.1098/rstb.1996.0020

    CAS  PubMed  Google Scholar 

  21. 21.

    McKay MM, Morrison DK: Integrating signals from RTKs to ERK/MAPK. Oncogene 2007, 26(22):3113–3121. 10.1038/sj.onc.1210394

    CAS  PubMed  Google Scholar 

  22. 22.

    Anderson RA, Boronenkov IV, Doughman SD, Kunz J, Loijens JC: Phosphatidylinositol phosphate kinases, a multifaceted family of signaling enzymes. J Biol Chem 1999, 274(15):9907–9910. 10.1074/jbc.274.15.9907

    CAS  PubMed  Google Scholar 

  23. 23.

    Chang F, Lee JT, Navolanic PM, Steelman LS, Shelton JG, Blalock WL, Franklin RA, McCubrey JA: Involvement of PI3K/Akt pathway in cell cycle progression, apoptosis, and neoplastic transformation: a target for cancer chemotherapy. Leukemia 2003, 17(3):590–603. 10.1038/sj.leu.2402824

    CAS  PubMed  Google Scholar 

  24. 24.

    Davis RJ: Transcriptional regulation by MAP kinases. Mol Reprod Dev 1995, 42(4):459–467. 10.1002/mrd.1080420414

    CAS  PubMed  Google Scholar 

  25. 25.

    Shaul YD, Seger R: The MEK/ERK cascade: From signaling specificity to diverse functions. Biochim Biophys Acta 2007, 1773(8):1213–1226. 10.1016/j.bbamcr.2006.10.005

    CAS  PubMed  Google Scholar 

  26. 26.

    Hazzalin CA, Mahadevan LC: MAPK-regulated transcription: a continuously variable gene switch? Nat Rev Mol Cell Biol 2002, 3(1):30–40. 10.1038/nrm715

    CAS  PubMed  Google Scholar 

  27. 27.

    Chang F, Steelman LS, Lee JT, Shelton JG, Navolanic PM, Blalock WL, Franklin RA, McCubrey JA: Signal transduction mediated by the Ras/Raf/MEK/ERK pathway from cytokine receptors to transcription factors: potential targeting for therapeutic intervention. Leukemia 2003, 17(7):1263–1293. 10.1038/sj.leu.2402945

    CAS  PubMed  Google Scholar 

  28. 28.

    Oda K, Matsuoka Y, Funahashi A, Kitano H: A comprehensive pathway map of epidermal growth factor receptor signaling. Mol Syst Biol 2005, 1: 2005 0010. 10.1038/msb4100014

    PubMed Central  PubMed  Google Scholar 

  29. 29.

    Lander ES: Array of hope. Nat Genet 1999, 21(1 Suppl):3–4. 10.1038/4427

    CAS  PubMed  Google Scholar 

  30. 30.

    Scherf U, Ross DT, Waltham M, Smith LH, Lee JK, Tanabe L, Kohn KW, Reinhold WC, Myers TG, Andrews DT, et al.: A gene expression database for the molecular pharmacology of cancer. Nat Genet 2000, 24(3):236–244. 10.1038/73439

    CAS  PubMed  Google Scholar 

  31. 31.

    Wang H, Huang S, Shou J, Su EW, Onyia JE, Liao B, Li S: Comparative analysis and integrative classification of NCI60 cell lines and primary tumors using gene expression profiling data. BMC Genomics 2006, 7: 166. 10.1186/1471-2164-7-166

    PubMed Central  PubMed  Google Scholar 

  32. 32.

    Ross DT, Scherf U, Eisen MB, Perou CM, Rees C, Spellman P, Iyer V, Jeffrey SS, Van de Rijn M, Waltham M, et al.: Systematic variation in gene expression patterns in human cancer cell lines. Nat Genet 2000, 24(3):227–235. 10.1038/73432

    CAS  PubMed  Google Scholar 

  33. 33.

    Nesterova M, Yokozaki H, McDuffie E, Cho-Chung YS: Overexpression of RII beta regulatory subunit of protein kinase A in human colon carcinoma cell induces growth arrest and phenotypic changes that are abolished by site-directed mutation of RII beta. Eur J Biochem 1996, 235(3):486–494. 10.1111/j.1432-1033.1996.00486.x

    CAS  PubMed  Google Scholar 

  34. 34.

    Guillemin MC, Raffoux E, Vitoux D, Kogan S, Soilihi H, Lallemand-Breitenbach V, Zhu J, Janin A, Daniel MT, Gourmel B, et al.: In vivo activation of cAMP signaling induces growth arrest and differentiation in acute promyelocytic leukemia. J Exp Med 2002, 196(10):1373–1380. 10.1084/jem.20021129

    PubMed Central  CAS  PubMed  Google Scholar 

  35. 35.

    Lerner A, Kim DH, Lee R: The cAMP signaling pathway as a therapeutic target in lymphoid malignancies. Leuk Lymphoma 2000, 37(1–2):39–51.

    CAS  PubMed  Google Scholar 

  36. 36.

    Mantovani G, Bondioni S, Lania AG, Rodolfo M, Peverelli E, Polentarutti N, Veliz Rodriguez T, Ferrero S, Bosari S, Beck-Peccoz P, et al.: High expression of PKA regulatory subunit 1A protein is related to proliferation of human melanoma cells. Oncogene 2008, 27(13):1834–1843. 10.1038/sj.onc.1210831

    CAS  PubMed  Google Scholar 

  37. 37.

    Solberg R, Sandberg M, Natarajan V, Torjesen PA, Hansson V, Jahnsen T, Tasken K: The human gene for the regulatory subunit RI alpha of cyclic adenosine 3', 5'-monophosphate-dependent protein kinase: two distinct promoters provide differential regulation of alternately spliced messenger ribonucleic acids. Endocrinology 1997, 138(1):169–181. 10.1210/en.138.1.169

    CAS  PubMed  Google Scholar 

  38. 38.

    Dahle MK, Knutsen HK, Tasken KA, Pilz R, Tasken K: Cyclic AMP regulates expression of the RI alpha subunit of cAMP-dependent protein kinase through an alternatively spliced 5' UTR. Eur J Biochem 2001, 268(22):5920–5929. 10.1046/j.0014-2956.2001.02542.x

    CAS  PubMed  Google Scholar 

  39. 39.

    Dahle MK, Gronning LM, Cederberg A, Blomhoff HK, Miura N, Enerback S, Tasken KA, Tasken K: Mechanisms of FOXC2- and FOXD1-mediated regulation of the RI alpha subunit of cAMP-dependent protein kinase include release of transcriptional repression and activation by protein kinase B alpha and cAMP. J Biol Chem 2002, 277(25):22902–22908. 10.1074/jbc.M200131200

    CAS  PubMed  Google Scholar 

  40. 40.

    Clegg CH, Haugen HS, Boring LF: Promoter sequences in the RI beta subunit gene of cAMP-dependent protein kinase required for transgene expression in mouse brain. J Biol Chem 1996, 271(3):1638–1644. 10.1074/jbc.271.3.1638

    CAS  PubMed  Google Scholar 

  41. 41.

    Clegg CH, Koeiman NR, Jenkins NA, Gilbert DJ, Copeland NG, Neubauer MG: Structural features of the murine gene encoding the RI beta subunit of cAMP-dependent protein kinase. Mol Cell Neurosci 1994, 5(2):153–164. 10.1006/mcne.1994.1017

    CAS  PubMed  Google Scholar 

  42. 42.

    Dahle MK, Tasken K, Tasken KA: USF2 inhibits C/EBP-mediated transcriptional regulation of the RIIbeta subunit of cAMP-dependent protein kinase. BMC Mol Biol 2002, 3: 10. 10.1186/1471-2199-3-10

    PubMed Central  PubMed  Google Scholar 

  43. 43.

    Knutsen HK, Tasken K, Eskild W, Richards JS, Kurten RC, Torjesen PA, Jahnsen T, Hansson V, Guerin S, Tasken KA: Characterization of the 5'-flanking region of the gene for the cAMP-inducible protein kinase A subunit, RIIbeta, in Sertoli cells. Mol Cell Endocrinol 1997, 129(1):101–114. 10.1016/S0303-7207(97)04045-8

    CAS  PubMed  Google Scholar 

  44. 44.

    Singh IS, Luo Z, Kozlowski MT, Erlichman J: Association of USF and c-Myc with a helix-loop-helix-consensus motif in the core promoter of the murine type II beta regulatory subunit gene of cyclic adenosine 3', 5'-monophosphate-dependent protein kinase. Mol Endocrinol 1994, 8(9):1163–1174. 10.1210/me.8.9.1163

    CAS  PubMed  Google Scholar 

  45. 45.

    Barradeau S, Imaizumi-Scherrer T, Weiss MC, Faust DM: Muscle-regulated expression and determinants for neuromuscular junctional localization of the mouse RIalpha regulatory subunit of cAMP-dependent protein kinase. Proc Natl Acad Sci U S A 2001, 98(9):5037–5042. 10.1073/pnas.081393598

    PubMed Central  CAS  PubMed  Google Scholar 

  46. 46.

    Fernandez PC, Frank SR, Wang L, Schroeder M, Liu S, Greene J, Cocito A, Amati B: Genomic targets of the human c-Myc protein. Genes Dev 2003, 17(9):1115–1129. 10.1101/gad.1067003

    PubMed Central  CAS  PubMed  Google Scholar 

  47. 47.

    Li Z, Van Calcar S, Qu C, Cavenee WK, Zhang MQ, Ren B: A global transcriptional regulatory role for c-Myc in Burkitt's lymphoma cells. Proc Natl Acad Sci U S A 2003, 100(14):8164–8169. 10.1073/pnas.1332764100

    PubMed Central  CAS  PubMed  Google Scholar 

  48. 48.

    Streb JW, Miano JM: AKAP12alpha, an atypical serum response factor-dependent target gene. J Biol Chem 2005, 280(6):4125–4134. 10.1074/jbc.M412466200

    CAS  PubMed  Google Scholar 

  49. 49.

    Mao DY, Watson JD, Yan PS, Barsyte-Lovejoy D, Khosravi F, Wong WW, Farnham PJ, Huang TH, Penn LZ: Analysis of Myc bound loci identified by CpG island arrays shows that Max is essential for Myc-dependent repression. Curr Biol 2003, 13(10):882–886. 10.1016/S0960-9822(03)00297-5

    CAS  PubMed  Google Scholar 

  50. 50.

    D'Sa C, Tolbert LM, Conti M, Duman RS: Regulation of cAMP-specific phosphodiesterases type 4B and 4D (PDE4) splice variants by cAMP signaling in primary cortical neurons. J Neurochem 2002, 81(4):745–757. 10.1046/j.1471-4159.2002.00878.x

    PubMed  Google Scholar 

  51. 51.

    Lin CS, Chow S, Lau A, Tu R, Lue TF: Identification and regulation of human PDE5A gene promoter. Biochem Biophys Res Commun 2001, 280(3):684–692. 10.1006/bbrc.2000.4220

    CAS  PubMed  Google Scholar 

  52. 52.

    Lin CS, Chow S, Lau A, Tu R, Lue TF: Regulation of human PDE5A2 intronic promoter by cAMP and cGMP: identification of a critical Sp1-binding site. Biochem Biophys Res Commun 2001, 280(3):693–699. 10.1006/bbrc.2000.4221

    CAS  PubMed  Google Scholar 

  53. 53.

    Mohamed MK, Taylor RE, Feinstein DS, Huang X, Pittler SJ: Structure and upstream region characterization of the human gene encoding rod photoreceptor cGMP phosphodiesterase alpha-subunit. J Mol Neurosci 1998, 10(3):235–250. 10.1007/BF02761777

    CAS  PubMed  Google Scholar 

  54. 54.

    Lerner LE, Gribanova YE, Whitaker L, Knox BE, Farber DB: The rod cGMP-phosphodiesterase beta-subunit promoter is a specific target for Sp4 and is not activated by other Sp proteins or CRX. J Biol Chem 2002, 277(29):25877–25883. 10.1074/jbc.M201407200

    CAS  PubMed  Google Scholar 

  55. 55.

    Lerner LE, Gribanova YE, Ji M, Knox BE, Farber DB: Nrl and Sp nuclear proteins mediate transcription of rod-specific cGMP-phosphodiesterase beta-subunit gene: involvement of multiple response elements. J Biol Chem 2001, 276(37):34999–35007. 10.1074/jbc.M103301200

    CAS  PubMed  Google Scholar 

  56. 56.

    Torras-Llort M, Azorin F: Functional characterization of the human phosphodiesterase 7A1 promoter. Biochem J 2003, 373(Pt 3):835–843. 10.1042/BJ20021829

    PubMed Central  CAS  PubMed  Google Scholar 

  57. 57.

    Delfino FJ, Walker WH: NF-kappaB induces cAMP-response element-binding protein gene transcription in sertoli cells. J Biol Chem 1999, 274(50):35607–35613. 10.1074/jbc.274.50.35607

    CAS  PubMed  Google Scholar 

  58. 58.

    Shell SA, Fix C, Olejniczak D, Gram-Humphrey N, Walker WH: Regulation of cyclic adenosine 3',5'-monophosphate response element binding protein (CREB) expression by Sp1 in the mammalian testis. Biol Reprod 2002, 66(3):659–666. 10.1095/biolreprod66.3.659

    CAS  PubMed  Google Scholar 

  59. 59.

    Mayr B, Montminy M: Transcriptional regulation by the phosphorylation-dependent factor CREB. Nat Rev Mol Cell Biol 2001, 2(8):599–609. 10.1038/35085068

    CAS  PubMed  Google Scholar 

  60. 60.

    Impey S, McCorkle SR, Cha-Molstad H, Dwyer JM, Yochum GS, Boss JM, McWeeney S, Dunn JJ, Mandel G, Goodman RH: Defining the CREB regulon: a genome-wide analysis of transcription factor regulatory regions. Cell 2004, 119(7):1041–1054.

    CAS  PubMed  Google Scholar 

  61. 61.

    Zhang X, Odom DT, Koo SH, Conkright MD, Canettieri G, Best J, Chen H, Jenner R, Herbolsheimer E, Jacobsen E, et al.: Genome-wide analysis of cAMP-response element binding protein occupancy, phosphorylation, and target gene activation in human tissues. Proc Natl Acad Sci U S A 2005, 102(12):4459–4464. 10.1073/pnas.0501076102

    PubMed Central  CAS  PubMed  Google Scholar 

  62. 62.

    Chao JR, Ni YG, Bolanos CA, Rahman Z, DiLeone RJ, Nestler EJ: Characterization of the mouse adenylyl cyclase type VIII gene promoter: regulation by cAMP and CREB. Eur J Neurosci 2002, 16(7):1284–1294. 10.1046/j.1460-9568.2002.02186.x

    PubMed  Google Scholar 

  63. 63.

    Wang D, Deng C, Bugaj-Gaweda B, Kwan M, Gunwaldsen C, Leonard C, Xin X, Hu Y, Unterbeck A, De Vivo M: Cloning and characterization of novel PDE4D isoforms PDE4D6 and PDE4D7. Cell Signal 2003, 15(9):883–891. 10.1016/S0898-6568(03)00042-1

    PubMed  Google Scholar 

  64. 64.

    Walker WH, Habener JF: Role of transcription factors CREB and CREM in cAMP-regulated transcription during spermatogenesis. Trends Endocrinol Metab 1996, 7(4):133–138. 10.1016/1043-2760(96)00035-5

    CAS  PubMed  Google Scholar 

  65. 65.

    Meyer TE, Waeber G, Lin J, Beckmann W, Habener JF: The promoter of the gene encoding 3',5'-cyclic adenosine monophosphate (cAMP) response element binding protein contains cAMP response elements: evidence for positive autoregulation of gene transcription. Endocrinology 1993, 132(2):770–780. 10.1210/en.132.2.770

    CAS  PubMed  Google Scholar 

  66. 66.

    Coven E, Ni Y, Widnell KL, Chen J, Walker WH, Habener JF, Nestler EJ: Cell type-specific regulation of CREB gene expression: mutational analysis of CREB promoter activity. J Neurochem 1998, 71(5):1865–1874.

    CAS  PubMed  Google Scholar 

  67. 67.

    Black AR, Black JD, Azizkhan-Clifford J: Sp1 and kruppel-like factor family of transcription factors in cell growth regulation and cancer. J Cell Physiol 2001, 188(2):143–160. 10.1002/jcp.1111

    CAS  PubMed  Google Scholar 

  68. 68.

    Merchant JL, Du M, Todisco A: Sp1 phosphorylation by Erk 2 stimulates DNA binding. Biochem Biophys Res Commun 1999, 254(2):454–461. 10.1006/bbrc.1998.9964

    CAS  PubMed  Google Scholar 

  69. 69.

    Milde-Langosch K: The Fos family of transcription factors and their role in tumourigenesis. Eur J Cancer 2005, 41(16):2449–2461. 10.1016/j.ejca.2005.08.008

    CAS  PubMed  Google Scholar 

  70. 70.

    Milde-Langosch K, Roder H, Andritzky B, Aslan B, Hemminger G, Brinkmann A, Bamberger CM, Loning T, Bamberger AM: The role of the AP-1 transcription factors c-Fos, FosB, Fra-1 and Fra-2 in the invasion process of mammary carcinomas. Breast Cancer Res Treat 2004, 86(2):139–152. 10.1023/B:BREA.0000032982.49024.71

    CAS  PubMed  Google Scholar 

  71. 71.

    Maurer GD, Leupold JH, Schewe DM, Biller T, Kates RE, Hornung HM, Lau-Werner U, Post S, Allgayer H: Analysis of specific transcriptional regulators as early predictors of independent prognostic relevance in resected colorectal cancer. Clin Cancer Res 2007, 13(4):1123–1132. 10.1158/1078-0432.CCR-06-1668

    CAS  PubMed  Google Scholar 

  72. 72.

    Pore N, Liu S, Shu HK, Li B, Haas-Kogan D, Stokoe D, Milanini-Mongiat J, Pages G, O'Rourke DM, Bernhard E, et al.: Sp1 is involved in Akt-mediated induction of VEGF expression through an HIF-1-independent mechanism. Mol Biol Cell 2004, 15(11):4841–4853. 10.1091/mbc.E04-05-0374

    PubMed Central  CAS  PubMed  Google Scholar 

  73. 73.

    Milanini-Mongiat J, Pouyssegur J, Pages G: Identification of two Sp1 phosphorylation sites for p42/p44 mitogen-activated protein kinases: their implication in vascular endothelial growth factor gene transcription. J Biol Chem 2002, 277(23):20631–20639. 10.1074/jbc.M201753200

    CAS  PubMed  Google Scholar 

  74. 74.

    Foulds CE, Nelson ML, Blaszczak AG, Graves BJ: Ras/mitogen-activated protein kinase signaling activates Ets-1 and Ets-2 by CBP/p300 recruitment. Mol Cell Biol 2004, 24(24):10954–10964. 10.1128/MCB.24.24.10954-10964.2004

    PubMed Central  CAS  PubMed  Google Scholar 

  75. 75.

    Yang BS, Hauser CA, Henkel G, Colman MS, Van Beveren C, Stacey KJ, Hume DA, Maki RA, Ostrowski MC: Ras-mediated phosphorylation of a conserved threonine residue enhances the transactivation activities of c-Ets1 and c-Ets2. Mol Cell Biol 1996, 16(2):538–547.

    PubMed Central  CAS  PubMed  Google Scholar 

  76. 76.

    Lee KW, Kim MS, Kang NJ, Kim DH, Surh YJ, Lee HJ, Moon A: H-Ras selectively up-regulates MMP-9 and COX-2 through activation of ERK1/2 and NF-kappaB: an implication for invasive phenotype in rat liver epithelial cells. Int J Cancer 2006, 119(8):1767–1775. 10.1002/ijc.22056

    CAS  PubMed  Google Scholar 

  77. 77.

    Kim BY, Gaynor RB, Song K, Dritschilo A, Jung M: Constitutive activation of NF-kappaB in Ki-ras-transformed prostate epithelial cells. Oncogene 2002, 21(29):4490–4497. 10.1038/sj.onc.1205547

    CAS  PubMed  Google Scholar 

  78. 78.

    Finco TS, Baldwin AS Jr.: Kappa B site-dependent induction of gene expression by diverse inducers of nuclear factor kappa B requires Raf-1. J Biol Chem 1993, 268(24):17676–17679.

    CAS  PubMed  Google Scholar 

  79. 79.

    Wong WK, Ou XM, Chen K, Shih JC: Activation of human monoamine oxidase B gene expression by a protein kinase C MAPK signal transduction pathway involves c-Jun and Egr-1. J Biol Chem 2002, 277(25):22222–22230. 10.1074/jbc.M202844200

    PubMed Central  CAS  PubMed  Google Scholar 

  80. 80.

    Rim M, Qureshi SA, Gius D, Nho J, Sukhatme VP, Foster DA: Evidence that activation of the Egr-1 promoter by v-Raf involves serum response elements. Oncogene 1992, 7(10):2065–2068.

    CAS  PubMed  Google Scholar 

  81. 81.

    Hodge C, Liao J, Stofega M, Guan K, Carter-Su C, Schwartz J: Growth hormone stimulates phosphorylation and activation of elk-1 and expression of c-fos, egr-1, and junB through activation of extracellular signal-regulated kinases 1 and 2. J Biol Chem 1998, 273(47):31327–31336. 10.1074/jbc.273.47.31327

    CAS  PubMed  Google Scholar 

  82. 82.

    Guha M, O'Connell MA, Pawlinski R, Hollis A, McGovern P, Yan SF, Stern D, Mackman N: Lipopolysaccharide activation of the MEK-ERK1/2 pathway in human monocytic cells mediates tissue factor and tumor necrosis factor alpha expression by inducing Elk-1 phosphorylation and Egr-1 expression. Blood 2001, 98(5):1429–1439. 10.1182/blood.V98.5.1429

    CAS  PubMed  Google Scholar 

  83. 83.

    Sears R, Nuckolls F, Haura E, Taya Y, Tamai K, Nevins JR: Multiple Ras-dependent phosphorylation pathways regulate Myc protein stability. Genes Dev 2000, 14(19):2501–2514. 10.1101/gad.836800

    PubMed Central  CAS  PubMed  Google Scholar 

  84. 84.

    Sears RC: The life cycle of C-myc: from synthesis to degradation. Cell Cycle 2004, 3(9):1133–1137.

    CAS  PubMed  Google Scholar 

  85. 85.

    Mo X, Kowenz-Leutz E, Xu H, Leutz A: Ras induces mediator complex exchange on C/EBP beta. Mol Cell 2004, 13(2):241–250. 10.1016/S1097-2765(03)00521-5

    CAS  PubMed  Google Scholar 

  86. 86.

    Nebl G, Mermod N, Cato AC: Post-transcriptional down-regulation of expression of transcription factor NF1 by Ha-ras oncogene. J Biol Chem 1994, 269(10):7371–7378.

    CAS  PubMed  Google Scholar 

  87. 87.

    Cho-Chung YS: Role of cyclic AMP receptor proteins in growth, differentiation, and suppression of malignancy: new approaches to therapy. Cancer Res 1990, 50(22):7093–7100.

    CAS  PubMed  Google Scholar 

  88. 88.

    Cho-Chung YS, Pepe S, Clair T, Budillon A, Nesterova M: cAMP-dependent protein kinase: role in normal and malignant growth. Crit Rev Oncol Hematol 1995, 21(1–3):33–61. 10.1016/1040-8428(94)00166-9

    CAS  PubMed  Google Scholar 

  89. 89.

    Spina AM, Chiosi E, Naviglio S, Valente F, Marchese M, Furgi A, Metafora S, Illiano G: ras oncogene-induced transformation of a rat seminal vesicle epithelial cell line produces a marked increase of adenylate cyclase and protein kinase C activities. FEBS Lett 1993, 331(1–2):150–154. 10.1016/0014-5793(93)80315-L

    CAS  PubMed  Google Scholar 

  90. 90.

    Franks DJ, Whitfield JF, Durkin JP: A viral K-RAS protein increases the stimulability of adenylate cyclase by cholera toxin in NRK cells. Biochem Biophys Res Commun 1987, 147(2):596–601. 10.1016/0006-291X(87)90972-7

    CAS  PubMed  Google Scholar 

  91. 91.

    Franks DJ, Whitfield JF, Durkin JP: Viral p21 Ki-RAS protein: a potent intracellular mitogen that stimulates adenylate cyclase activity in early G1 phase of cultured rat cells. J Cell Biochem 1987, 33(2):87–94. 10.1002/jcb.240330203

    CAS  PubMed  Google Scholar 

  92. 92.

    Beckner SK, Hattori S, Shih TY: The ras oncogene product p21 is not a regulatory component of adenylate cyclase. Nature 1985, 317(6032):71–72. 10.1038/317071a0

    CAS  PubMed  Google Scholar 

  93. 93.

    Colletta G, Corda D, Schettini G, Cirafici AM, Kohn LD, Consiglio E: Adenylate cyclase activity of v-ras-k transformed rat epithelial thyroid cells. FEBS Lett 1988, 228(1):37–41. 10.1016/0014-5793(88)80579-9

    CAS  PubMed  Google Scholar 

  94. 94.

    Levitzki A, Rudick J, Pastan I, Vass WC, Lowy DR: Adenylate cyclase activity of NIH 3T3 cells morphologically transformed by ras genes. FEBS Lett 1986, 197(1–2):134–138. 10.1016/0014-5793(86)80313-1

    CAS  PubMed  Google Scholar 

  95. 95.

    Liu H, Palmer D, Jimmo SL, Tilley DG, Dunkerley HA, Pang SC, Maurice DH: Expression of phosphodiesterase 4D (PDE4D) is regulated by both the cyclic AMP-dependent protein kinase and mitogen-activated protein kinase signaling pathways. A potential mechanism allowing for the coordinated regulation of PDE4D activity and expression in cells. J Biol Chem 2000, 275(34):26615–26624. 10.1074/jbc.M001634200

    CAS  PubMed  Google Scholar 

  96. 96.

    Houslay MD, Baillie GS: The role of ERK2 docking and phosphorylation of PDE4 cAMP phosphodiesterase isoforms in mediating cross-talk between the cAMP and ERK signalling pathways. Biochem Soc Trans 2003, 31(Pt 6):1186–1190. 10.1042/BST0311186

    CAS  PubMed  Google Scholar 

  97. 97.

    Salas TR, Reddy SA, Clifford JL, Davis RJ, Kikuchi A, Lippman SM, Menter DG: Alleviating the suppression of glycogen synthase kinase-3beta by Akt leads to the phosphorylation of cAMP-response element-binding protein and its transactivation in intact cell nuclei. J Biol Chem 2003, 278(42):41338–41346. 10.1074/jbc.M302972200

    CAS  PubMed  Google Scholar 

  98. 98.

    Xing J, Ginty DD, Greenberg ME: Coupling of the RAS-MAPK pathway to gene activation by RSK2, a growth factor-regulated CREB kinase. Science 1996, 273(5277):959–963. 10.1126/science.273.5277.959

    CAS  PubMed  Google Scholar 

  99. 99.

    Piccoli C, Scacco S, Bellomo F, Signorile A, Iuso A, Boffoli D, Scrima R, Capitanio N, Papa S: cAMP controls oxygen metabolism in mammalian cells. FEBS Lett 2006, 580(18):4539–4543. 10.1016/j.febslet.2006.06.085

    CAS  PubMed  Google Scholar 

  100. 100.

    Raha S, Myint AT, Johnstone L, Robinson BH: Control of oxygen free radical formation from mitochondrial complex I: roles for protein kinase A and pyruvate dehydrogenase kinase. Free Radic Biol Med 2002, 32(5):421–430. 10.1016/S0891-5849(01)00816-4

    CAS  PubMed  Google Scholar 

  101. 101.

    Warburg O: On the origin of cancer cells. Science 1956, 123(3191):309–314. 10.1126/science.123.3191.309

    CAS  PubMed  Google Scholar 

  102. 102.

    Chiaradonna F, Sacco E, Manzoni R, Giorgio M, Vanoni M, Alberghina L: Ras-dependent carbon metabolism and transformation in mouse fibroblasts. Oncogene 2006, 25(39):5391–5404. 10.1038/sj.onc.1209528

    CAS  PubMed  Google Scholar 

  103. 103.

    Chiaradonna F, Gaglio D, Vanoni M, Alberghina L: Expression of transforming K-Ras oncogene affects mitochondrial function and morphology in mouse fibroblasts. Biochim Biophys Acta 2006, 1757(9–10):1338–1356. 10.1016/j.bbabio.2006.08.001

    CAS  PubMed  Google Scholar 

  104. 104.

    Alberghina L, Hohmann S, Westerhoff HV: Systems Biology: necessary developments and trends. In Systems Biology: Definitions and Perspectives. Edited by: Alberghina L, Westerhoff HV. Springer-Verlag Berlin Heidelberg; 2005:389–402. [Hohmann S (Series Editor):Topics in Current Genetics, vol 13.] [Hohmann S (Series Editor):Topics in Current Genetics, vol 13.]

    Google Scholar 

  105. 105.

    Barrett T, Suzek TO, Troup DB, Wilhite SE, Ngau WC, Ledoux P, Rudnev D, Lash AE, Fujibuchi W, Edgar R: NCBI GEO: mining millions of expression profiles--database and tools. Nucleic Acids Res 2005, 33: D562–566. Database issue Database issue 10.1093/nar/gki022

    PubMed Central  CAS  PubMed  Google Scholar 

  106. 106.

    Su AI, Cooke MP, Ching KA, Hakak Y, Walker JR, Wiltshire T, Orth AP, Vega RG, Sapinoso LM, Moqrich A, et al.: Large-scale analysis of the human and mouse transcriptomes. Proc Natl Acad Sci U S A 2002, 99(7):4465–4470. 10.1073/pnas.012025199

    PubMed Central  CAS  PubMed  Google Scholar 

  107. 107.

    Wu F, Dassopoulos T, Cope L, Maitra A, Brant SR, Harris ML, Bayless TM, Parmigiani G, Chakravarti S: Genome-wide gene expression differences in Crohn's disease and ulcerative colitis from endoscopic pinch biopsies: insights into distinctive pathogenesis. Inflamm Bowel Dis 2007, 13(7):807–821. 10.1002/ibd.20110

    PubMed  Google Scholar 

  108. 108.

    Barnes MG, Aronow BJ, Luyrink LK, Moroldo MB, Pavlidis P, Passo MH, Grom AA, Hirsch R, Giannini EH, Colbert RA, et al.: Gene expression in juvenile arthritis and spondyloarthropathy: pro-angiogenic ELR+ chemokine genes relate to course of arthritis. Rheumatology (Oxford) 2004, 43(8):973–979. 10.1093/rheumatology/keh224

    CAS  Google Scholar 

  109. 109.

    Irizarry RA, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ, Scherf U, Speed TP: Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics (Oxford, England) 2003, 4(2):249–264.

    Google Scholar 

  110. 110.

    Cope LM, Irizarry RA, Jaffee HA, Wu Z, Speed TP: A benchmark for Affymetrix GeneChip expression measures. Bioinformatics 2004, 20(3):323–331. 10.1093/bioinformatics/btg410

    CAS  PubMed  Google Scholar 

  111. 111.

    Allison DB, Cui X, Page GP, Sabripour M: Microarray data analysis: from disarray to consolidation and consensus. Nature reviews 2006, 7(1):55–65. 10.1038/nrg1749

    CAS  PubMed  Google Scholar 

  112. 112.

    Oudes AJ, Campbell DS, Sorensen CM, Walashek LS, True LD, Liu AY: Transcriptomes of human prostate cells. BMC Genomics 2006, 7: 92. 10.1186/1471-2164-7-92

    PubMed Central  PubMed  Google Scholar 

  113. 113.

    Casorelli I, Tenedini E, Tagliafico E, Blasi MF, Giuliani A, Crescenzi M, Pelosi E, Testa U, Peschle C, Mele L, et al.: Identification of a molecular signature for leukemic promyelocytes and their normal counterparts: Focus on DNA repair genes. Leukemia 2006, 20(11):1978–1988. 10.1038/sj.leu.2404376

    CAS  PubMed  Google Scholar 

  114. 114.

    Irizarry RA, Bolstad BM, Collin F, Cope LM, Hobbs B, Speed TP: Summaries of Affymetrix GeneChip probe level data. Nucleic Acids Res 2003, 31(4):e15. 10.1093/nar/gng015

    PubMed Central  PubMed  Google Scholar 

  115. 115.

    Fisher RA: Statistical Methods for Research Workers. Edinburgh: Oliver and Boyd; 1925.

    Google Scholar 

  116. 116.

    Fisher LD vBG: Biostatistics: a methodology for the health sciences. New York: John Wiley & Sons Inc.; 1993.

    Google Scholar 

  117. 117.

    Kerr MK, Martin M, Churchill GA: Analysis of variance for gene expression microarray data. J Comput Biol 2000, 7(6):819–837. 10.1089/10665270050514954

    CAS  PubMed  Google Scholar 

  118. 118.

    Coombes KR, Highsmith WE, Krogmann TA, Baggerly KA, Stivers DN, Abruzzo LV: Identifying and quantifying sources of variation in microarray data using high-density cDNA membrane arrays. J Comput Biol 2002, 9(4):655–669. 10.1089/106652702760277372

    CAS  PubMed  Google Scholar 

  119. 119.

    Pritchard CC, Hsu L, Delrow J, Nelson PS: Project normal: defining normal variance in mouse gene expression. Proc Natl Acad Sci U S A 2001, 98(23):13266–13271. 10.1073/pnas.221465998

    PubMed Central  CAS  PubMed  Google Scholar 

  120. 120.

    Pavlidis P, Noble WS: Analysis of strain and regional variation in gene expression in mouse brain. Genome Biol 2001, 2(10):RESEARCH0042. 10.1186/gb-2001-2-10-research0042

    PubMed Central  CAS  PubMed  Google Scholar 

  121. 121.

    Johnson SC: Hierarchical clustering schemes. Psychometrika 1967, 32(3):241–254. 10.1007/BF02289588

    CAS  PubMed  Google Scholar 

  122. 122.

    Eisen MB, Spellman PT, Brown PO, Botstein D: Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci U S A 1998, 95(25):14863–14868. 10.1073/pnas.95.25.14863

    PubMed Central  CAS  PubMed  Google Scholar 

  123. 123.

    Forbes SA, Bhamra G, Bamford S, Dawson E, Kok C, Clements J, Menzies A, Teague JW, Futreal PA, Stratton MR: The Catalogue of Somatic Mutations in Cancer (COSMIC). In Current protocols in human genetics /editorial board Edited by: Jonathan L Haines et al. 2008. Chapter 10: Unit 10 11. Chapter 10: Unit 10 11.

    Google Scholar 

  124. 124.

    Joliffe IT, Morgan BJ: Principal component analysis and exploratory factor analysis. Statistical methods in medical research 1992, 1(1):69–95. 10.1177/096228029200100105

    CAS  PubMed  Google Scholar 

  125. 125.

    Raychaudhuri S, Stuart JM, Altman RB: Principal components analysis to summarize microarray experiments: application to sporulation time series. Pacific Symposium on Biocomputing 2000, 455–466.

    Google Scholar 

  126. 126.

    Frech K, Danescu-Mayer J, Werner T: A novel method to develop highly specific models for regulatory units detects a new LTR in GenBank which contains a functional promoter. J Mol Biol 1997, 270(5):674–687. 10.1006/jmbi.1997.1140

    CAS  PubMed  Google Scholar 

  127. 127.

    Cartharius K, Frech K, Grote K, Klocke B, Haltmeier M, Klingenhoff A, Frisch M, Bayerlein M, Werner T: MatInspector and beyond: promoter analysis based on transcription factor binding sites. Bioinformatics 2005, 21(13):2933–2942. 10.1093/bioinformatics/bti473

    CAS  PubMed  Google Scholar 

  128. 128.

    Zhao F, Xuan Z, Liu L, Zhang MQ: TRED: a Transcriptional Regulatory Element Database and a platform for in silico gene regulation studies. Nucleic acids research 2005, 33: D103–107. (Database Issue) (Database Issue) 10.1093/nar/gki004

    PubMed Central  CAS  PubMed  Google Scholar 

Download references


This work has been supported by grants from ITALBIONET to L.A. and FAR to M.V. and F.C. The authors wish to thank Neil Campbell for language checking.

This article has been published as part of BMC Bioinformatics Volume 10 Supplement 12, 2009: Bioinformatics Methods for Biomedical Complex System Applications. The full contents of the supplement are available online at

Author information



Corresponding author

Correspondence to Ferdinando Chiaradonna.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

CB, MV and FC conceived the experiments and wrote the manuscript. CB and FC carried out the data analysis. LA, MV and FC participated in data analysis. All authors read, edit and approved the final manuscript.

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Balestrieri, C., Alberghina, L., Vanoni, M. et al. Data recovery and integration from public databases uncovers transformation-specific transcriptional downregulation of cAMP-PKA pathway-encoding genes. BMC Bioinformatics 10, S1 (2009).

Download citation

  • Published:

  • DOI:


  • Mutation Cell Line
  • Gene Expression Omnibus Database
  • Robust Multichip Average
  • Complete Linkage Cluster
  • Inducible cAMP Early Repressor