Coregulation of transcription factors and microRNAs in human transcriptional regulatory network
© Chen et al; licensee BioMed Central Ltd. 2011
Published: 15 February 2011
Skip to main content
© Chen et al; licensee BioMed Central Ltd. 2011
Published: 15 February 2011
MicroRNAs (miRNAs) are small RNA molecules that regulate gene expression at the post-transcriptional level. Recent studies have suggested that miRNAs and transcription factors are primary metazoan gene regulators; however, the crosstalk between them still remains unclear.
We proposed a novel model utilizing functional annotation information to identify significant coregulation between transcriptional and post-transcriptional layers. Based on this model, function-enriched coregulation relationships were discovered and combined into different kinds of functional coregulation networks.
We found that miRNAs may engage in a wider diversity of biological processes by coordinating with transcription factors, and this kind of cross-layer coregulation may have higher specificity than intra-layer coregulation. In addition, the coregulation networks reveal several types of network motifs, including feed-forward loops and massive upstream crosstalk. Finally, the expression patterns of these coregulation pairs in normal and tumour tissues were analyzed. Different coregulation types show unique expression correlation trends. More importantly, the disruption of coregulation may be associated with cancers.
Our findings elucidate the combinatorial and cooperative properties of transcription factors and miRNAs regulation, and we proposes that the coordinated regulation may play an important role in many biological processes.
Transcriptional regulatory networks describe the interactions between transcriptional regulatory proteins and their target genes [1–3]. These regulators, known as transcription factors (TFs), are proteins that bind to specific DNA sequences and thereby control the transcription of genetic information encoded in DNA sequences. The interactions between TFs and target genes regulate the transcriptional activities of genome and thus determine the global gene expression program of a living cell.
In the last decade, microRNAs (miRNAs) have emerged as another prominent class of gene regulators. miRNAs are endogenous small RNA molecules that are abundant in animals, plants, and some viruses. They can reduce stability and/or translation activity of fully or partially sequence-complementary messenger RNAs (mRNAs), thus regulating gene expression at the post-transcriptional level. It has been found that miRNAs may control many biological processes in development, differentiation, growth, and even cancer development and progression [4–6].
Recent studies have suggested that miRNAs and TFs are primary metazoan gene regulators, and they seem to function in a similar regulatory logic, such as pleiotropy, combinatorial and cooperative activity, regulation, and even network motifs [7, 8]. However, how miRNAs interplay and coordinate with TFs in the regulatory network still remains unclear. Since combinatorial interactions between miRNAs and TFs are complicated and thus hard to be validated by high-throughput experiments, computational modelling may provide a better clue to understand such complex relationships.
Currently, to uncover the coregulation interactions between miRNAs and TFs, researchers have to overcome two challenges. One is the incomplete knowledge of regulatory targets. Because the available experimentally verified targets of miRNAs and TFs are far from complete, the regulatory target datasets for global analysis were mainly from computational prediction. The other challenge is about how to integrate transcriptional and post-transcriptional layers to discover highly confident coregulation relationships. To solve these problems, previous studies have developed a bottom-up strategy; that is, they inferred the coordination between two upstream regulators from their downstream shared targets [9, 10]. These inferences were basically based on different probabilistic models and statistical tests to measure the significance of shared targets between regulators. Indeed, the methods successfully eliminated those insignificant coregulation interactions occurred merely by chance; however, the biological meanings were ignored in the integration of transcriptional and post-transcriptional regulation interactions.
Here we proposed a novel framework utilizing functional annotation information to identify significant coregulation between transcriptional and post-transcriptional layers. Based on this model, function-enriched coregulation pairs were discovered, and the regulators were subsequently linked by enriched functions. With these functional linkages, we further constructed functional coregulation networks between regulators and investigated their characteristics. Next, we searched for the network motifs consisting of those function-enriched coregulation pairs, and found that an abundance of pairs were closely connected in their upstream. Finally, the expression patterns of function-enriched coregulation pairs were analyzed. Different coregulation types showed distinct expression correlation trends. More importantly, we found that the disruption of coregulation may be closely related to cancers.
The transcriptional regulation relationships between human transcription factors and their target genes were collected from TRED (Transcriptional Regulatory Element Database) . The database provides genome-wide promoter annotation and transcription factor binding information from computational prediction and experimental evidence.
To collect all human TF-target regulation relationships in TRED, we firstly queried the list of all human TFs in the database. A total of 178 human TFs were obtained by this step. Next, we searched TF target genes for each TF using default parameters (promoter quality from "known, curated" to "with RNA" and "all" binding quality). The results showed that only 133 TFs were found to have at least one target gene by these criteria, and the final number of unique TF-target relationships was 6,764, which were used to construct the human transcriptional regulatory network for our analysis.
Since the available experimentally verified human miRNA targets are far from complete and thus not enough for global analysis, we used predicted miRNA targets from the TargetScan database (release 4.2) to perform the analysis . In addition, different mature miRNAs may have identical seed regions and thereby target the same binding sites. To eliminate those coregulation interactions among the miRNAs with identical seed regions, we grouped mature miRNAs into families based on the miRNA family information from TargetScan. A total of 162 miRNA families and 7,521 target genes with 44,782 interactions were collected.
It is still difficult to predict the promoter region of miRNA genes in the genome. But it has been known that embedded miRNAs frequently coexpress with their host genes [13, 14]. Therefore, we extracted miRNA host gene information from miRBase  and integrated the embedded miRNAs biogenesis information into the established transcriptional regulation network. A total of 310 premature miRNAs were found embedded in 259 host genes. Most of them (93%) were resided in introns.
Secondly, for each group of shared targets, the distribution of Gene Ontology (GO) annotations  at the second level in the biological process namespace was calculated. We chose the second level ontology because most of the genes were generally well-annotated at this level and these annotations provided a good balance between the sensitivity and the specificity in the following functional enrichment test. The distributions were considered as the functional profiles or fingerprints for these coregulation pairs.
Next, we utilized a randomization method to perform a permutation test for functional enrichment. For each group of shared targets, we randomly selected a null group of the same size from whole human genome as background. After 10,000 iterations, the log-likelihood score under multivariate hypergeometric distribution was measured to quantify the significance of functional enrichment. The correction for multiple comparisons was made under 0.05 false discovery rate (FDR) . The final results of significant coregulation pairs were listed in additional file 1.
We searched for network motifs preferentially occurred in function-enriched coregulation pairs rather than in random pairs by a resampling process. The predicted TF-targeting interactions for miRNA genes were collected from miRBase  and from literature . In addition, we assumed that those embedded miRNA genes have same transcription units as their host genes and would be regulated together.
A total of 10,000 background sets of regulator pairs that have the same size as the set of function-enriched pairs were randomly selected from the global network. For each type of network patterns (sub-graphs), the observed frequency from the function-enriched coregulation pairs was first calculated and compared to the background distribution for assessment of significance. Only those network patterns with occurrence probabilities less than 0.001 were considered significant motifs (see additional file 2 for these significant motifs).
The miRNA and mRNA expression profiles were adopted from a previous study . A total of 217 miRNAs and ~16,000 mRNAs across 8 human tissues (colon, pancreas, kidney, bladder, prostate, uterus, lung, and breast) were measured using miRNA bead-arrays and mRNA microarrays. Both normal and tumor samples were profiled for each tissue. For each type of coregulation, we first generated background distribution by calculating the Pearson's correlation coefficients (PCCs) of expression profiles between the two paired regulators in all possible pairs (i.e., those pairs shared no targets and/or those pairs not identified as function-enriched). After that, the distribution of enriched coregulation pairs was calculated and shown against the background.
After the integration of miRNA regulation into human transcriptional regulation network, we adopted a novel strategy utilizing functional information to identify function-enriched coregulation pairs, and establish function linkages for each pair. Traditional analysis of functional enrichment was aimed at elucidating the regulatory roles of each individual regulator only, inevitably leaving some significant coregulation hidden in the traditional views. Instead, based on our model, different regulation types involving single regulators or combinations of regulators can all be studied and compared.
In contrast, miRNA-involving regulation shows divergent distributions in Figure 3B. The top ranked biological processes of individual miRNA regulation were biological regulation, cellular process, and developmental process, which were the previously known miRNA-involving processes [4–6]. On the other hand, biological adhesion was relatively high in miRNA-miRNA coregulation, suggesting that miRNAs may regulate this process majorly in a coordination manner.
Moreover, many biological processes enriched in TF-miRNA coregulation were relatively poor in the regulation involving miRNAs only. In other words, those processes may be the typical processes needed to be coordinately regulated by TFs and miRNAs, and the coordination may provide a mechanism to switch expression programs. More importantly, it suggested that, by coordinating with TFs, miRNAs may engage in a wider diversity of biological processes, and these undiscovered processes were failed to be identified by traditional analysis of functional enrichment for a single regulator.
In the previous section, different regulators were connected by identified functional linkages, which represented that the two paired regulators may function in coordination with each other in a specific biological process. We further built up functional coregulation networks from these linkages and found interesting properties in the networks.
Many studies have been devoted to understanding network structures in gene regulatory networks, and have found that most networks seem to be largely composed of occurring patterns, called network motifs. The functions associated with common network motifs, such as auto-regulation and feed-forward loops (FFLs), were discovered and revealed by several researches both theoretically and experimentally [1, 9, 10, 19–22].
Expression data across human normal/tumor tissues have recently become available. A previous study measured miRNA and mRNA expression profiles across 8 tissues (colon, pancreas, kidney, bladder, prostate, uterus, lung, and breast) and each tissue contained both normal and tumor samples . By analyzing the expression profiles, we investigated the correlations between the expression profiles of each coregulation pair in both normal and cancer samples.
On the contrary, all coregulation types turn into an identical trend in tumor tissues. All of them show similar zero-centered distributions resembled to their backgrounds. This trend suggests that the function-enriched coregulation pairs lost their correlation in tumor tissues, implying the disruption of coregulation may be closely associated to cancers. Together these results may support the functionality of identified coregulation pairs.
We proposed a novel strategy aimed at identifying potential coordinated regulation by utilizing functional annotation information and discovered many biological processes that emerged only in coregulation. Compared to traditional function enrichment analysis, our strategy considered whole function profiles rather than single annotations. In addition, it also solves the restriction of traditional methods that only focus on single regulator. For example, a miRNA can potentially regulate an abundance of target genes. To find enriched functions of the miRNA, all its potential targets will be tested for any enriched function. However, since the target size of a miRNA may be huge, some meaningful biological processes involving only a small subset of genes will be hidden. In fact, these hidden processes may be significantly impacted by miRNAs in coordination with other regulators, namely, other miRNAs or TFs. After all, a biological process may be regulated not only at the transcriptional layer, but also at the posttranscriptional layer [7, 8, 23, 24].
Interestingly, our results show that pigmentation and reproductive process are two typical biological processes specifically emerging in TF-miRNA coregulation. It is suggested that miRNAs may provide genetic switch mechanisms to essentially inactivate the target genes, thus leading to detectable phenotypic consequences. In model organisms, there have been many studies investigating the switch-like role of miRNAs in pigmentation. For example, miRNAs can regulate the eye pigmentation genes in Drosophila . The influence of miRNAs on pigmentation in zebrafish was also reported . Another study found that miR-434-5p may mediate skin whitening and lightening in mouse . And in melanoma cell lines, it is shown that miR-137 may target a pigmentation regulator .
The analysis of functional coregulation networks provided other clues. We found that a TF may regulate in coordination with different miRNAs in different biological processes, and vice versa. It suggested that the cross-layer coregulation may have higher specificity than intra-layer coregulation.
We also performed network motif analysis to see if any recurring pattern exists in coregulation network structure. Different types of feed-forward loops were found in TF-TF and TF-miRNA coregulation, and these results were consistent with several previous studies on transcriptional network [1, 9, 10, 19–22]. Among these FFLs, a special kind of miRNA-mediated FFLs emerged in TF-miRNA coregulation. In this kind of FFLs, a miRNA may simultaneously repress a TF and its target genes, thus contributing to a switch-like control of expression programs. More importantly, we go further this time to investigate the upstream structure of coregulation pairs and found closely interaction in their upstream. It implies that the network structures of coregulation may have extensive crosstalk in the higher levels.
Finally, the expression analysis of coregulation discovered distinct trends in different coregulation types; namely, TF-TF showed no correlation, whereas miRNA-miRNA had a preference of positive correlation, and TF-miRNA appeared both positive and negative correlation. A previous study investigated only TF-miRNA correlation and found the same tendencies . The authors rationalized this trend by pointing out the distinct function roles that TFs and miRNAs may play. We further supported this idea by showing the results of TF-TF and miRNA-miRNA coregulation, which were also consistent with the same interpretation. In addition, TF activities are under control at protein level; that is, TFs may be activated or deactivated by a number of mechanisms including phosphorylation, ligand binding, and interaction with other regulatory proteins. Therefore, it is not surprising that co-function TFs may show no correlation in mRNA expression level. Notably, a large proportion of TF-miRNA pairs showed negative correlation in expression profiles, which could be explained by the structure of the miRNA-mediated-FFLs discussed before, supporting the idea that many miRNAs in TF-miRNA coregulation contributed to switch-like regulation.
More significantly, by comparing the expression correlations between normal and tumor tissues, we found a common trend in function-enriched coregulation pairs; that is, the function-enriched pairs lost their correlation in tumor tissues. It suggested that the disruption of coregulation may lead to abnormal expression programs and may be directly associated to cancers.
Our findings shed light on the coregulation of miRNAs in transcriptional regulatory network. Future experimental works will provide more complete knowledge in transcriptional network and miRNA regulation, thus allowing the elucidation of more precise co-regulatory mechanisms.
The authors wish to thank Wen-Hsiung Li and Chun-Chieh Arthur Shih for their helpful discussions. This work was supported by National Science Council of Taiwan, National Health Research Institutes (NHRI-EX98-9819PI), and NTU Frontier and Innovative Research Projects.
This article has been published as part of BMC Bioinformatics Volume 12 Supplement 1, 2011: Selected articles from the Ninth Asia Pacific Bioinformatics Conference (APBC 2011). The full contents of the supplement are available online at http://www.biomedcentral.com/1471-2105/12?issue=S1.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.