Skip to main content

Advertisement

We’d like to understand how you use our websites in order to improve them. Register your interest.

OncomiRdbB: a comprehensive database of microRNAs and their targets in breast cancer

Abstract

Background

Given the estimate that 30% of our genes are controlled by microRNAs, it is essential that we understand the precise relationship between microRNAs and their targets. OncomiRs are microRNAs (miRNAs) that have been frequently shown to be deregulated in cancer. However, although several oncomiRs have been identified and characterized, there is as yet no comprehensive compilation of this data which has rendered it underutilized by cancer biologists. There is therefore an unmet need in generating bioinformatic platforms to speed the identification of novel therapeutic targets.

Description

We describe here OncomiRdbB, a comprehensive database of oncomiRs mined from different existing databases for mouse and humans along with novel oncomiRs that we have validated in human breast cancer samples. The database also lists their respective predicted targets, identified using miRanda, along with their IDs, sequences, chromosome location and detailed description. This database facilitates querying by search strings including microRNA name, sequence, accession number, target genes and organisms. The microRNA networks and their hubs with respective targets at 3'UTR, 5'UTR and exons of different pathway genes were also deciphered using the 'R' algorithm.

Conclusion

OncomiRdbB is a comprehensive and integrated database of oncomiRs and their targets in breast cancer with multiple query options which will help enhance both understanding of the biology of breast cancer and the development of new and innovative microRNA based diagnostic tools and targets of therapeutic significance. OncomiRdbB is freely available for download through the URL link http://tdb.ccmb.res.in/OncomiRdbB/index.htm.

Background

Although microRNAs were discovered two decades ago, their importance as an abundant class of regulatory non-coding RNAs came to light only after their rediscovery by Reinhart et al., in 2000 [1]. To date, thousands of miRNAs have been identified across a range of human diseases and have been shown to be deregulated in the majority of cancers [2]. Identifying the signatures of oncomiRs in human cancer samples is extremely difficult due to the paucity of sample material. Also, cloning of these microRNAs has proven difficult due to their extremely small size and complexity of spatial and temporal expression patterns. However, the precursor transcripts of ~70-80 nt with a characteristic stem-loop fold back structure from which the miRNAs are derived, are conserved and this typical structure of precursor-microRNAs enables its identification through computational tools.

Finding the targets of a given miRNA is as important as identifying the miRNA itself to know the gene(s) it controls hence infer possible biological function through either negative or positive regulation of the target gene(s) [3]. Computational prediction of target genes for miRNAs in animals is a challenging task for both experimental and computational groups, due to the complexity in miRNA target recognition [4]. Eventhough perfect complementarity in miRNA-mRNA sequences is not required for target recognition and interaction, a small region in the 5′end (termed the seed region “position 2-8”) of miRNA is apparently needed for a near perfect base-pairing with its 3′UTR to inhibit the translation of mRNA [5].

Several databases of miRNAs, based on sequence annotation and miRNA genomics such as miRBase, [6] miRNAMap2.0 [7], miRGen [8], miRGator v2.0 [9], miRecords [10] have been developed. Amongst these databases, miRBase serves as a central database of mature miRNA sequences. MicroCosm, PhenomiR 2.0 [11] and mir2Disease [12] provide data pertinent to various diseases, whilst TargetScan [13], PicTar [14] and TargetMiner [15] are databases based on algorithms designed for microRNA target prediction for the complementary base-pairing in the seed region of the targets. TarBase [16] is a resource of experimentally validated microRNA targets, while RNAhybrid [17] is a resource based on calculation and hybridization between miRNA and mRNA. Recently, S-MED database has been created [18], which gives details of oncomiR expression profiles in sarcomas. However, there is currently no comprehensive oncomiR database for breast cancer that will provide an overview of microRNAs, their targets and the pathways that are regulated by them during breast cancer development. Our aim was to develop a database compiling all oncomiRs and their target genes involved in breast cancer and validate them using human breast cancer samples. The known oncomiRs responsible for breast cancer in two mammalian genomes, human and mouse were obtained from different databases including MicroCosm, PhenomiR and mir2Disease. The target genes were taken from the signaling pathways that are known to be affected during breast cancer. By using the miRanda algorithm [19], target genes were predicted for both human and mouse. Prediction programs including PicTar and TargetScan were used as additional sources. In nutshell, OncomiRdbB comprises of 782 human microRNAs and 246 murine microRNAs, along with 711 and 490 targets for both genomes respectively. It also provides the precise target location within the 3′UTR, 5′UTR or exon regions of the target genes. A proportion of the microRNAs have also been validated in breast cancer tissues using Taqman Low Density Arrays (TLDA). Amongst these, 34 microRNAs have not previously been reported to be associated with human breast cancer in any database. Hence, this is the first report of these novel microRNAs associated with breast cancer. Finally, using the ‘R’ algorithm and ‘GeneGo MetaCore’ (http://www.genego.com) we have characterized some of these microRNAs and their target interactions, generating examples of delineated networks and hubs of microRNA-target interaction. We envisage these network analyses will further support the identification of oncogenic targets of therapeutic value.

Construction

Mining of microRNAs and finding their targets

MicroRNAs of both human and mouse genomes were retrieved from different databases including miRBase, PhenomiR 2.0 and miR2Disease. A Perl subroutine was written to automate the retrieval and filtering of miRBase sequences. In order to find out the targets of these microRNAs, genes of various key signaling pathways including Wnt, JAKSTAT, Notch, Apoptosis, VEGF and MAPK were downloaded from KEGG (Kyoto Encyclopedia of Genes and Genomes) miRBase (MicroCosm), Cancer Gene Expression Data Base (CGED) [20]. The 3′UTR, 5′UTR and exon sequences of these genes were then extracted from the Ensembl database in FASTA format using the BioMart tool. miRanda was used at 3 different energy levels (EL-15,-20 and −25) to predict putative targets for the miRNAs involved in breast cancer at the 3′UTR, 5′UTR and exonic regions of the genes of the different pathways. The parameters used for the search were at the cutoff score of 120.

The miRanda software was initially designed to predict miRNA target genes in Drosophila. It is similar to the alignment proposed by Smith and Waterman [21], where scores were based on sequence complementarity and not on sequence identity. These targets were further confirmed using the TargetScan and PicTar algorithms.

Experimental validation using Taqman Low Density Arrays

Taqman Low Density Array (TLDA) was performed with 50 individual human samples followed by confirmation with LNA arrays. Reverse Transcription reaction was carried out using TaqMan MicroRNA Reverse Transcription Kit (Applied Biosystems, USA) followed by PCR Reaction. Real-time PCR was performed using Applied Biosystems 7900- TLDA Real-Time PCR System. Each TaqMan Assay was run in quadruplicate. All the samples displayed good RIN value, linearity (R2 > 0.96), good abundance (average CT range 22–28) and NTC CT >38. MicroRNA profiling was done using TaqMan MicroRNA Arrays, which contains 667 human microRNAs covering Sanger miRBase version10. A pre amplification step of cDNA with preamp megaplex pool primers was performed to enhance the ability to detect miRNAs with low expression. The TaqMan human microRNA arrays consisted of 2 plates: pool A and pool B. RNU 46 and RNU 48 were used as endogenous controls and are included for data normalization. One TaqMan microRNA Assay, not related to human, was also included as a negative control. The set enabled accurate quantification of 667 human microRNAs. Reverse transcription followed by PCR reactions were carried out in ABI 7900 HT. The results were analysed using spotfire (statminer) software and the data was normalized using RNU 46 & RNU 48 as endogenous controls. Fold changes were represented as log 10 2 - Δ Δ CT.

Pathway analysis of miRNA-Target

Functional enrichment analysis of microRNAs and their target genes was performed to identify unique, similar and common sets of genes/proteins. Enrichment analysis of miRNAs and their targets in oncogenic signaling pathways was performed using the GeneGo MetaCore software. This software is a web-based program and performs functional analysis of experimental data or computationally compiled datasets on ontologies such as GeneGo process networks, GeneGo diseases, disease biomarkers, gene ontology processes (GO) metabolic processes, various signaling pathways, canonical pathways, protein-DNA interactions, protein-protein interactions etc. This enrichment analysis creates a degree of relevance of the datasets to the GeneGo MetaCore suite, defined by p-values where lower p-values were assigned higher priorities. The statistical enrichment analysis of miRNAs and their targets were performed based on the p-values generated. We have used ClustalX to identify sequence similarity with a conserved biological function of miRNAs in breast cancer.

‘R’ algorithm

‘R’ is an open source statistical language for statistical analysis and graphics. It provides a wide variety of statistical and graphical techniques (linear and nonlinear modeling, statistical tests, time series analysis, classification, matrix manipulation and clustering). In this study R (v2.14) packages (ggplot2) [22] were used for building the network of miRNA and their target genes. To build-up the miRNA-target interaction network we constructed a matrix using the binary concept 0 & 1. As an example, hits between microRNAs and specific target genes were assigned a binary value of 1, while 0 implies absence of recognition. This matrix was uploaded in ‘R’ which generated a graphical miRNA-target interaction network. The ‘R’ program was used to generate such types of network where a single miRNA was hitting several targets or where several miRNAs hit single target in each oncogenic signaling pathway.

Contents

miRNAs responsible for breast cancer

The mined miRNAs were classified as breast cancer miRNAs based on their complementarity to the target genes of different oncogenic pathways (MicroCosm) deregulated in breast cancer. This was further confirmed using miRanda, TargetScan and PicTar (Figure 1). A comparison between OncomiRdbB and existing databases demonstrated the utility of integrating various databases into one comprehensive database (Figure 2). A phylogenetic functional relationship between microRNAs of human and mouse was established by aligning the sequences using ClustalX. A conserved biological function of breast cancer microRNAs between mice and human genomes was observed after clustering both data sets. In the cladogram, branches from the same node represent descendents of a similar ancestor or cluster of the same family indicating their origin from a common ancestor. For example, hsa-miR-145, miR-151-3p and miR-30 families align with their mmu-miR counterparts indicating a conserved biological function in breast cancer development in both the genomes (Additional file 1: Figure S1). We retrieved a total of 782 human and 246 mouse microRNAs and their respective sequences associated with breast cancer from existing miRNA databases including miRBase, miR2Disease and PhenomiR. We have validated these miRNAs with two different platforms, Taqman low density arrays consisting of 667 human microRNAs (version 10) and LNA arrays using human breast cancer samples of grade 2 and grade 3 each consisting of Stages I to III. Approximately 400 significant and valid miRNAs lighted up in one or the other grades/stages classifying them as breast cancer microRNAs.

Figure 1
figure1

Schematic illustration of construction of OncomirdbB using different bioinformatics approaches. MicroRNAs were mined from different databases like miRbase, PhenomiR2.0 and miR2Disease. In order to find targets for these miRNAs, different pathway genes were downloaded from KEGG database and removed the repeated entries by using Perl script. We used miRanda for finding the targets at different energy levels.

Figure 2
figure2

Comparison of human and mouse miRNAs from different databases with oncomiRdbB. Human and mouse miRNAs were mined from various databases listed and compiled as OncomiRdbB. This database has a maximum number of miRNAs compared to MicroCosm, PhenomiR and miR2Disease which list 460, 322 and 83 human miRNAs and 183, 63 and 0 mouse microRNAs respectively.

Identification of novel microRNAs using Taqman Low Density Arrays and LNA microarray

Among the 400 significant miRNAs validated experimentally using 2 array platforms, a set of 34 novel miRNAs was identified as novel breast cancer miRNAs since they were not classified as breast cancer miRNAs earlier. The targets of these miRNAs were also identified using miRanda at 3 energy levels, EL-15, -20 and −25 and the most stringent ones (EL −25) are listed in Table 1. These miRNAs and their targets were implicated in various signaling pathways, as revealed by the MetaCore software (Figure 3a and Table 1). The network of interactions showed involvement of key oncogenic signaling pathways like Wnt, JAK-STAT, PI3K and AKT. A spectrum of miRNAs including miR-337-5p, miR-17-1, miR-15a, miR-491-5p, miR-339, miR-337-3p, miR-241, miR-19a were predicted to down regulate oncogenic targets like TGFβ, BCLXW, BCL-Xl, STATs, c-MYC and SMAD (as represented by red lines). It was also observed that miR-141 positively controls the canonical pathway involving SP1 transcription factor via TGFβ, receptor, which activates matrix metalloproteinases, the deregulation of which plays a crucial role in metastasis.

Table 1 Lists the experimentally validated novel microRNAs and their putative targets in different signaling pathways at EL −25 ΔG kcal/mol
Figure 3
figure3

Interaction and enrichment analysis of the novel miRNA. a. The interaction of novel miRNAs with various signaling molecules reveals positive and negative regulation with downstream effector molecules as shown by MetaCore analysis. Red arrow indicates a down regulation whereas green arrow indicates up regulation of a particular pathway by the given miRNA ( generic binding protein; microRNA; - Transcription factor; - receptor ligand; - generic enzyme; - protein and - regulators). b. Histogram representing enrichment analysis of these novel miRNAs in different diseased conditions in humans. c. The spectrum of novel miRNAs involved in GM-CSF signaling. d & e. Enrichment analysis of miRNAs and their targets shows the list of different signaling networks they are involved and are deregulated in different carcinomas.

miRNA targets

The miRanda program was used to predict microRNA targets in the various oncogenic signaling pathways. This algorithm was run at energy level EL-15 and 711 genes in human and 490 in mice were identified. The stringency level was increased by decreasing the energy levels to EL-20 and EL-25, in order to increase the accuracy and to identify fewer targets with increased specificity (Figure 4a). By computing the ratio of miRNA targets with the total number of genes in the respective pathways, the percentage cooperation among different pathways including the Notch, VEGF, Wnt, MAPK, Apoptotic and JAK-STAT pathways was deciphered and thus their potential involvement in breast cancer development. Notch signaling pathway was found to have the highest (~50%) percentage of cooperativity between the novel miRNAs and their signaling molecules compared to other pathways analyzed in both murine and human hosts (Figure 4b).

Figure 4
figure4

MiRNA target identification at various energy levels in different pathways. a. Target identification of microRNAs at 3 different energy levels was performed on the retrieved sequences from different oncogenic signaling pathways using miRanda. b. shows the percentage cooperation of different microRNA targets of these pathways involved in the development of breast cancer.

We next characterized microRNA target locations within the genes and found that almost 70% of miRNAs target the 3′UTR, 10% of them target the 5′UTR and another 10% were found to target the exons. 5% of miRNAs targeted exons at their 3′ and 5′ UTR, respectively within the gene and the remaining 5% targeted 5′UTR, 3′UTR & exons (Figure 5a). There were many mRNAs which were targeted by multiple miRNAs and among such targets, the Dvl3 gene in the wnt pathway was chosen as an example for further analysis. Interestingly, a number of microRNAs were found to target this gene at its 3′UTR, 5′UTR, exons, or both 3′UTR & 5′UTR or both 3′UTR & exons. The microRNAs targeting the 3′UTR of Dvl3 showed a phylogenetic relationship when their sequences were aligned using ClustalX (Figure 5b). Regulation of Dvl3 by these sets of microRNAs was also deciphered using GeneGo MetaCore (Figure 5c). On the other hand, single miRNAs hitting multiple targets at various genomic locations such as 3′UTR, 5′UTR and exons were also identified. One example, miR-let-7b, had multiple targets at all positions as shown schematically in Figure 5d. Those microRNAs targeting exons and 5′UTR were seen to target key regulators of oncogenic signaling pathways (Additional file 2: Figure S2). The interaction between such microRNAs and their targets in various signaling pathways were delineated using GeneGo MetaCore. The microRNA-target networks and hubs and their therapeutic targets were also deciphered (Figure 3a and Additional file 3: Figure S3a and b).

Figure 5
figure5

Schematic representation of miRNA targets deciphered by miRanda and miRNA interaction with Dvl 3 target gene. a. Represents percentage target hits by miRNAs of different pathway genes at 5'UTR, exons and 3'UTR compiled in oncomiRdbB. b. shows an example where several miRNAs are targeting Dvl3 gene at its various positions. c. Network interaction of miRNAs in regulating Dvl3 target gene. d. Shows miRNA let-7b targeting multiple genes at different positions, 5'UTR, 3'UTR and exons.

Generation of pathway maps using enrichment analysis

Enrichment analysis was performed for the predicted microRNAs and their targets to further support their involvement in breast cancer development in both human and mouse using GeneGo MetaCore. This analysis statistically enriched the miRNAs and their targets with the available dataset in GeneGo MetaCore and generated p-values where high negative log p-values of microRNAs and their targets reflected significant association with breast oncogenesis. The enrichment analysis also supported our percentage co-operative analysis of different oncogenic pathways in breast cancer development and progression (Figure 3b and c). Similarly, further enrichment analysis for relative expression of these novel miRNAs in different disease conditions in humans showed the highest score for immunoproliferative disorders followed by lymphoproliferative disorders (Figure 3d). The spectrum of miRNAs was found to be crucial in GM-CSF signaling as depicted by the pathway maps generated using MetaCore software (Figure 3e).

OncomiRdbB database structure

OncomiRdbB is fully designed and developed as a web interface where Perl-CGI is used to connect the Apache web server which works as a back-end to generate dynamically user friendly HTML front end queries, an open-source graphics system that compiles into java code. OncomiRdbB provides easy search options for users to retrieve meaningful information of oncomiRs of two different genomes, their mature sequences, accession no, their respective targets in different oncogenic pathways, location in chromosome and their respective genes. Users can search query by name, accession no, target gene and sequence of miRNAs. It also facilitates downloading of specific microRNAs along with their target details or the complete list of microRNAs and their target information depending on the query. It provides useful interaction data for these microRNAs and their targets in various signaling pathways at the 3′UTR, 5′UTR and exons. Finally this database also provides links to the parental biological databases from which the information is retrieved. OncomiRdbB is hosted on the web and is freely accessible to those who wish to retrieve the information compiled in this database.

Utility and discussion

MicroRNAs are involved in the regulation of important signaling pathways by controlling the expression of a variety of oncoproteins which are responsible for cancer development and progression [23]. A few existing databases listed these microRNAs and their association with many diseases, including different cancers from diverse organisms as described above. However, none of these specifically describe the class of oncomiRs and their targets for a given cancer. Due to the lack of a properly compiled database, these data remain under utilized. Therefore, we present here a database, OncomiRdbB (OncomiR database for Breast cancer), which contains all the mined and compiled information along with ~400 validated oncomiRs of breast cancer. We also report additional novel oncomiRs experimentally identified in human cancer samples, along with their specific target genes. Compared to miRBase, PhenomiR and miR2Disease, OncomiRdbB collates significantly more microRNAs and their targets in both human and mouse. Thus OnmiRdbB provides better compiled information regarding the oncomiRs involved in breast cancer.

MicroRNAs with oncogenic or tumor-suppressive function are capable of modulating several targets in multiple genetic pathways. Mouse models are the predominant mammalian platform chosen to model cancer and as such they represent ideal platform to study the microRNA cancer association in human and to validate targets [24]. With this premise, we performed sequence alignment using ClustalX in order to find any evolutionary relationships which would suggest functional similarity between these miRNAs. Most of the miRNAs of human and mouse formed tight phylogenetic clusters. Comparative cluster analysis additionally confirmed that these miRNAs have common biological functionality and possess similar mechanisms of action in breast cancer. This analysis also interrogated any conserved functional relationships in the development of breast cancer. For example, hsa-miR-145, hsa-miR-151-3p and hsa-miR-30 showed sequence and functional similarity with mmu-miR-145, mmu-miR-151-3p and mmu-miR-30 respectively.

MicroRNA target identification is an important step in determining its specific roles in regulating a cellular process. MicroRNAs are known to regulate mRNAs post-transcriptionally by binding at the 3′ UTR. However, miRNA target prediction in animals is still in its infancy due to limited knowledge about parameters involved in the interaction between miRNAs and their target. Also, the uneven distribution of miRNA binding locations within the target transcript poses a further challenge to the sensitivity and efficiency of computational prediction [25]. Therefore, finding a consistent microRNA target across different miRNA target prediction tools is a challenging task [26]. In this study we used miRanda at different energy levels and PicTar and TargetScan algorithms to minimize the false positive rates during miRNA target prediction. Apart from the complementarity between the mRNA and miRNA, the miRanda algorithm also takes into account the weighted sum of match and mismatch scores for base pairs and gap penalties, allowing one wobbling pair in the seed region. Thus, use of this algorithm takes into consideration different characteristics of the miRNA-mRNA interaction [27]. The miRanda program predicted ‘perfect’ pairing of miRNA-mRNA at lower energy (EL-25) levels, while not ‘so-perfect’ pairing at higher energy levels (EL-15). Thus, miRNA-target prediction at different energy levels will help users to identify putative targets of all miRNAs and also aid validation of specific breast cancer targets in both genomes.

In OncomiRdbB, a total of 711 human and 490 mouse targets are listed which is high compared to the targets listed in other databases. One of the additional features of OncomiRdbB is that these targets are also classified based on various oncogenic signaling pathways which is not yet available in other databases. McCubrey et al., [28] have shown that there is interaction among Ras/Raf/MEK/ERK/PI3K/PTEN and Akt pathways during carcinogenesis. Likewise, cooperative interaction of Notch and Ras/MAPK pathway in human breast carcinogenesis and hepatocarcinogenesis was demonstrated by Fan et al., [29]. Therefore, the percentage cooperative analysis was carried out to address the role of miRNAs in deregulating multiple oncogenic pathway genes in breast cancer development along with the percentage of target hits by the miRNAs among the total number of targets present in each pathway. Around 30–50 percent target hits in Notch, VEGF, Wnt, Apoptosis, MAPKinase and JAK-STAT pathways indicated a significant intervention by oncomiRs in these pathways during breast cancer development. Although the exact percentage cooperative analysis has not been demonstrated, it is evident that these miRNAs are deregulating various oncogenic pathways. The information generated also led to the identification of candidate microRNAs regulating multiple therapeutic targets to combat breast cancer effectively.

Although the majority of miRNAs have been reported to target the 3′UTR, Zhou et al., [30] identified miRNAs targeting coding region and 5′ UTR, but these were less represented and less effective in translation repression. Inhan et al., [31] showed that while most of the miRNAs interact with 3′UTR of the target genes with its 5′end, a small proportion of miRNAs binding to 5′UTR use the 3′end for effective interaction. Hence, in order to define the interacting locations of miRNAs within the target genes, 3′UTR, 5′UTR and exon sequences were downloaded and used for target location identification. This analysis indicated that majority (around 70%) lie in the 3′UTR, whilst 10% miRNAs target the 5′UTR, 10% targeted exons, 5% of miRNAs targeted exons at their 3′ and 5′ UTR respectively within the gene and the remaining 5% targeted 5′UTR, 3′UTR & exons. Interaction between miRNAs and their targets was analyzed using the ‘R’ algorithm. Through this statistical analysis program, it was observed that either multiple targets are being controlled by a single miRNA or that several miRNAs target single transcripts. Nevertheless, the former situation was found to be a more common phenomenon than the latter. Dvl3 is one example of a target gene where several miRNAs act at the 3′UTR, 5′UTR and exons. A phylogenetic clustering of the 3′UTR miRNAs indicated a conserved functional relationship amongst them. On the other hand, we observed that a single miRNA, let-7b, hit multiple targets of several oncogenic pathways at their 3′UTR, 5′UTR and exons. It was also observed that targets with location in 5′UTR and exons were the key players of oncogenic pathways.

The interaction of miRNAs and their respective targets was deciphered using the GeneGo MetaCore software suite to identify miRNAs with multiple potential targets for therapeutic intervention in breast cancer. The interactive networks and hubs revealed functional co operativity of several miRNAs and target groups in the various signaling pathways. Engels et al., [32] also demonstrated that several miRNAs bind at 3′UTR targets and inhibit translation via co-operative actions [33]. Our analysis also corroborated with this and revealed the possibility of using multiple targets for possible therapeutic intervention. To the best of our knowledge, information on the cooperative interaction of miRNAs and their targets is a unique feature of OncomiRdbB. To support the authenticity of our database as specific to breast cancer, we have performed enrichment analyses of miRNAs and their targets from different oncogenic signaling pathways and provided the degree of relevance of miRNAs and their targets based on p-values, where lower p-values were assigned higher priority (GeneGo MetaCore software). These p-values indicated the probability of a given number of miRNAs and target genes matching with a certain number of miRNAs and target genes responsible for given disease and signaling pathways available in the GeneGo MetaCore suite. The target enrichment analysis supported our hypothesis that there is co-operation among different oncogenic pathways during cancer progression and it also indicated the extent of their potential involvement in cancer progression. This further suggested that our miRNA database for human and mouse could be an acceptable methodology for probing breast cancer miRNAs interrelationships.

We have experimentally validated our database by 2 different array platforms using human breast cancer samples. Approximately 400 miRNAs were valid and significant in one or the other stage/grade. In addition to these, this approach identified 34 novel miRNAs which are not yet reported as breast cancer miRNAs in any of the databases so far. The target identification of these novel miRNAs using miRanda algorithm at 3 energy levels together revealed 162 gene targets. The interaction between the novel miRNAs and their respective targets using MetaCore software delineated the networks and the hubs involved in various oncogenic signaling pathways. Anti-apoptotic molecules such as BCLXW, BCL-Xl were found to be down regulated by the novel miRNAs identified in this study (as predicted by GeneGo pathway), indicating possible roles of these miRNAs in development of cancer and metastatic progression. For example, miR-337-5p, miR-17-1, miR-15a, miR-491-5p, miR-339, miR-337-3p, miR-241, miR-19a were found to modulate oncogenic targets including TGFβ, STATs, c-MYC and SMAD. Taken together, this established a clear mechanistic interaction underlying miRNA deregulation and tumorigenesis.

Our data also indicated potential functional cooperativity of miRNAs and their targets in the development of breast cancer [34]. Hence our finding of cooperation among pathways was supported by the GeneGo MetaCore pathway interaction enrichment analysis. Moreover, we have also deciphered various potential therapeutic targets and their networks. More recently, it has been shown that circulating miRNAs are potential biomarkers for the detection and subsequent staging and grading of breast cancer [35, 36]. Unsupervised hierarchical cluster analysis of the miRNA expression between cancerous and normal tissue samples from patients showed major differences in miRNA expression. This study provides a basis for the blood-borne testing of miRNAs as biomarkers for the detection and subsequent staging of breast cancer. Thus, the in silico cum experimental validation data regarding the novel miRNA generated using OncomiRdbB could be used for future miRNA-based biomarkers and/or targeted therapeutics.

Conclusions

OncomiRdbB is designed as a comprehensive user friendly database which lists miRNAs and their respective targets for breast cancer in both the human and mouse. We have experimentally validated ~400 miRNAs in human breast cancer tissues and found 34 miRNAs which are as yet unreported. Users can retrieve information using miRNA name, sequence, accession no., gene name, organism or simply cancer type. OncomiRdbB gives details about the target genes, their chromosome location and position of the miRNA hits within the gene from various oncogenic signaling pathways. This is the first attempt to delineate the complicated interaction network involving different miRNAs and their targets in five oncogenic signaling pathways for breast cancer. OncomiRdbB is a central resource for cancer biologists and clinicians for further experimental validation of these targets and will also help clinicians in the selection of potential candidates for the development of novel clinical biomarkers and ultimately novel therapeutic interventions.

Availability and requirements

This database is freely available at http://tdb.ccmb.res.in/OncomiRdbB/index.htm. There is no restriction of use for non-academics.

Ethical clearance

This work has been conducted with the approval of human ethical committees of RCC and CCMB, respectively.

References

  1. 1.

    Reinhart BJ, Slack FJ, Basson M, Pasquinelli AE, Bettinger JC, Rougvie AE, Horvitz HR, Ruvkun G: The 21-nucleotide let-7 RNA regulates developmental timing in Caenorhabditis elegans. Nature. 2000, 403 (6772): 901-906. 10.1038/35002607.

  2. 2.

    Hede K: MicroRNAs as Onco-miRs, drivers of cancer. J Natl Cancer Inst. 2010, 102 (17): 1306-1308. 10.1093/jnci/djq349.

  3. 3.

    Wang S, Raghavachari S: Quantifying negative feedback regulation by micro-RNAs. Phys Biol. 2011, 8 (5): 055002-10.1088/1478-3975/8/5/055002.

  4. 4.

    Rajewsky N, Socci ND: Computational identification of microRNA targets. Dev Biol. 2004, 267 (2): 529-535. 10.1016/j.ydbio.2003.12.003.

  5. 5.

    Dai X, Zhuang Z, Zhao PX: Computational analysis of miRNA targets in plants: current status and challenges. Brief Bioinform. 2011, 12 (2): 115-121. 10.1093/bib/bbq065.

  6. 6.

    Griffiths-Jones S, Grocock RJ, van Dongen S, Bateman A, Enright AJ: miRBase: microRNA sequences, targets and gene nomenclature. Nucleic Acids Res. 2006, 34 (Database issue): D140-D144.

  7. 7.

    Hsu PW, Huang HD, Hsu SD, Lin LZ, Tsou AP, Tseng CP, Stadler PF, Washietl S, Hofacker IL: miRNAMap: genomic maps of microRNA genes and their target genes in mammalian genomes. Nucleic Acids Res. 2006, 34 (Database issue): D135-D139.

  8. 8.

    Megraw M, Sethupathy P, Corda B, Hatzigeorgiou AG: miRGen: a database for the study of animal microRNA genomic organization and function. Nucleic Acids Res. 2007, 35 (Database issue): D149-D155.

  9. 9.

    Nam S, Kim B, Shin S, Lee S: miRGator: an integrated system for functional annotation of microRNAs. Nucleic Acids Res. 2008, 36 (Database issue): D159-D164.

  10. 10.

    Xiao F, Zuo Z, Cai G, Kang S, Gao X, Li T: miRecords: an integrated resource for microRNA-target interactions. Nucleic Acids Res. 2009, 37 (Database issue): D105-D110.

  11. 11.

    Ruepp A, Kowarsch A, Schmidl D, Buggenthin F, Brauner B, Dunger I, Fobo G, Frishman G, Montrone C, Theis FJ: PhenomiR: a knowledgebase for microRNA expression in diseases and biological processes. Genome Biol. 2010, 11 (1): R6-10.1186/gb-2010-11-1-r6.

  12. 12.

    Jiang Q, Wang Y, Hao Y, Juan L, Teng M, Zhang X, Li M, Wang G, Liu Y: miR2Disease: a manually curated database for microRNA deregulation in human disease. Nucleic Acids Res. 2009, 37 (Database issue): D98-D104.

  13. 13.

    Lewis BP, Shih IH, Jones-Rhoades MW, Bartel DP, Burge CB: Prediction of mammalian microRNA targets. Cell. 2003, 115 (7): 787-798. 10.1016/S0092-8674(03)01018-3.

  14. 14.

    Krek A, Grun D, Poy MN, Wolf R, Rosenberg L, Epstein EJ, MacMenamin P, da Piedade I, Gunsalus KC, Stoffel M, et al: Combinatorial microRNA target predictions. Nat Genet. 2005, 37 (5): 495-500. 10.1038/ng1536.

  15. 15.

    Bandyopadhyay S, Mitra R: TargetMiner: microRNA target prediction with systematic identification of tissue-specific negative examples. Bioinformatics. 2009, 25 (20): 2625-2631. 10.1093/bioinformatics/btp503.

  16. 16.

    Sethupathy P, Corda B, Hatzigeorgiou AG: TarBase: a comprehensive database of experimentally supported animal microRNA targets. RNA. 2006, 12 (2): 192-197.

  17. 17.

    Kruger J, Rehmsmeier M: RNAhybrid: microRNA target prediction easy, fast and flexible. Nucleic Acids Res. 2006, 34 (web server issue): W451-W454.

  18. 18.

    Sarver AL, Phalak R, Thayanithy V, Subramanian S: S-MED: sarcoma microRNA expression database. Lab Invest. 2010, 90 (5): 753-761. 10.1038/labinvest.2010.53.

  19. 19.

    Enright AJ, John B, Gaul U, Tuschl T, Sander C, Marks DS: MicroRNA targets in Drosophila. Genome Biol. 2003, 5 (1): R1-10.1186/gb-2003-5-1-r1.

  20. 20.

    Kato K, Yamashita R, Matoba R, Monden M, Noguchi S, Takagi T, Nakai K: Cancer gene expression database (CGED): a database for gene expression profiling with accompanying clinical information of human cancer tissues. Nucleic Acids Res. 2005, 33 (Database issue): D533-D536.

  21. 21.

    Smith TF, Waterman MS: Identification of common molecular subsequences. J Mol Biol. 1981, 147 (1): 195-197. 10.1016/0022-2836(81)90087-5.

  22. 22.

    Wickham H: ggplot2: elegant graphics for data analysis. J Stat Softw. 2010, 35 (1): 65-88.

  23. 23.

    Krutovskikh VA, Herceg Z: Oncogenic microRNAs (OncomiRs) as a new class of cancer biomarkers. Bioessays. 2010, 32 (10): 894-904. 10.1002/bies.201000040.

  24. 24.

    Delay C, Hebert SS: MicroRNAs and Alzheimer’s disease mouse models: current insights and future research avenues. Int J Alzheimers Dis. 2011, 2011: 894938-

  25. 25.

    Witkos TM, Koscianska E, Krzyzosiak WJ: Practical aspects of microRNA target prediction. Curr Mol Med. 2011, 11 (2): 93-109. 10.2174/156652411794859250.

  26. 26.

    Alexiou P, Vergoulis T, Gleditzsch M, Prekas G, Dalamagas T, Megraw M, Grosse I, Sellis T, Hatzigeorgiou AG: miRGen 2.0: a database of microRNA genomic information and regulation. Nucleic Acids Res. 2010, 38: D137-D141. 10.1093/nar/gkp888. Database issue

  27. 27.

    Betel D, Wilson M, Gabow A, Marks DS, Sander C: The microRNA.org resource: targets and expression. Nucleic Acids Res. 2008, 36: D149-D153. Database issue

  28. 28.

    McCubrey JA, Steelman LS, Franklin RA, Abrams SL, Chappell WH, Wong EW, Lehmann BD, Terrian DM, Basecke J, Stivala F, Libra M, Evangelisti C, Martelli AM: Targeting the RAF/MEK/ERK, PI3K/AKT and p53 pathways in hematopoietic drug resistance. Adv Enzyme Regul. 2007, 47: 64-103. 10.1016/j.advenzreg.2006.12.013.

  29. 29.

    Fan R, Chen P, Zhao D, Tong JL, Li J, Liu F: Cooperation of deregulated Notch signaling and Ras pathway in human hepatocarcinogenesis. J Mol Histol. 2011, 42 (5): 473-481. 10.1007/s10735-011-9353-3.

  30. 30.

    Zhou X, Duan X, Qian J, Li F: Abundant conserved microRNA target sites in the 5′-untranslated region and coding sequence. Genetica. 2009, 137 (2): 159-164. 10.1007/s10709-009-9378-7.

  31. 31.

    Lee I, Ajay SS, Yook JI, Kim HS, Hong SH, Kim NH, Dhanasekaran SM, Chinnaiyan AM, Athey BD: New class of microRNA targets containing simultaneous 5′-UTR and 3′-UTR interaction sites. Genome Res. 2009, 19 (7): 1175-1183. 10.1101/gr.089367.108.

  32. 32.

    Engels BM, Hutvagner G: Principles and effects of microRNA-mediated post-transcriptional gene regulation. Oncogene. 2006, 25 (46): 6163-6169. 10.1038/sj.onc.1209909.

  33. 33.

    Clancy JL, Wei GH, Echner N, Humphreys DT, Beilharz TH, Preiss T: mRNA isoform diversity can obscure detection of miRNA-mediated control of translation. RNA. 2011, 17 (6): 1025-1031. 10.1261/rna.2567611.

  34. 34.

    Wang J, Sen S: MicroRNA functional network in pancreatic cancer: from biology to biomarkers of disease. J Biosci. 2011, 36 (3): 481-491. 10.1007/s12038-011-9083-4.

  35. 35.

    Chen W, Cai F, Zhang B, Barekati Z, Zhong XY: The level of circulating miRNA-10b and miRNA-373 in detecting lymph node metastasis of breast cancer:potential biomarkers. Tumour Biol. 2012, 34 (1): 455-462.

  36. 36.

    Liu H: MicroRNAs in breast cancer initiation and progression. Cell Mol Life Sci. 2012, 69 (21): 3587-3599. 10.1007/s00018-012-1128-9.

Download references

Acknowledgement

This work was supported by Department of Biotechnology,Ministry of Science & Technology,Government of India,(BT/PR10024/AGR/36/28/2007) & CSIR-TFYP (BSC121-GENESIS) to LDK and programme grant from CR-UK to ARC. The authors acknowledge Meghana D Kumar and Neha Krishna for their help in designing the database and Dr. Dinesh Kumar and Velumani Selvaraj for critical evaluation of the manuscript. We acknowledge Dr.Jayasree K and Dr. Jem Prabhakar, Regional Cancer Centre for providing the breast cancer samples for experimentation.

Author information

Affiliations

Authors

Corresponding authors

Correspondence to Alan R Clarke or Lekha Dinesh Kumar.

Additional information

Competing interests

The authors have declared no financial and non-financial competing interests.

Authors’ contributions

RK, VKV and AR developed the database under the guidance of LDK, ST and ARC. RAN collected and graded the samples, LDK and RAN designed the experimental validation, LDK and VKV wrote the paper, ARC edited the document, RK, VKV, MG and MMI generated pathway maps in GeneGo MetaCore, AR incorporated all changes in the database as per reviewer’s comments and is responsible for updating and maintaining the database. All authors read and approved the final manuscript.

Rimpi Khurana, Vinod Kumar Verma contributed equally to this work.

Electronic supplementary material

Additional file 1: Figure S1: A phylogenetic functional relationship between mi.RNAs of human and mouse using ClustalX: Tight clustering showing the phylogenetic relationship of breast cancer miRNAs in human and mouse as depicted by ClustalX. In the cladogram, branches from the same node represent descendents of a similar ancestor or cluster of the same family indicating their origin from a common ancestor. (TIFF 2 MB)

Additional file 2: Figure S2: MiRNA targets interaction in oncogenic pathways. MiRNAs targeting different ongogenic pathways are shown here. Those microRNAs targeting exons and 5′UTR were seen to target key regulators of oncogenic signaling pathways. These miRNAs could be designated as key regulators of Wnt signaling pathways. (TIFF 2 MB)

Additional file 3: Figure S3: Interaction of different signaling pathways and therapeutic targets depicted using metacore 3a: Represents microRNA-target network interaction and their hubs between several oncogenic pathways and their cooperation in breast cancer development. Figure 4b depicts therapeutic targets in different signaling pathways as deciphered by MetaCore software suite. (TIFF 6 MB)

Authors’ original submitted files for images

Rights and permissions

Reprints and Permissions

About this article

Cite this article

Khurana, R., Verma, V.K., Rawoof, A. et al. OncomiRdbB: a comprehensive database of microRNAs and their targets in breast cancer. BMC Bioinformatics 15, 15 (2014). https://doi.org/10.1186/1471-2105-15-15

Download citation

Keywords

  • MicroRNAs
  • Breast cancer
  • Targets
  • 3'UTR
  • miRanda
  • TLDA