- Software
- Open access
- Published:
Cirscan: a shiny application to identify differentially active sponge mechanisms and visualize circRNA–miRNA–mRNA networks
BMC Bioinformatics volume 25, Article number: 53 (2024)
Abstract
Background
Non-coding RNAs represent a large part of the human transcriptome and have been shown to play an important role in disease such as cancer. However, their biological functions are still incompletely understood. Among non-coding RNAs, circular RNAs (circRNAs) have recently been identified for their microRNA (miRNA) sponge function which allows them to modulate the expression of miRNA target genes by taking on the role of competitive endogenous RNAs (ce-circRNAs). Today, most computational tools are not adapted to the search for ce-circRNAs or have not been developed for the search for ce-circRNAs from user’s transcriptomic data.
Results
In this study, we present Cirscan (CIRcular RNA Sponge CANdidates), an interactive Shiny application that automatically infers circRNA–miRNA–mRNA networks from human multi-level transcript expression data from two biological conditions (e.g. tumor versus normal conditions in the case of cancer study) in order to identify on a large scale, potential sponge mechanisms active in a specific condition. Cirscan ranks each circRNA–miRNA–mRNA subnetwork according to a sponge score that integrates multiple criteria based on interaction reliability and expression level. Finally, the top ranked sponge mechanisms can be visualized as networks and an enrichment analysis is performed to help its biological interpretation. We showed on two real case studies that Cirscan is capable of retrieving sponge mechanisms previously described, as well as identifying potential novel circRNA sponge candidates.
Conclusions
Cirscan can be considered as a companion tool for biologists, facilitating their ability to prioritize sponge mechanisms for experimental validations and identifying potential therapeutic targets. Cirscan is implemented in R, released under the license GPL-3 and accessible on GitLab (https://gitlab.com/geobioinfo/cirscan_Rshiny). The scripts used in this paper are also provided on Gitlab (https://gitlab.com/geobioinfo/cirscan_paper).
Background
Non-coding RNAs (ncRNAs) are by definition, genome sequences not translated into proteins. For years, their biological functions was largely underestimated as they were considered to be junk transcriptional products. Recently, it has however been demonstrated that ncRNAs represent a large part of the human transcriptome and have important functional roles, particularly in cancers [1]. Advances in sequencing technologies have led to the identification of different types of ncRNAs in the cell, such as long non-coding RNAs and microRNAs (miRNAs) [2]. In this study, we are interested in circular RNAs (circRNAs), a category of recently discovered ncRNAs known for their miRNA sponge function [3, 4]. MicroRNAs typically bind to mRNA through miRNA Recognition Elements (MREs), resulting in the degradation of their mRNA targets. However, when a circRNA shares MRE sites for a specific miRNA, it can function as a competitive endogenous RNA (ce-circRNA): the circRNA acts as a sponge, sequestering the miRNA and preventing it from binding to its mRNA targets, thereby indirectly regulating the mRNA expression [5,6,7]. One of the most wellknown ce-circRNA is ciRS-7 (CDR1as), mainly expressed in the brain, which acts as a regulator of the miR-7 with more than 70 binding sites and leads to increased expression levels of miRNA-7 targets [8, 9]. Thanks to their closed-loop structure formed by reverse-splicing, circRNAs are very stable and their capacity to bind to miRNAs seems to be stronger than that of any other ceRNA, leading to their super-sponge naming [10]. These interactions between coding and non-coding RNAs can be represented as circRNA–miRNA–mRNA interaction networks.
Few computational tools have been developed for the identification of ceRNAs but are not adapted to the identification of ce-circRNAs. This constraint is primarily due to their input requirements, which accept only two types of RNA dataset simultaneously (i.e. miRNA and mRNA expression data), and to their predictions databases restricted to interactions between miRNA and mRNA [11, 12]. Moreover, the studies focusing on circRNA–miRNA–mRNA networks have not developed a computational tool that automates the search for large-scale ce-circRNA [13,14,15,16,17,18,19]. Two circRNA functional annotation databases predicting sponge mechanisms involving circRNAs in several tissues have been developed, but they do not allow users to query their own transcriptomic datasets on a large scale, they most often limite queries to circRNA identifiers or sequences [20, 21]. Recently, a computational Nextflow pipeline has been developed that covers the circRNA and miRNA identification and quantification from sequencing data and includes a module for the search of ce-circRNA [22]. For the identification of ceRNA networks, the Nextflow pipeline uses the SPONGE R package [23], which was not initially designed to detect ce-circRNA as it requires miRNA and mRNA expression expression matrix as input. Although it presents an interesting approach, SPONGE provides ce-relationship between targets of miRNA but does not provide ce-mechanisms in the form of circRNA–miRNA–mRNA triplet subnetworks, rendering it not possible to conduct a comparison with our tool. Finally, it is worth noting that, contrary to the tool we propose, the SPONGE tool requires paired samples between each type of RNA expression data and is restricted to sequencing-based data within the Nextflow pipeline. Consequently, it is not feasible to evaluate this pipeline on the benchmark datasets available in our study as they are based on microarray technology and do not necessarily have sample concordance between each RNA type.
Here, we present Cirscan (CIRcular RNA Sponge CANdidates), an interactive Shiny application specifically developed to (i) infer circRNA–miRNA–mRNA networks based on human multi-level transcript expression data (circRNAs, mRNAs, and optionally miRNAs in the case of cancer study, with internal cancer-specific miRNA signatures) from two biological conditions (e.g. tumor versus normal conditions) provided by the user and by any technology (microarray or RNA-seq), and (ii) identify sponge mechanisms, active in a specific condition. The originality of our approach is to take into account multiple criteria (i.e. binding affinity, expression level) arising from ce-circRNA mechanisms to infer accurate ce-circRNA candidates on a large scale. In addition, our multiple criteria strategy leads to the calculation of a sponge score for each circRNA–miRNA–mRNA subnetwork, that greatly facilitates the prioritization of experimental validations.
We evaluated Cirscan on public multi-level transcript expression dataset from two cancers: colorectal cancer [24] and hepatocarcinoma cancer [25] and showed that Cirscan was able to retrieve previously described sponge mechanisms in the top ranked circRNA–miRNA–mRNA subnetworks, as well as identifying novel circRNA sponge candidates.
In summary, Cirscan provides a user-friendly tool to identify and visualize circRNA–miRNA–mRNA networks with potential sponge mechanisms from user’s human multi-level transcript expression data, and may provide novel therapeutic targets. Cirscan Shiny app is freely available on Gitlab (https://gitlab.com/geobioinfo/cirscan_Rshiny). The scripts used in this paper are also provided on Gitlab (https://gitlab.com/geobioinfo/cirscan_paper).
Implementation
Overview of the tool
Cirscan infers circRNA–miRNA–mRNA interactions from coding and non-coding transcriptomic data (Fig. 1A) and identifies circRNA candidates that can act as miRNA sponges (ce-circRNA). The tool takes as input transcriptomic data from two biological conditions (e.g. tumor versus normal conditions), restricted to differentially expressed entities from any technology (microarray or RNA-seq). It requires from the user to normalized and log-transformed the data and a minimum of 3 samples per condition is required to ensure reliability of the downstream analysis. The tool comprises the following main steps: (1) Construction of a score matrix including multiple criteria based on interaction reliability and effectiveness of a sponge mechanism as defined in the following sections (Fig. 1B, Step 1), (2) Calculation of a sponge score for each circRNA–miRNA–mRNA subnetwork (Fig. 1B, Step 2). Cirscan outputs the top ranked circRNA sponge candidates based on the ranked sponge scores. It also enables the visualization of the identified sponge mechanisms as networks and biological enrichment analysis of the genes in the networks of interest (Fig. 1C). A video tutorial and an example dataset are available online to guide the user along the different steps of the application (https://gitlab.com/geobioinfo/cirscan_Rshiny).
Effective sponge mechanisms prerequisites
The identification of circRNAs acting as miRNA sponges requires considering both sequence and expression information. Several prerequisites have been established to ensure the effectiveness of this identification process: (1) A miRNA must be sufficiently expressed whatever the biological conditions to have a functional impact on its targets and must target at least one circRNA and one mRNA, with a sufficient affinity. Importantly we do not assume a differential expression of the miRNA between the two conditions as we hypothesise that a sponge mechanism involving a circRNA affects the bioavailability of the miRNA but not its expression level. (2) Targets of the miRNA (mRNAs and circRNAs) must be co-expressed, i.e differentially expressed between the two conditions in the same direction. Indeed, it is expected that the increased presence of a circRNA that can sponge a miRNA will induce an over-expression of its mRNA targets, as the miRNA can no longer induce their degradation.
Input files
Cirscan requires two types of CSV files from the user. First, multi-level transcript expression data are required as input files, including circRNA, mRNA and optionally miRNA expression matrices. If the user does not have miRNA expression data, miRNA signatures for different types of cancer (respectively 32 and 20 from the TCGA consortium (https://www.cancer.gov/tcga) and CCLE database (www.broadinstitute.org/ccle, [26])) are internally available in Cirscan. There is no constraint concerning the technology used to obtain the expression matrices (array or sequencing), which in return requires the user to properly normalize his data beforehand. Each expression matrix needs to be normalized (e.g. RMA for expression array, TPM for RNA-seq data) and log-transformed, and a minimum of 3 samples per condition is required to ensure reliability of the downstream analysis. Based on the sponge mechanism prerequisites, circRNA and mRNA expression matrices should be restricted to the differentially expressed circRNAs and mRNAs prior to the use of Cirscan. As mentioned above, this restriction is not relevant for miRNAs and no filter is required. Cirscan will automatically keep miRNAs that are sufficiently expressed considering the following assumptions: (1) the sum of the expression of a given miRNA in all samples must be nonzero, (2) for a given miRNA, its expression must be higher that a defined cutoff in more that 90% of the samples, the cutoff being the quartile Q1 of the overall miRNA expression distribution. This selection was also made for the establishment of the miRNA signatures from the TCGA (https://www.cancer.gov/tcga) and CCLE (www.broadinstitute.org/ccle, [26]) datasets.
CircRNAs identifiers must correspond to the human circRNAs referenced in circBase (“\(\textrm{has}\_\textrm{circ}\_\)xxxxxxx”) [27], human miRNAs from miRBase (“hsa-miR-xxx” or “hsa-let-xxx”) [28], and mRNAs must be referenced as official gene symbols. Furthermore, the user must provide another CSV file corresponding to the sample annotation file. This file must contain two columns: “samples” with the sample names referenced in the RNA expression matrices and “conditions” with the corresponding condition it refers to (“\(\textrm{condition}\_{1}\)”, “\(\textrm{condition}\_{2}\)”), knowing that the first condition refers to the control condition. The file format required by Cirscan is detailed in the Home Panel of the Shiny application and in the example dataset provided in the Interaction Ranking Panel.
Identification of sponge mechanisms
In-house database of miRNA-target interactions based on predicted and experimentally validated interactions
A miRNA-target interaction occurs between the “seed sequence” of the miRNA and a complementary sequence called the “miRNA Recognition Element” (MRE) on the target (circRNA or mRNA). We used TargetScan [29] to predict miRNA–circRNA and miRNA–mRNA interactions, providing an affinity score for each interaction.
For miRNA-circRNA interactions, we modified the affinity score calculated by TargetScan (context+ score, [30]) by excluding the 3’-compensatory pairing feature. This modification prevents the introduction of potential bias since, given the recent description of circRNAs, the specific mechanisms underlying circRNA–miRNA pairing remain poorly described in the literature. To reduce false positive interactions predicted by TargetScan, we set a cutoff for the context+ score based on its distribution restricted to experimentally validated miRNA-circRNA interactions extracted from the ENCORI database [31], known for containing experimentally validated RNA interactions. The cutoff was defined as the 95th percentile of the context+ score distribution for interactions supported by at least two CLIP-seq and two degradome-seq experiments [see more details in Additional file 1 and Additional file 1: Figure S1]. For miRNA–mRNA interactions, we used the affinity score provided by TargetScan that incorporates additional criteria relevant to miRNA–mRNA interactions that minimize false positive predictions, such as 3’ UTR length, ORF length, and probability of conserved targeting between species (context++ score, [29]) [see more details in Additional file 1].
In total, the miRNA-target interaction database includes 1,244,932 miRNA–mRNA interactions and 31,045,182 miRNA-circRNA interactions.
Criteria of interaction reliability and sponge effectiveness
In order to identify potential sponge mechanisms involving circRNA, different scores of reliability are calculated. We defined the affinity score \(S_{\textit{Affinity}}\) between a miRNA and it targets as the context score calculated by Targetscan rescaled between 0 and 1: the closer to 1, the stronger the interaction affinity. For each miRNA-target interaction, we also considered the number of MRE sites that we named \(S_{\textit{NbMRE}}\). If an interaction involves multiple MREs for the same miRNA and target of interest, the interaction with the maximum \(S_{\textit{Affinity}}\) is retained. To reduce the potential bias arising from the extensive prediction of MRE influenced by the circRNA target sequence length and their pairing along the entire circRNA sequence, the number of MREs for miRNA-circRNA interactions was normalized by the length of the circRNA sequence as described in the Cerina tool [21] and referenced here as the \(S_{\textit{NbMRE}}\). In order to identify circRNAs that are significantly enriched for MRE binding sites of specific miRNAs, we used a binomial model conditioned on the total number of MRE for the given miRNA on each circRNA and the number of all other miRNA-MRE sites on the given circRNA as background, as proposed in the recent scanMiR tool [32]. The adjusted p-value (Benjamini–Hochberg, BH) of the enrichment test is defined as \(S_{\textit{EnrichMRE}}\). A circRNA–miRNA interaction is kept if the Binomial enrichment test adjusted p-value (BH) is lower than 0.05.
In order to take into account the expression level information, different scores are calculated: the target expression fold change between conditions 1 and 2 (\(S_{\textit{FC}}\)), the median expression of each miRNA under all conditions (\(S_{\textit{MirExpr}}\)) and the highest mean expression of each target between the two conditions (\(S_{\textit{TargetExpr}}\)). Only networks with at least one circRNA and one mRNA with fold change values in the same direction are selected.
Calculation of a sponge score
For each miRNA–mRNA interaction, Cirscan calculates a global score defined as \(SG_{\textit{miRNA-mRNA}}\), which integrates the different criteria described above \(S_{\textit{Affinity}}\), \(S_{\textit{NbMRE}}\), \(S_{\textit{FC}}\), \(S_{\textit{MirExpr}}\), \(S_{\textit{TargetExpr}}\) using the TOPSIS method [33]. Briefly, the TOPSIS method is a multi-criteria decision analysis method which aims to determine the best alternative when different criteria need to be considered together. In our case, the best alternative for each miRNA-mRNA interaction consists in maximizing the following criteria: highest affinity score (\(S_{\textit{Affinity}}\)), high number of MRE binding site (\(S_{\textit{NbMRE}}\)) and highly expressed miRNAs and differentially expressed circRNAs and mRNAs (\(S_{\textit{MirExpr}}\), \(S_{\textit{TargetExpr}}\), \(S_{\textit{FC}}\)).
Then, a score for each miRNA-circRNA interaction is calculated and defined as a sponge score, \(SS\). The sponge score integrates the scores \(S_{\textit{Affinity}}\), \(S_{\textit{NbMRE}}\), \(S_{\textit{EnrichMRE}}\), \(S_{\textit{FC}}\), \(S_{\textit{MirExpr}}\), \(S_{\textit{TargetExpr}}\), \(S_{\textit{EnrichSG}}\), where \(S_{\textit{EnrichSG}}\) corresponds to the adjusted p-value (BH) of the enrichment test of the miRNA–mRNA interactions for the miRNA of interest over all the miRNA–mRNA interactions ranked on the \(SG_{\textit{miRNA-mRNA}}\) score (R package fgsea). In other words, for a given circRNA–miRNA–mRNA subnetwork, if the miRNA–mRNA interactions have high \(SG_{\textit{miRNA-mRNA}}\) scores, this will contribute to increasing the sponge score. This sponge score is calculated using the TOPSIS method and in our case, the best alternative for each miRNA-circRNA interaction consists in maximizing the following criteria; \(S_{\textit{Affinity}}\), \(S_{\textit{NbMRE}}\), \(S_{\textit{MirExpr}}\), \(S_{\textit{TargetExpr}}\), \(S_{\textit{FC}}\), and minimizing the adjusted p-value of enrichment tests, \(S_{\textit{EnrichSG}}\) and \(S_{\textit{EnrichMRE}}\).
MiRNA-circRNA interactions are then ranked according to their sponge score, the top ranked interactions with the highest scores being the most relevant sponge circRNA candidates. Finally, all this information is compiled into a score matrix with 16 columns available and detailed in Cirscan, where \(Rank_{\textit{miRNA-circRNA}}\) is the rank of miRNA-circRNA interactions based on the sponge score \(SS\) and the top ones involving the most relevant sponge circRNA candidates.
Visualization of the sponge mechanisms of interest
The identified circRNA–miRNA–mRNA subnetworks involving potential sponge mechanisms can be visualized in the Network Visualization Panel using the visNetwork R package (v2.0.8, https://github.com/datastorm-open/visNetwork). It is possible to visualize a network by selecting an interaction from the score matrix or by specifying the name of an RNA of interest: nodes correspond to the different types of RNAs (orange miRNAs, green mRNAs, and pink circRNAs) and edges to the miRNA-target interactions. The edge thickness is proportional to the sponge scores (the higher the value, the thicker the edge and the stronger the interaction affinity) and the RNA node of interest is larger than the others.
GO and KEGG enrichment analysis of mRNAs
In order to help the user in the biological interpretation of the visualized network, an enrichment of biological terms on the genes of the visualized network is performed using Enrichr tool [34] with KEGG (\(\textrm{KEGG}\_\textrm{2021}\_\textrm{Human}\)) and GO (\(\textrm{GO}\_\textrm{Biological}\_\textrm{Process}\_2021\)) databases.
Results
Cirscan performance evaluation
We evaluated Cirscan on two public multi-level transcript expression datasets from colorectal cancer (CRC) (n=20, 10 normal and 10 tumor samples) [24] and hepatocarcinoma (HCC) (n=14, 7 normal samples and 7 tumor samples) [35] in order to identify sponge mechanisms active in normal or tumor conditions.
For these two cancers, we retrieved sponge mechanisms referenced in the NcPath database [36], as well as described in the literature [37,38,39,40,41,42] and in a recent review by [43]. We selected sponge mechanisms involving circRNAs for which a circBase identifier exists and a reference is associated, i.e. 30 sponge mechanisms described in the literature in colorectal cancer and 93 in hepatocarcinoma. The tables of sponge mechanisms are provided on our Gitlab web page (https://gitlab.com/geobioinfo/cirscan_paper).
Using the Cirscan tool on the CRC dataset pre-processed [see more details in Additional file 1], we identified 12,850 potential circRNA–miRNA sponge mechanisms (i.e. 1,413 unique circRNAs) [Table available at https://gitlab.com/geobioinfo/cirscan_paper]. Among them, 3 sponge mechanisms were already described in the literature in CRC [40, 44] and significantly enriched in the top ranked sponge mechanisms given by Cirscan (GSEA enrichment [45], p-value = 4.04e\(-\)07) (Fig. 2). Similar results were observed using the TCGA CRC miRNA signature available internally in Cirscan (p-value = 1.51e\(-\)06) [Table available at https://gitlab.com/geobioinfo/cirscan_paper]. We also showed the relevance of TargetScan predictions by confronting the result obtained with randomly predicted interactions [Additional file 1 and Additional file 1: Figure S2.A].
The same approach was applied to the HCC circRNA and mRNA transcriptomic datasets and the TCGA HCC miRNA signature available internally in Cirscan, as no miRNA expression dataset was generated by the same authors [see more details in Additional file 1]. We identified 16,190 potential circRNA–miRNA sponge mechanisms (i.e. 2,089 unique circRNAs) [Table available at https://gitlab.com/geobioinfo/cirscan_paper] including 3 sponge mechanisms already described in the literature in HCC [42, 46,47,48] that were also significantly enriched in the top ranked sponge mechanisms (p-value = 0.03) (Fig. 2). The relevance of TargetScan predictions was also assessed on this application by confronting the result obtained with randomly predicted interactions [Additional file 1 and Additional file 1: Figure S2.B].
This significant enrichment of the sponge mechanisms identified in colorectal cancer and hepatocarcinoma in the literature among the top candidates of Cirscan confirms the reliability of our tool.
Identification of known and novel sponge mechanisms in colorectal cancer
Using the public colorectal cancer dataset, Cirscan identified the \(\textrm{hsa}\_\textrm{circ}\_\textrm{0000977}\):hsa-miR-135b-5p subnetwork, described in the literature by Ding et al. [44], at rank 18/12,850 out of all identified mechanisms (in the top 1%) and rank 17/5,468 out of all mechanisms identified in the normal condition (Fig. 3A). This result was consistent with the study of Ding et al. [44], as the authors describe this mechanisms as downregulated in the tumor condition. The KEGG biological term enrichment analysis of the mRNAs of this subnetwork revealed biological pathways associated with cell proliferation and differenciation involving the tumor suppressor genes APC and SMAD2 (Fig. 3B). Among the sponge mechanisms involved in the tumor condition and described in the literature, Cirscan identified the \(\textrm{has}\_\textrm{circ}\_\)0001806:hsa-miR-193a-5p subnetwork (Fig. 3C) (rank 105/12,850 (top 1%) and rank 32/7,382 (top 1%) mechanisms identified in the tumor condition) and the \(\textrm{has}\_\textrm{circ}\_\)0001955:hsa-miR-145-5p subnetwork (rank 119/12,850 (top 1%) and rank 37/7,382 (top 1%)). The result of the KEGG enrichment of the \(\textrm{has}\_\textrm{circ}\_\)0001806:hsa-miR-193a-5p subnetwork is consistent with the involvement of this sponge mechanism in the cancer with the enrichment of the TCF3 oncogene in the transcriptional misregulation in cancer pathway (Fig. 3D).
We next focused on the best ce-circRNA candidates having a sponge role in the tumor condition. Interestingly, within the top 10 ce-circRNA candidates, the first ce-circRNA has been already shown to have a sponge role in lung and colorectal cancer in several studies (\(\textrm{hsa}\_\textrm{circ}\_\)0072088: [49,50,51,52]). Cirscan highlighted potential novel sponge mechanisms for this circRNA, involving for example the miRNA hsa-miR-214-5p (Fig. 3E) targeting genes associated to signaling pathways enriched with oncogenes, such as PDGFRA and AKT3 (Fig. 3F). Other ce-circRNAs within the top 10 ce-circRNA candidates in the tumor condition have been identified in other cancers as hepatocellular carcinoma, lung or esophageal cancer (\(\textrm{has}\_\textrm{circ}\_\)0000517 [53, 54], \(\textrm{has}\_\textrm{circ}\_\)0000326 [55], \(\textrm{has}\_\textrm{circ}\_\)0000518 [56]). The other best ce-circRNA candidates (\(\textrm{has}\_\textrm{circ}\_\)0000644, \(\textrm{has}\_\textrm{circ}\_\)0011385, \(\textrm{has}\_\textrm{circ}\_\)0006174, \(\textrm{has}\_\textrm{circ}\_\)0003945, \(\textrm{has}\_\textrm{circ}\_\)0008274, \(\textrm{has}\_\textrm{circ}\_\)0043438) have not yet been described in the literature as associated with cancer, making them interesting subjects for further experimental validations.
Identification of known and novel sponge mechanisms in hepatocarcinoma
The same approach was applied to the HCC transcriptomic data. Cirscan identify the \(\textrm{has}\_\textrm{circ}\_\)0074854:hsa-miR-338-3p subnetwork, ranked 1,279/16,190 out of all identified mechanisms (in the top 10%) and ranked 639/8,468 out of all mechanisms identified in the tumor condition (Fig. 4.A), as well as described by Li et al. [47]. The biological term enrichment analysis of the mRNAs of this subnetwork revealed pathways associated with cancer, such as the enrichment of the GNAQ, STAT1 and AKT3 oncogenes in the MAPK signaling pathway (Fig. 4B). Moreover, Cirscan identified at lower ranks the subnetworks \(\textrm{has}\_\textrm{circ}\_\)0000130:hsa-miR-141-3p (rank 2,764/16,190 and rank 1,430/8,468) and \(\textrm{has}\_\textrm{circ}\_\)0001461:hsa-miR-30a-5p (rank 14,037/16,190 and rank 7,333/8,468) already described in the literature ( [46] and [48]), making them relevant mechanisms in the study of HCC.
Interestingly, within the top 10 ce-circRNA candidates described by Cirscan and active in the tumor condition, the third \(\textrm{has}\_\textrm{circ}\_\)0000517 (Fig. 4C) and the seventh \(\textrm{has}\_\textrm{circ}\_\)0001834 were already described in the literature as differentially expressed in HCC [53, 57]. Cirscan identified a potential novel sponge mechanisms for the \(\textrm{has}\_\textrm{circ}\_\)0000517, involving the hsa-miR-330-5p that targets genes, for example the oncogene NRAS, enriched in different cancers, such as hepatocellular carcinoma and bladder cancer (Fig. 4D). Among the 10 ce-circRNA candidates, \(\textrm{has}\_\textrm{circ}\_\)0048122 is identified as being involved in colorectal cancer [25] and \(\textrm{has}\_\textrm{circ}\_\)0000644 in bladder cancer [58]. The other best ce-circRNA candidates (\(\textrm{has}\_\textrm{circ}\_\)0000516, \(\textrm{has}\_\textrm{circ}\_\)0087643, \(\textrm{has}\_\textrm{circ}\_\)0052523, \(\textrm{has}\_\textrm{circ}\_\)0046599, \(\textrm{has}\_\textrm{circ}\_\)0042521, \(\textrm{has}\_\textrm{circ}\_\)0064288) have not yet been described in the literature as associated with cancer, making them interisting subjects for further experimental validations.
Conclusions
Cirscan is a tool that takes as input any human multi-level transcript expression data to identify on a large scale and visualize condition-specific sponge mechanisms involving circRNAs. As shown on two different applications, Cirscan allows the identification of known and novel potential sponge mechanisms that may be further investigated and validated experimentally. This tool can be considered as a companion tool for biologists, facilitating their ability to prioritize sponge mechanisms for experimental validations. The mechanisms revealed by Cirscan could open new avenues for the development of novel RNA-targeted therapies. In particular, it would be possible to use antisense oligonucleotides, which can bind by complementarity to circRNA sequences of interest to inhibit the sponge mechanisms active in a specific condition [59]. Finally, the framework established in this study could be adapted for the search of sponge mechanisms involving long non-coding RNAs and extended to other species.
Data availability
Cirscan is implemented in R, released under the license GPL-3 and accessible on GitLab (https://gitlab.com/geobioinfo/cirscan_Rshiny). The scripts used in this paper are also provided on Gitlab (https://gitlab.com/geobioinfo/cirscan_paper).
Abbreviations
- BH:
-
Benjamini–Hochberg
- ce-circRNA:
-
Competitive Endogenous circRNA
- circRNA:
-
Circular RNA
- Cirscan:
-
CIRcular RNA Sponge CANdidates
- CRC:
-
Colorectal Cancer
- HCC:
-
Hepatocarcinoma
- MRE:
-
MiRNA Recognition Elements
- miRNA:
-
MicroRNA
- ncRNA:
-
Non-coding RNA
References
Anastasiadou E, Jacob LS, Slack FJ. Non-coding RNA networks in cancer. Nat Rev Cancer. 2018;18(1):5–18. https://doi.org/10.1038/nrc.2017.99.
Esteller M. Non-coding RNAs in human disease. Nat Rev Genet. 2011;12(12):861–74. https://doi.org/10.1038/nrg3074.
Lacazette R, Diallo LH, Tatin F, Garmy-Susini B, Prats A-C. LARN circulaire nous joue-t-il des tours? Medicine/Sciences. 2020;36(1):38–43. https://doi.org/10.1051/medsci/2019267.
Liu J, Liu T, Wang X, He A. Circles reshaping the RNA world: from waste to treasure. Mol Cancer. 2017;16(1):58. https://doi.org/10.1186/s12943-017-0630-y.
Zhang Z, Yang T, Xiao J. Circular RNAs: promising biomarkers for human diseases. EBioMedicine. 2018;34:267–74. https://doi.org/10.1016/j.ebiom.2018.07.036.
Hua X, Sun Y, Chen J, Wu Y, Sha J, Han S, Zhu X. Circular RNAs in drug resistant tumors. Biomed Pharmacother. 2019;118: 109233. https://doi.org/10.1016/j.biopha.2019.109233.
Kristensen LS, Jakobsen T, Hager H, Kjems J. The emerging roles of circRNAs in cancer and oncology. Nat Rev Clin Oncol. 2022;19(3):188–206. https://doi.org/10.1038/s41571-021-00585-y.
Memczak S, Jens M, Elefsinioti A, Torti F, Krueger J, Rybak A, Maier L, Mackowiak SD, Gregersen LH, Munschauer M, Loewer A, Ziebold U, Landthaler M, Kocks C, le Noble F, Rajewsky N. Circular RNAs are a large class of animal RNAs with regulatory potency. Nature. 2013;495(7441):333–8. https://doi.org/10.1038/nature11928.
Hansen TB, Jensen TI, Clausen BH, Bramsen JB, Finsen B, Damgaard CK, Kjems J. Natural RNA circles function as efficient microRNA sponges. Nature. 2013;495(7441):384–8. https://doi.org/10.1038/nature11993.
Peng L, Yuan XQ, Li GC. The emerging landscape of circular RNA ciRS-7 in cancer (review). Oncol Rep. 2015;33(6):2669–74. https://doi.org/10.3892/or.2015.3904.
Chiu H-S, Llobet-Navas D, Yang X, Chung W-J, Ambesi-Impiombato A, Iyer A, Kim HR, Seviour EG, Luo Z, Sehgal V, Moss T, Lu Y, Ram P, Silva J, Mills GB, Califano A, Sumazin P. Cupid: simultaneous reconstruction of microRNA-target and ceRNA networks. Genome Res. 2015;25(2):257–67. https://doi.org/10.1101/gr.178194.114.
Do D, Bozdag S. Cancerin: a computational pipeline to infer cancer-associated ceRNA interaction networks. PLOS Comput Biol. 2018;14(7):1006318. https://doi.org/10.1371/journal.pcbi.1006318.
Yi Y, Liu Y, Wu W, Wu K, Zhang W. Reconstruction and analysis of circRNA–miRNA–mRNA network in the pathology of cervical cancer. Oncol Rep. 2019;41(4):2209–25. https://doi.org/10.3892/or.2019.7028.
Das A, Shyamal S, Sinha T, Mishra SS, Panda AC. Identification of potential circRNA–microRNA–mRNA regulatory network in skeletal muscle. Front Mol Biosci. 2021;8:762185.
Gong K, Yang K, Xie T, Luo Y, Guo H, Tan Z, Chen J, Wu Q, Gong Y, Wei L, Luo J, Yao Y, Yang Y, Xie L. Identification of circRNA–miRNA–mRNA regulatory network and its role in cardiac hypertrophy. PLoS ONE. 2023;18(3):0279638. https://doi.org/10.1371/journal.pone.0279638.
Xiong D, Dang Y, Lin P, Wen D, He R, Luo D, Feng Z, Chen G. A circRNA–miRNA–mRNA network identification for exploring underlying pathogenesis and therapy strategy of hepatocellular carcinoma. J Transl Med. 2018;16(1):220. https://doi.org/10.1186/s12967-018-1593-5.
Sun Q, Liu Z, Xu X, Yang Y, Han X, Wang C, Song F, Mou Y, Li Y, Song X. Identification of a circRNA/miRNA/mRNA ceRNA network as a cell cycle-related regulator for chronic sinusitis with nasal polyps. J Inflamm Res. 2022;15:2601–15. https://doi.org/10.2147/JIR.S358387.
Bai S, Wu Y, Yan Y, Shao S, Zhang J, Liu J, Hui B, Liu R, Ma H, Zhang X, Ren J. Construct a circRNA/miRNA/mRNA regulatory network to explore potential pathogenesis and therapy options of clear cell renal cell carcinoma. Sci Rep. 2020;10(1):13659. https://doi.org/10.1038/s41598-020-70484-2.
Ma Y, Zou H. Identification of the circRNA–miRNA–mRNA prognostic regulatory network in lung adenocarcinoma. Genes. 2022;13(5):885. https://doi.org/10.3390/genes13050885.
Chen Y, Yao L, Tang Y, Jhong J-H, Wan J, Chang J, Cui S, Luo Y, Cai X, Li W, Chen Q, Huang H-Y, Wang Z, Chen W, Chang T-H, Wei F, Lee T-Y, Huang H-D. CircNet 2.0: an updated database for exploring circular RNA regulatory networks in cancers. Nucleic Acids Res. 2022;50:93–101. https://doi.org/10.1093/nar/gkab1036.
Cardenas J, Balaji U, Gu J. Cerina: systematic circRNA functional annotation based on integrative analysis of ceRNA interactions. Sci Rep. 2020;10(1):22165. https://doi.org/10.1038/s41598-020-78469-x.
Hoffmann M, Schwartz L, Ciora O-A, Trummer N, Willruth L-L, Jankowski J, Lee HK, Baumbach J, Furth PA, Hennighausen L, List M. circRNA-sponging: a pipeline for extensive analysis of circRNA expression and their role in miRNA sponging. Bioinform Adv. 2023;3(1):093. https://doi.org/10.1093/bioadv/vbad093.
List M, Dehghani Amirabad A, Kostka D, Schulz MH. Large-scale inference of competing endogenous RNA networks with sparse partial correlation. Bioinformatics. 2019;35(14):596–604. https://doi.org/10.1093/bioinformatics/btz314.
Chen Z, Ren R, Wan D, Wang Y, Xue X, Jiang M, Shen J, Han Y, Liu F, Shi J, Kuang Y, Li W, Zhi Q. Hsa_circ_101555 functions as a competing endogenous RNA of miR-597-5p to promote colorectal cancer progression. Oncogene. 2019;38(32):6017–34. https://doi.org/10.1038/s41388-019-0857-8.
Fang Q, Ni C, Cai Z, Li W, Xie J. Prognostic significance of hsa_circ_0048122 to predict liver metastasis in early-stage colorectal cancer. J Clin Lab Anal. 2022;36(8):24577. https://doi.org/10.1002/jcla.24577.
Barretina J, Caponigro G, Stransky N, Venkatesan K, Margolin AA, Kim S, Wilson CJ, Lehr J, Kryukov GV, Sonkin D, Reddy A, Liu M, Murray L, Berger MF, Monahan JE, Morais P, Meltzer J, Korejwa A, Jan-Valbuena J, Mapa FA, Thibault J, Bric-Furlong E, Raman P, Shipway A, Engels IH, Cheng J, Yu GK, Yu J, Aspesi P, de Silva M, Jagtap K, Jones MD, Wang L, Hatton C, Palescandolo E, Gupta S, Mahan S, Sougnez C, Onofrio RC, Liefeld T, MacConaill L, Winckler W, Reich M, Li N, Mesirov JP, Gabriel SB, Getz G, Ardlie K, Chan V, Myer VE, Weber BL, Porter J, Warmuth M, Finan P, Harris JL, Meyerson M, Golub TR, Morrissey MP, Sellers WR, Schlegel R, Garraway LA. The cancer cell line encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature. 2012;483(7391):603–7. https://doi.org/10.1038/nature11003.
Glaar P, Papavasileiou P, Rajewsky N. circBase: a database for circular RNAs. RNA. 2014. https://doi.org/10.1261/rna.043687.113.
McGeary SE, Lin KS, Shi CY, Pham TM, Bisaria N, Kelley GM, Bartel DP. The biochemical basis of microRNA targeting efficacy. Science. 2019;366(6472):1741. https://doi.org/10.1126/science.aav1741.
Agarwal V, Bell GW, Nam J-W, Bartel DP. Predicting effective microRNA target sites in mammalian mRNAs. eLife. 2015;4:05005. https://doi.org/10.7554/eLife.05005.
Garcia DM, Baek D, Shin C, Bell GW, Grimson A, Bartel DP. Weak seed-pairing stability and high target-site abundance decrease the proficiency of lsy-6 and other microRNAs. Nat Struct Mol Biol. 2011;18(10):1139–46. https://doi.org/10.1038/nsmb.2115.
Li J, Liu S, ZhouH H, Qu L-H, Yang J-H. starBase v2.0: decoding miRNA-ceRNA, miRNA-ncRNA and protein-RNA interaction networks from large-scale CLIP-seq data. Nucleic Acids Res. 2014;42:92–7. https://doi.org/10.1093/nar/gkt1248.
Soutschek M, Gross F, Schratt G, Germain P-L. scanMiR: a biochemically based toolkit for versatile and efficient microRNA target prediction. Bioinformatics. 2022;38(9):2466–73. https://doi.org/10.1093/bioinformatics/btac110.
Hwang C-L, Lai Y-J, Liu T-Y. A new approach for multiple objective. Comput Oper Res Decis Mak. 1993;20(8):889–99. https://doi.org/10.1016/0305-0548(93)90109-V.
Chen EY, Tan CM, Kou Y, Duan Q, Wang Z, Meirelles GV, Clark NR, Ma’ayan A. Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool. BMC Bioinform. 2013;14:128. https://doi.org/10.1186/1471-2105-14-128.
Han D, Li J, Wang H, Su X, Hou J, Gu Y, Qian C, Lin Y, Liu X, Huang M, Li N, Zhou W, Yu Y, Cao X. Circular RNA circMTO1 acts as the sponge of microRNA9 to suppress hepatocellular carcinoma progression. Nature. 2017;66(4):1151. https://doi.org/10.1002/hep.29270.
Li Z, Zhang Y, Fang J, Xu Z, Zhang H, Mao M, Chen Y, Zhang L. platform for visualization and enrichment analysis of human non-coding RNA and KEGG signaling pathways. Bioinformatics. 2023;39(1):812. https://doi.org/10.1093/bioinformatics/btac812.
Liu Y, Li H, Ye X, Ji A, Fu X, Wu H, Zeng X. Hsa_circ_0000231 knockdown inhibits the glycolysis and progression of colorectal cancer cells by regulating miR-502-5p/MYO6 axis. World J Surg Oncol. 2020;18(1):255. https://doi.org/10.1186/s12957-020-02033-0.
Jing L, Wu J, Tang X, Ma M, Long F, Tian B, Lin C. Identification of circular RNA hsa_circ_0044556 and its effect on the progression of colorectal cancer. Cancer Cell Int. 2020;20(1):427. https://doi.org/10.1186/s12935-020-01523-1.
Wang L, Wu H, Chu F, Zhang L, Xiao X. Knockdown of circ_0000512 inhibits cell proliferation and promotes apoptosis in colorectal cancer by regulating miR-296-5p/RUNX1 axis. OncoTargets Ther. 2020;13:7357–68. https://doi.org/10.2147/OTT.S250495.
Sun J, Liu J, Zhu Q, Xu F, Kang L, Shi X. Hsa_circ_0001806 acts as a ceRNA to facilitate the stemness of colorectal cancer cells by increasing COL1a1. OncoTargets Ther. 2020;13:6315–27. https://doi.org/10.2147/OTT.S255485.
Zhang L, Liu Y, Tao H, Zhu H, Pan Y, Li P, Liang H, Zhang B, Song J. Circular RNA circUBE2j2 acts as the sponge of microRNA-370-5p to suppress hepatocellular carcinoma progression. Cell Death Dis. 2021;12(11):1–16. https://doi.org/10.1038/s41419-021-04269-4.
Niu Z-S, Wang W-H. Circular RNAs in hepatocellular carcinoma: recent advances. World J Gastrointest Oncol. 2022;14(6):1067–85. https://doi.org/10.4251/wjgo.v14.i6.1067.
Zhou Y, Mao X, Peng R, Bai D. CircRNAs in hepatocellular carcinoma: characteristic, functions and clinical significance. Int J Med Sci. 2022;19(14):2033. https://doi.org/10.7150/ijms.74713.
Ding B, Yao M, Fan W, Lou W. Whole-transcriptome analysis reveals a potential hsa_circ_0001955/hsa_circ_0000977-mediated miRNA–mRNA regulatory sub-network in colorectal cancer. Aging. 2020;12(6):5259–79. https://doi.org/10.18632/aging.102945.
Korotkevich G, Sukhov V, Budin N, Shpak B, Artyomov MN, Sergushichev A. Fast gene set enrichment analysis. BioRxiv. 2021. https://doi.org/10.1101/060012.
Huang X-Y, Huang Z-L, Zhang P-B, Huang X-Y, Huang J, Wang H-C, Xu B, Zhou J, Tang Z-Y. CircRNA-100338 is associated with mTOR signaling pathway and poor prognosis in hepatocellular carcinoma. Front Oncol. 2019;9:392. https://doi.org/10.3389/fonc.2019.00392.
Li Q, Pan X, Zhu D, Deng Z, Jiang R, Wang X. Circular RNA MAT2b promotes glycolysis and malignancy of hepatocellular carcinoma through the miR-338-3p/PKM2 axis under hypoxic stress. Hepatology (Baltimore, Md). 2019;70(4):1298–316. https://doi.org/10.1002/hep.30671.
Wei H, Yan S, Hui Y, Liu Y, Guo H, Li Q, Li J, Chang Z. CircFAT1 promotes hepatocellular carcinoma progression via miR-30a-5p/REEP3 pathway. J Cell Mol Med. 2020;24(24):14561–70. https://doi.org/10.1111/jcmm.16085.
Liang L, Zhang L, Zhang J, Bai S, Fu H. Identification of circRNA-miRNA-mRNA networks for exploring the fundamental mechanism in lung adenocarcinoma. OncoTargets Ther. 2020;13:2945–55. https://doi.org/10.2147/OTT.S235664.
Wang Z, Pei H, Liang H, Zhang Q, Wei L, Shi D, Chen Y, Zhang J. Construction and analysis of a circRNA-mediated ceRNA network in lung adenocarcinoma. OncoTargets Ther. 2021;14:3659–69. https://doi.org/10.2147/OTT.S305030.
Liu Y, Wang X, Bi L, Huo H, Yan S, Cui Y, Cui Y, Gu R, Jia D, Zhang S, Cai L, Li X, Xing Y. Identification of differentially expressed circular RNAs as miRNA sponges in lung adenocarcinoma. J Oncol. 2021;5193913:2021. https://doi.org/10.1155/2021/5193913.
Yin T-F, Zhao DY, Zhou Y-C, Wang Q-Q, Yao S-K. Identification of the circRNA-miRNA-mRNA regulatory network and its prognostic effect in colorectal cancer. World J Clin Cases. 2021;9(18):4520–41. https://doi.org/10.12998/wjcc.v9.i18.4520.
He S, Guo Z, Kang Q, Wang X, Han X. Circular RNA hsa_circ_0000517 modulates hepatocellular carcinoma advancement via the miR-326/SMAD6 axis. Cancer Cell Int. 2020;20(1):360. https://doi.org/10.1186/s12935-020-01447-w.
Wang X, Wang X, Li W, Zhang Q, Chen J, Chen T. Up-regulation of hsa_circ_0000517 predicts adverse prognosis of hepatocellular carcinoma. Front Oncol. 2019;9:1105.
Xu Y, Yu J, Huang Z, Fu B, Tao Y, Qi X, Mou Y, Wang Y, Cao Y, Jiang D, Xie J, Xu Y, Zhao J, Xiong W. Circular RNA hsa_circ_0000326 acts as a miR-338-3p sponge to facilitate lung adenocarcinoma progression. J Exp Clin Cancer Res. 2020;39(1):57. https://doi.org/10.1186/s13046-020-01556-4.
Su H, Lin F, Deng X, Shen L, Fang Y, Fei Z, Zhao L, Zhang X, Pan H, Xie D, Jin X, Xie C. Profiling and bioinformatics analyses reveal differential circular RNA expression in radioresistant esophageal cancer cells. J Transl Med. 2016;14(1):225. https://doi.org/10.1186/s12967-016-0977-7.
Qiu L, Wang T, Ge Q, Xu H, Wu Y, Tang Q, Chen K. Circular RNA signature in hepatocellular carcinoma. J Cancer. 2019;10(15):3361–72. https://doi.org/10.7150/jca.31243.
Zhou H, Huang J. CircRNAs in hepatocellular carcinoma: characteristic, functions and clinical significance. Int J Med Sci. 2023;104: 110590. https://doi.org/10.1016/j.cellsig.2023.110590.
Quemener AM, Bachelot L, Forestier A, Donnou-Fournet E, Gilot D, Galibert M-D. The powerful world of antisense oligonucleotides: from bench to bedside. Wiley Interdiscip Rev RNA. 2020;11(5):1594. https://doi.org/10.1002/wrna.1594.
Acknowledgements
The authors would like to thank the Gene Expression and Oncogenesis team (UMR6290) for helpful discussions. We also acknowledge the GenOuest bioinformatics core facility for providing the computing infrastructure and BioRender software (BioRender.com) for the creation of Fig. 2.
Availability and requirements
Project name: Cirscan Project home page: https://gitlab.com/geobioinfo/cirscan_Rshiny Project application page: https://gitlab.com/geobioinfo/cirscan_paper Operating system(s): Platform independent Programming language: R and CSS Other requirements: R 4.2.2 Licence: GNU General Public Licence Restrictions for academic use: none.
Funding
This study received financial support from the Ligue National Contre le Cancer (LNCC) Départements du Grand-Ouest. RMF is a recipient of a doctoral fellowship from the LNCC and YSA is a recipient of a doctoral fellowship from the LNCC Grand Ouest.
Author information
Authors and Affiliations
Contributions
YB supervised the project and conceptualized the tool. YB & RMF developed and implemented the Cirscan tool. MDG, SC and YSA contributed to the biological inputs. MA contributed to the in-house interaction database construction. YB and RMF wrote the original draft of the manuscript. All authors read, improved and approved the final manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Additional file 1.
Supplementary materials and figures.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Fraboulet, RM., Si Ahmed, Y., Aubry, M. et al. Cirscan: a shiny application to identify differentially active sponge mechanisms and visualize circRNA–miRNA–mRNA networks. BMC Bioinformatics 25, 53 (2024). https://doi.org/10.1186/s12859-024-05668-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12859-024-05668-y