GBCdb: RNA expression landscapes and ncRNA–mRNA interactions in gallbladder carcinoma
BMC Bioinformatics volume 24, Article number: 12 (2023)
Gallbladder carcinoma (GBC), an aggressive malignant tumor of the biliary system, is characterized by high cellular heterogeneity and poor prognosis. Fewer data have been reported for GBC than for other common cancer types. Multi-omics data can contribute to the understanding of the molecular mechanisms of cancer and to cancer diagnosis and prognosis. Herein, to provide a better understanding of the molecular events in GBC pathogenesis, we developed GBCdb (http://tmliang.cn/gbc/), a user-friendly interface for querying and browsing GBC-associated genes and RNA interaction networks built from published multi-omics data, together with experimentally supported data from different molecular levels. GBCdb will help to elucidate the potential biological roles of different RNAs and allow for the exploration of RNA interactions in GBC. These resources will provide an opportunity for unraveling the potential molecular features of gallbladder carcinoma.
Gallbladder carcinoma (GBC), the most common cancer of the biliary tract [1, 2], has a dismal survival rate, largely because of late diagnosis. Most symptomatic patients present with incurable tumors, and the clinical outcome is very poor: the median survival time is less than 1 year and the 5-year overall survival rate is less than 5%. Currently, the most effective treatment for GBC is surgery. However, because the disease is asymptomatic at the early stage, with insidious onset and rapid progression, few patients (less than 10%) are eligible for surgery. Other treatments, such as chemotherapy, targeted therapy, and immune therapy, are available, but only a few patients have a promising prognosis. Therefore, early diagnosis of GBC is essential, and the identification of specific and sensitive biomarkers is critical to improve patient outcome.
Gene mutations and aberrant signaling pathways play key roles in GBC tumorigenesis. Mutations in the TP53, ERBB2/ERBB3 and KRAS genes are frequently detected in GBC and are associated with clinical outcomes and treatment efficacy [5,6,7,8]. HER2 gene (ERBB2) amplification may be a low-frequency driver with potential predictive value. RIP-1 inhibits the ability of GBC cells to grow and invade in vitro, and p53 gene expression is a prognostic factor for subserosal GBC. Non-coding RNAs (ncRNAs), mainly including microRNAs (miRNAs), long non-coding RNAs (lncRNAs) and circular RNAs (circRNAs), can function as important regulators of gene expression. Many ncRNAs have critical roles in tumorigenesis, and some ncRNAs function as competing endogenous RNAs (ceRNAs) to perturb gene expression. For example, miR-365 inhibits the progression of GBC by directly targeting RAC1 and may be a novel prognostic biomarker for GBC. The lncRNA TMPO-AS1 promotes cell proliferation, migration, invasion and epithelial-to-mesenchymal transition by regulating the miR-1179/E2F2 axis. miR-4733-5p promotes GBC progression by directly targeting Kruppel-like factor 7, and miR-4461 may inhibit the progression of GBC by regulating EGFR/AKT signaling. The mechanisms underlying the occurrence and development of GBC may involve aberrant alterations of multiple molecular pathways. Thus, multi-omics analysis of the molecular landscape of GBC is critical to understand the pathogenesis of this complex disease.
The molecular events underlying GBC pathogenesis, especially at the multi-omics level, are still unclear. The highly aggressive nature and poor prognosis of this cancer, together with the significant differences among different grades (Fig. 1A), pose a great challenge to relevant studies. To address these limitations and understand the molecular landscapes at multiple levels, we constructed GBCdb (http://tmliang.cn/gbc) to present the GBC-associated RNA expression landscape and potential RNA interactions (among mRNAs, miRNAs, lncRNAs and circRNAs), along with experimentally verified GBC-associated genes obtained from the literature (Fig. 1B). GBCdb is a user-friendly database for browsing, searching, and downloading GBC-related multi-omics results, especially RNA interaction networks that can reveal potential ncRNA relationships in the gene expression process. This work provides a platform to improve understanding of the detailed multi-omics RNA landscapes of GBC, especially GBC-associated RNA interactions, which will support future studies on cancer treatment.
The overall GBC-associated RNA expression profiles
mRNA expression data were collected from public data (Additional file 1: Table S1). Many genes showed consistent expression patterns among different datasets (Fig. 1C), and most exhibited dominant expression patterns (Fig. 1D). A total of 119 differentially expressed mRNAs were detected in at least two datasets. Gene Ontology analysis indicated that these candidate GBC-associated genes have potential roles in nuclear division and cell cycle pathways (Fig. 1E). Because of the small sample sizes and limited datasets, all differentially expressed mRNAs were used in subsequent analyses.
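The "detected in at least two datasets" criterion above can be illustrated with a minimal Python sketch. The dataset labels and gene identifiers below are hypothetical placeholders, not values from Table S1, and the function name `recurrent_genes` is introduced here only for illustration.

```python
from collections import Counter


def recurrent_genes(de_sets, min_datasets=2):
    """Return genes called differentially expressed in >= min_datasets datasets."""
    # set() guards against a gene being listed twice within one dataset
    counts = Counter(gene for genes in de_sets.values() for gene in set(genes))
    return sorted(gene for gene, n in counts.items() if n >= min_datasets)


# Hypothetical per-dataset lists of differentially expressed genes
de_sets = {
    "dataset_A": ["G1", "G2", "G3"],
    "dataset_B": ["G2", "G1", "G4"],
    "dataset_C": ["G2", "G5", "G5"],
}

recurrent = recurrent_genes(de_sets)
print(recurrent)
```

Raising `min_datasets` makes the screen stricter; with few available datasets, a threshold of two is about the only setting that still leaves candidates.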
To understand the potential interactions among different RNAs, especially the regulatory roles of ncRNAs, expression analysis was performed to screen GBC-associated ncRNAs. Some differentially expressed miRNAs were identified (Fig. 2A, B), and 304 miRNAs were detected in GSE104165. Some homologous miRNAs, such as those in the let-7 gene family, showed similar expression patterns (Fig. 2C), indicating that they may exhibit similar functions because of their homologous sequences. From the experimentally validated miRNA-mRNA interactions and expression patterns, a complex candidate miRNA-mRNA interaction network was constructed (Fig. 2D). Most of the mRNAs involved showed significantly up-regulated expression, and their negative regulators showed down-regulated expression. Some miRNAs had multiple target mRNAs, especially miR-29a-3p, indicating that this miRNA has multiple regulatory roles in gene expression. From the miRNA-lncRNA interactions, the RNA network was further extended to three RNA types with potential regulatory relationships based on the ceRNA network (Fig. 2E). lncRNAs may act as miRNA sponges to perturb mRNA expression. All these RNAs showed abnormal expression patterns in tumor samples (Fig. 2F), suggesting complex interactions among different RNAs. Potential relationships were detected among miRNAs, mRNAs and circRNAs, but no significantly dysregulated circRNAs were obtained because of the limited circRNA data.
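One way to picture the construction of such a lncRNA-miRNA-mRNA network is sketched below in Python. The filtering rule (the miRNA must change in the opposite direction to both its mRNA target and the lncRNA) is an assumption made for illustration, since the text only states that expression relationships were screened; all RNA names and fold changes are hypothetical.

```python
from collections import defaultdict


def cerna_triplets(mirna_mrna, mirna_lncrna, log2fc):
    """Join miRNA-mRNA and miRNA-lncRNA pairs on their shared miRNA.

    Assumed rule: keep a triplet only when the miRNA changes in the
    opposite direction to both the mRNA and the lncRNA.
    """
    lnc_by_mirna = defaultdict(list)
    for mirna, lnc in mirna_lncrna:
        lnc_by_mirna[mirna].append(lnc)

    triplets = []
    for mirna, mrna in mirna_mrna:
        for lnc in lnc_by_mirna.get(mirna, []):
            values = [log2fc.get(name) for name in (lnc, mirna, mrna)]
            if None in values:
                continue  # skip RNAs without an expression estimate
            fc_lnc, fc_mir, fc_mrna = values
            if fc_mir * fc_mrna < 0 and fc_mir * fc_lnc < 0:
                triplets.append((lnc, mirna, mrna))
    return triplets


# Hypothetical interaction pairs and log2 fold changes (tumor vs. normal)
mirna_mrna = [("miR-a", "mRNA-1"), ("miR-a", "mRNA-2"), ("miR-b", "mRNA-3")]
mirna_lncrna = [("miR-a", "lnc-X"), ("miR-b", "lnc-Y")]
log2fc = {
    "miR-a": -2.0, "mRNA-1": 1.8, "mRNA-2": -1.5, "lnc-X": 2.1,
    "miR-b": 1.6, "mRNA-3": -1.9, "lnc-Y": 1.4,
}

triplets = cerna_triplets(mirna_mrna, mirna_lncrna, log2fc)
print(triplets)
```

Only the miR-a/mRNA-1/lnc-X combination survives here: mRNA-2 moves in the same direction as miR-a, and lnc-Y moves in the same direction as miR-b.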
We performed further analysis to screen candidate hub genes from experimentally supported GBC-associated proteins. A total of 26 candidate hub genes were identified (Fig. 3A), including the RB1, MUC1, SKP2, HP, APCS and AZGP1 genes. MUC1 has been associated with the progression of GBC, and Skp2 may be an independent prognostic factor for GBC; these studies suggested the potential roles of these factors in the occurrence and development of cancer. These genes were also potential drug targets (Fig. 3B), and most exhibited significantly dysregulated expression patterns in some datasets (such as GSE76633) (Fig. 3C). Experimentally supported GBC-related genes, proteins and ncRNAs, together with the mutation or methylation patterns, represent a basis for further study of this complex disease.
Based on the collected data and primary analysis, GBCdb was developed to present information on the molecular events in GBC pathogenesis. GBCdb contains results from the primary analysis of RNA expression profiles (mRNA, miRNA, lncRNA and circRNA), RNA interaction networks, TF regulatory networks and experimentally supported GBC-associated genes. GBCdb has a user-friendly web interface (Fig. 3D), which allows users to query the database via multiple modules. (1) The “Search” module can be used to search different RNA types, including mRNA, miRNA, lncRNA and circRNA, and presents the detailed expression patterns of each RNA in different datasets. Because GBC-related data are limited, additional experimentally supported data for a specific gene (such as mutation or methylation data) are also presented. For lncRNAs, the significant drug-lncRNA correlations are also presented. Although no significantly dysregulated circRNAs were detected because of the limited data, miRNA-circRNA interactions are presented to provide potential insights into the role of circRNAs as miRNA sponges. (2) The “RNA network” module presents the complete candidate mRNA-miRNA-lncRNA network from the original screen. For each miRNA-mRNA or miRNA-lncRNA pair, potential expression relationships are first screened. LncRNAs can act as miRNA sponges to perturb mRNA expression, and the RNA interaction network provides a complex regulatory network. The detailed interactions for each gene are presented to indicate the potential regulatory relationships. The TF-regulatory network is also presented for the differentially expressed genes. Users can select the type of input molecule (RNA or TF-target) using the pull-down menu and then enter the name in the search box. Fuzzy search and input prompts are supported. Users can quickly obtain the upstream and downstream genes or transcription regulators of the target molecule.
(3) The “Download” module is used to download all the relevant differential expression profiles in different datasets. (4) The “Help” module contains detailed documentation and tutorials. GBCdb welcomes any feedback via the email address provided on the “Contact Us” page.
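The upstream/downstream lookup described for the “RNA network” module can be pictured with the following Python sketch. It is not GBCdb's implementation: the case-insensitive substring matching, the function names `fuzzy_match` and `neighbors`, and the TF-target edges are all assumptions chosen for illustration.

```python
def fuzzy_match(query, names):
    """Case-insensitive substring match, a simple stand-in for fuzzy search."""
    q = query.strip().lower()
    return sorted(name for name in names if q in name.lower())


def neighbors(name, tf_target_edges):
    """Return (upstream regulators, downstream targets) of one molecule."""
    upstream = sorted({tf for tf, target in tf_target_edges if target == name})
    downstream = sorted({target for tf, target in tf_target_edges if tf == name})
    return upstream, downstream


# Hypothetical TF -> target edges
edges = [
    ("TF_A", "GENE_1"),
    ("TF_A", "TF_B"),
    ("TF_B", "GENE_2"),
    ("TF_C", "TF_B"),
]
all_names = {name for edge in edges for name in edge}

matches = fuzzy_match("tf_", all_names)
upstream, downstream = neighbors("TF_B", edges)
print(matches, upstream, downstream)
```

A molecule such as TF_B can be both a regulator and a target, which is why the lookup returns the two directions separately.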
Discussion and future prospects
GBC, an extremely malignant tumor, has high invasion and metastasis rates and is characterized by poor prognosis and a high mortality rate. The precise molecular mechanisms of GBC remain unclear. Although many studies have reported critical GBC-associated genes, few studies have focused on the expression landscapes via integrative analysis of different RNAs. Many genes and molecules, including diverse ncRNAs, play critical roles in the pathophysiological process, and a better understanding of their interactions is important to understand the molecular mechanisms of GBC. Single-cell RNA sequencing allows for the exploration of intratumoral heterogeneity and cancer progression and provides insights into the occurrence and development of cancer. In this study, we aimed to provide detailed molecular expression profiles and GBC-associated RNA interaction networks that will contribute to a better understanding of cancer from multi-omics data. We collected and analyzed relevant data from public databases and the literature, and then developed GBCdb, a database containing multi-omics data and RNA interaction networks. Multi-omics data were mainly obtained from the GEO database (Additional file 1: Table S1), including expression profiles of mRNAs, miRNAs, lncRNAs and circRNAs. Other relevant experimentally supported data were obtained from published studies, mainly including GBC-associated genes and molecular features. The detailed expression patterns and the potential interactions among diverse RNAs helped establish GBC-associated RNA landscapes, which were used for screening candidate critical RNAs. Although these RNA interactions were primarily obtained from different datasets because of the limited GBC-related data, the candidate RNA networks still indicate potential interactions or crosstalk among different RNAs, and even among different biological pathways.
In the future, GBCdb will contain more data from multiple molecular levels. The current data are not sufficient for a systematic analysis because of the limited datasets, which is partly because of the poor prognosis of GBC. Experimentally validated RNA interactions will be included to construct an RNA interaction network that tracks the coding-non-coding RNA regulatory network, especially on the basis of the ceRNA network. We will update GBCdb by collecting and reanalyzing single-cell sequencing data. Finally, using the screened GBC-associated genes, a pan-cancer analysis will be performed to understand the potential expression patterns and RNA interaction networks in different cancer types, which will contribute to further understanding of their biological roles in GBC.
Taken together, GBCdb might provide a useful resource for understanding the detailed expression landscapes, the interaction networks among different RNAs and experimentally supported data from published studies. GBCdb provides a user-friendly interface for querying and browsing detailed information and will help users understand the potential RNA interactions and biological functions associated with GBC. The database will be updated as more multi-omics data become available. We believe that GBCdb will be a valuable resource for understanding the RNA expression landscapes and interaction networks, and that it will contribute to exploring the potential molecular mechanisms of GBC.
Materials and methods
GBC-related multi-omics data, mainly including expression data of mRNA, miRNA, lncRNA and circRNA, were retrieved from the NCBI GEO database [16, 20,21,22,23,24] (Additional file 1: Table S1). To further understand GBC-associated genes, relevant experimentally supported data were also collected from published studies. For interactions between different RNAs, particularly ncRNA–mRNA interactions, experimentally supported miRNA-mRNA interactions and miRNA-lncRNA interactions were obtained from starBase 2.0, and miRNA-circRNA interactions were mainly downloaded from circbank. The drug-lncRNA correlations were obtained from lncMAP to present the potential roles of lncRNAs in cancer treatment. TF-target data were downloaded from hTFtarget to explore the TF-regulatory network.
Differential RNA expression profiles and function enrichment analysis
For each RNA dataset obtained, differentially expressed RNA profiles were estimated with the limma package. To reduce the impact of batch effects, we applied ComBat in the process. An RNA was defined as dysregulated if |log2FC| > 1.2 and padj < 0.05. Functional analysis of candidate genes was performed with the Database for Annotation, Visualization and Integrated Discovery (DAVID) version 6.8 and clusterProfiler 4.0 to understand their potential biological roles. Additionally, to estimate the potential correlations between candidate hub genes and drugs, drug sensitivity analysis was performed using GSCA.
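The dysregulation thresholds stated above can be written as a small filter. The following Python sketch covers only the thresholding step; the model fitting and p-value adjustment were done by the authors in R with limma and are not reproduced here. The RNA names and values are hypothetical, and the up/down labelling by the sign of log2FC is an assumption.

```python
def is_dysregulated(log2fc, padj, fc_cutoff=1.2, p_cutoff=0.05):
    """Apply the stated rule: |log2FC| > 1.2 and padj < 0.05."""
    return abs(log2fc) > fc_cutoff and padj < p_cutoff


def classify(results):
    """Label each dysregulated RNA as 'up' or 'down' by the sign of log2FC."""
    calls = {}
    for name, (log2fc, padj) in results.items():
        if is_dysregulated(log2fc, padj):
            calls[name] = "up" if log2fc > 0 else "down"
    return calls


# Hypothetical differential expression results: name -> (log2FC, padj)
results = {
    "rna_1": (2.3, 0.001),   # passes both cutoffs
    "rna_2": (-1.5, 0.02),   # passes both cutoffs, negative direction
    "rna_3": (1.1, 0.001),   # fold change too small
    "rna_4": (3.0, 0.2),     # not significant after adjustment
    "rna_5": (1.2, 0.01),    # exactly at the cutoff, excluded by the strict '>'
}

calls = classify(results)
print(calls)
```

Because both inequalities are strict, an RNA sitting exactly on a cutoff is not called dysregulated.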
To evaluate whether cancer grade had potential prognostic value in GBC patients, survival analysis was performed on the Surveillance, Epidemiology, and End Results (SEER) dataset with the survival R package. All cases were obtained from the SEER Program (http://www.seer.cancer.gov) SEER*Stat database released in May 2022: version 8.4.1. The log-rank test was used to assess the differences among the different grades. p < 0.05 indicated statistical significance.
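To make the log-rank comparison concrete, here is a minimal Python sketch of the standard two-group log-rank test. The authors used the survival R package and compared several grades; this simplified two-group version and its follow-up times are purely hypothetical and are not SEER data.

```python
import math


def logrank_two_groups(times_a, events_a, times_b, events_b):
    """Two-group log-rank test; returns (chi-square statistic, p-value).

    events are 1 for an observed death and 0 for a censored observation.
    """
    data = [(t, e, 0) for t, e in zip(times_a, events_a)]
    data += [(t, e, 1) for t, e in zip(times_b, events_b)]
    event_times = sorted({t for t, e, _ in data if e})

    obs_minus_exp = 0.0
    variance = 0.0
    for t in event_times:
        n_a = sum(1 for ti, _, g in data if ti >= t and g == 0)  # at risk, group A
        n_b = sum(1 for ti, _, g in data if ti >= t and g == 1)  # at risk, group B
        d_a = sum(1 for ti, e, g in data if ti == t and e and g == 0)
        d_b = sum(1 for ti, e, g in data if ti == t and e and g == 1)
        n, d = n_a + n_b, d_a + d_b
        obs_minus_exp += d_a - d * n_a / n  # observed minus expected in group A
        if n > 1:
            variance += d * (n_a / n) * (n_b / n) * (n - d) / (n - 1)

    if variance == 0:
        raise ValueError("log-rank statistic is undefined for these data")
    chi2 = obs_minus_exp ** 2 / variance
    p_value = math.erfc(math.sqrt(chi2 / 2))  # chi-square upper tail, 1 df
    return chi2, p_value


# Hypothetical follow-up times in months; 1 = death, 0 = censored
low_times, low_events = [24, 30, 36, 40, 48], [1, 0, 1, 0, 0]
high_times, high_events = [3, 5, 6, 8, 12], [1, 1, 1, 1, 1]

chi2, p_value = logrank_two_groups(low_times, low_events, high_times, high_events)
print(f"chi2 = {chi2:.2f}, p = {p_value:.4f}")
```

Comparing more than two grades at once requires the multi-group form of the test, which uses a covariance matrix rather than a single variance term.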
Network visualization and statistical analysis
From the potential interactions among different RNAs, an RNA interaction network was constructed using Cytoscape 3.8.2. The collected experimentally supported GBC-associated proteins were used to survey the potential hub genes via protein–protein interaction (PPI) networks using the STRING online database. The Wilcoxon rank-sum test was used to test for differences between groups. All analyses were performed with the R programming language (version 4.0.5).
Using the collected data and primary analysis, we developed GBCdb to query and browse GBC-associated RNA expression profiles, RNA interactions, experimentally supported multi-omics data and the TF-regulatory network.
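The text does not state which criterion was used to call hub genes from the PPI network. One common choice is node degree, sketched below in Python as an assumption; the protein labels and edges are hypothetical and are not STRING output.

```python
from collections import defaultdict


def rank_by_degree(edges):
    """Rank nodes of an undirected network by their number of distinct partners."""
    partners = defaultdict(set)
    for a, b in edges:
        if a != b:  # ignore self-interactions
            partners[a].add(b)
            partners[b].add(a)
    # Highest degree first; ties broken alphabetically for a stable order
    return sorted(
        ((node, len(nbrs)) for node, nbrs in partners.items()),
        key=lambda item: (-item[1], item[0]),
    )


# Hypothetical PPI edges; ("P3", "P1") duplicates ("P1", "P3")
edges = [("P1", "P2"), ("P1", "P3"), ("P1", "P4"), ("P2", "P3"), ("P3", "P1")]

ranking = rank_by_degree(edges)
print(ranking)
```

Storing partners in sets means duplicated or reversed edges do not inflate a node's degree.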
Availability of data and materials
GBCdb (http://tmliang.cn/gbc/) is freely available to the public without registration or login requirements.
Varshney S, Butturini G, Gupta R. Incidental carcinoma of the gallbladder. Eur J Surg Oncol. 2002;28:4–10.
Wistuba II, Gazdar AF. Gallbladder cancer: lessons from a rare tumour. Nat Rev Cancer. 2004;4:695–706.
Mao W, Deng F, Wang D, Gao L, Shi X. Treatment of advanced gallbladder cancer: a SEER-based study. Cancer Med. 2020;9:141–50.
Hundal R, Shaffer EA. Gallbladder cancer: epidemiology and outcome. Clin Epidemiol. 2014;6:99–109.
Li M, Liu F, Zhang F, Zhou W, Jiang X, Yang Y, et al. Genomic ERBB2/ERBB3 mutations promote PD-L1-mediated immune escape in gallbladder cancer: a whole-exome sequencing analysis. Gut. 2019;68:1024–33.
Li M, Zhang Z, Li X, Ye J, Wu X, Tan Z, et al. Whole-exome and targeted gene sequencing of gallbladder carcinoma identifies recurrent mutations in the ErbB pathway. Nat Genet. 2014;46:872–6.
Moreno M, Pimentel F, Gazdar AF, Wistuba II, Miquel JF. TP53 abnormalities are frequent and early events in the sequential pathogenesis of gallbladder carcinoma. Ann Hepatol. 2005;4:192–9.
Shukla SK, Singh G, Shahi KS, Bhuvan, Pant P. Genetic Changes of P(53) and Kras in Gallbladder Carcinoma in Kumaon Region of Uttarakhand. J Gastrointest Cancer. 2020;51:552–9.
Albrecht T, Rausch M, Roessler S, Geissler V, Albrecht M, Halske C, et al. HER2 gene (ERBB2) amplification is a low-frequency driver with potential predictive value in gallbladder carcinoma. Virchows Arch. 2020;476:871–80.
Zhu G, Chen X, Wang X, Li X, Du Q, Hong H, et al. Expression of the RIP-1 gene and its role in growth and invasion of human gallbladder carcinoma. Cell Physiol Biochem. 2014;34:1152–65.
Roa EI, Lantadilla HS, Ibacache SG, de Aretxabala UX. [p53 and p27 gene expression in subserosal gallbladder carcinoma]. Rev Med Chil. 2009;137:1017–22.
Jiang ZB, Ma BQ, Feng Z, Liu SG, Gao P, Yan HT. miR-365 inhibits the progression of gallbladder carcinoma and predicts the prognosis of Gallbladder carcinoma patients. Cell Cycle. 2021;20:308–19.
Sui Z, Sui X. Long non-coding RNA TMPO-AS1 promotes cell proliferation, migration, invasion and epithelial-to-mesenchymal transition in gallbladder carcinoma by regulating the microRNA-1179/E2F2 axis. Oncol Lett. 2021;22:855.
Hu X, Zhang J, Bu J, Yang K, Xu S, Pan M, et al. MiR-4733-5p promotes gallbladder carcinoma progression via directly targeting kruppel like factor 7. Bioengineered. 2022;13:10691–706.
Yan X, Yang P, Liu H, Zhao Y, Wu Z, Zhang B. miR-4461 inhibits the progression of Gallbladder carcinoma via regulating EGFR/AKT signaling. Cell Cycle. 2022;21:1166–77.
Goeppert B, Truckenmueller F, Ori A, Fritz V, Albrecht T, Fraas A, et al. Profiling of gallbladder carcinoma reveals distinct miRNA profiles and activation of STAT1 by the tumor suppressive miRNA-145-5p. Sci Rep. 2019;9:4796.
Park SY, Roh SJ, Kim YN, Kim SZ, Park HS, Jang KY, et al. Expression of MUC1, MUC2, MUC5AC and MUC6 in cholangiocarcinoma: prognostic impact. Oncol Rep. 2009;22:649–57.
Li SH, Li CF, Sung MT, Eng HL, Hsiung CY, Huang WW, et al. Skp2 is an independent prognosticator of gallbladder carcinoma among p27(Kip1)-interacting cell cycle regulators: an immunohistochemical study of 62 cases by tissue microarray. Mod Pathol. 2007;20:497–507.
Chen P, Wang Y, Li J, Bo X, Wang J, Nan L, et al. Diversity and intratumoral heterogeneity in human gallbladder cancer progression revealed by single-cell RNA sequencing. Clin Transl Med. 2021;11:e462.
Qin Y, Zheng Y, Huang C, Li Y, Gu M, Wu Q. Downregulation of miR-181b-5p inhibits the viability, migration, and glycolysis of gallbladder cancer by upregulating PDHX under hypoxia. Front Oncol. 2021;11:683725.
Li H, Hu Y, Jin Y, Zhu Y, Hao Y, Liu F, et al. Long noncoding RNA lncGALM increases risk of liver metastasis in gallbladder cancer through facilitating N-cadherin and IL-1beta-dependent liver arrest and tumor extravasation. Clin Transl Med. 2020;10:e201.
Wu XS, Wang F, Li HF, Hu YP, Jiang L, Zhang F, et al. LncRNA-PAGBC acts as a microRNA sponge and promotes gallbladder tumorigenesis. EMBO Rep. 2017;18:1837–53.
Xu S, Zhan M, Jiang C, He M, Yang L, Shen H, et al. Genome-wide CRISPR screen identifies ELP5 as a determinant of gemcitabine sensitivity in gallbladder cancer. Nat Commun. 2019;10:5492.
Wang S, Zhang Y, Cai Q, Ma M, Jin LY, Weng M, et al. Circular RNA FOXP1 promotes tumor progression and Warburg effect in gallbladder cancer by regulating PKLR expression. Mol Cancer. 2019;18:145.
Li JH, Liu S, Zhou H, Qu LH, Yang JH. starBase v2.0: decoding miRNA-ceRNA, miRNA-ncRNA and protein-RNA interaction networks from large-scale CLIP-Seq data. Nucleic Acids Res. 2014;42:D92–7.
Liu M, Wang Q, Shen J, Yang BB, Ding X. Circbank: a comprehensive database for circRNA with standard nomenclature. RNA Biol. 2019;16:899–905.
Li Y, Li L, Wang Z, Pan T, Sahni N, Jin X, et al. LncMAP: Pan-cancer atlas of long noncoding RNA-mediated transcriptional network perturbations. Nucleic Acids Res. 2018;46:1113–23.
Zhang Q, Liu W, Zhang HM, Xie GY, Miao YR, Xia M, et al. hTFtarget: a comprehensive database for regulations of human transcription factors and their targets. Genom Proteom Bioinf. 2020;18:120–8.
Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43:e47.
Leek JT, Johnson WE, Parker HS, Jaffe AE, Storey JD. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics. 2012;28:882–3.
Huang DW, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4:44–57.
Wu T, Hu E, Xu S, Chen M, Guo P, Dai Z, et al. clusterProfiler 4.0: a universal enrichment tool for interpreting omics data. Innovation (Camb). 2021;2:100141.
Liu CJ, Hu FF, Xia MX, Han L, Zhang Q, Guo AY. GSCALite: a web server for gene set cancer analysis. Bioinformatics. 2018;34:3771–2.
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13:2498–504.
Szklarczyk D, Gable AL, Lyon D, Junge A, Wyder S, Huerta-Cepas J, et al. STRING v11: protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 2019;47:D607–13.
This work was supported by the National Natural Science Foundation of China (Nos. 62171236 and 61771251), the key project of social development in Jiangsu Province (No. BE2022799), the key projects of Natural Science Research in Universities of Jiangsu Province (22KJA180006), NUPTSF (No. NY220041), the Qinglan Project in Jiangsu Province, the Open Research Fund of State Key Laboratory of Bioelectronics, Southeast University (SKLB2022-K03), funding from Shandong Provincial Key Laboratory of Biophysics (2022KFKT001), and the Priority Academic Program Development of Jiangsu Higher Education Institution (PAPD).
The authors declare no competing interests.
Guo, L., Xiang, Y., Dou, Y. et al. GBCdb: RNA expression landscapes and ncRNA–mRNA interactions in gallbladder carcinoma. BMC Bioinformatics 24, 12 (2023). https://doi.org/10.1186/s12859-023-05133-2
- Gallbladder carcinoma (GBC)
- Expression landscape
- RNA interaction