- Open Access
Identification of cuproptosis-related lncRNAs to predict prognosis and immune infiltration characteristics in alimentary tract malignancies
BMC Bioinformatics volume 24, Article number: 184 (2023)
Alimentary tract malignancies (ATM) caused nearly one-third of all tumor-related death. Cuproptosis is a newly identified cell death pattern. The role of cuproptosis-associated lncRNAs in ATM is unknown.
Data from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) databases were used to identify prognostic lncRNAs by Cox regression and LASSO. Then a predictive nomogram was constructed based on seven prognostic lncRNAs. In addition, the prognostic potential of the seven-lncRNA signature was verified via survival analysis, the receiver operating characteristic (ROC) curve, calibration curve, and clinicopathologic characteristics correlation analysis. Furthermore, we explored the associations between the signature risk score and immune landscape, and somatic gene mutation.
We identified 1211 cuproptosis-related lncRNAs and seven survival-related lncRNAs. Patients were categorized into high-risk and low-risk groups with significantly different prognoses. ROC and calibration curve confirmed the good prediction capability of the risk model and nomogram. Somatic mutations between the two groups were compared. We also found that patients in the two groups responded differently to immune checkpoint inhibitors and immunotherapy.
The proposed novel seven lncRNAs nomogram could predict prognosis and guide treatment of ATM. Further research was required to validate the nomogram.
Alimentary tract malignancies (ATM), comprising a spectrum of cancers occurring in the digestive tract, have seriously endangered public health and human life . Arnold et al. reported the mortality of gastric (approximately 1.0 million new cases in 2018), esophagus (570,000 cases), and colorectum (1.8 million cases) cancer, which caused nearly one-third of all tumor-related deaths . The alimentary tract, from the oropharynx to the anal canal, is closely related in mainly organ functions and development, suggesting common etiological pathways and mechanisms. However, the mechanisms of digestive tract tumorigenicity in common remain to be explored.
There is currently no efficient and established early-stage screening protocol for ATM patients. As a result, many patients are in the middle and advanced stages when they are diagnosed . The average survival duration of individuals with advanced ATM continues to be extremely low, despite advancements in several therapy approaches. Therefore, it is essential to explore a reliable prognostic signature to predict the prognosis of ATM patients and direct clinical practice.
According to Science, a novel type of cell death known as cuproptosis is brought on by an excessive buildup of copper. Copper causes toxic protein stress and, as a result, cell death by binding specifically to lipoylated parts of the tricarboxylic acid (TCA) cycle . A trace element called copper is essential for many biological activities. Key elements of the course of cancer, including angiogenesis, metastasis, and proliferation, are influenced by copper buildup [4, 5]. Growing research over the past few years has demonstrated that copper homeostasis dysregulation may influence the onset and development of ATM. Jacinta et al. demonstrated that copper played a significant part in the biological activity that was seen to be amplified after being associated with a lipid-based nanosystem for the treatment of colorectal cancer . Rebecca et al. used a multi-technology strategy to examine the mechanism of sensitivity in esophageal cancer to copper-dependent cell death, which could be targeted in the future . In another study, Du et al. presented that disulfiram (DSF) was highly toxic to gastric cancer cells in a copper-dependent manner. And DSF/Cu exerted antitumor activity against gastric cancer cells in vitro and in vivo .
Long non-coding RNAs (lncRNAs) are a subset of untranslated RNAs that comprise more than 200 nucleotides . Growing data reveals that lncRNAs have the capacity to control tumor metastasis, cancer immunity, and programmed cell death in recent years [10,11,12,13]. Additionally, fresh possible prognostic markers known as lncRNAs have been discovered for ATM patients [14, 15]. Sun et al. explored the lncRNA-mRNA regulatory networks in ATM progression and performed bioinformatic analysis, indicating that THBS2 was a potential key regulator and therapeutic target . Hu et al. reported that lncRNA EGFR-AS1 could regulate the expression of EGFR via heightening EGFR mRNA stability to active phosphatidylinositol-3 kinase (PI3K)/AKT pathway for the furtherance of the malignant progression of ATM . Besides, Hao et al. identified immune-related lncRNA pairs and constructed a predictive nomogram, which demonstrated good predictive ability . Nevertheless, research on cuproptosis-related lncRNAs in ATM is still limited.
Given that integrated analyses usually emphasized cancer heterogenicity, this study tried another view of homogeneity of cancer prognosis, immunological landscape, and somatic gene mutation. By using bioinformatics analysis, we identified novel cuproptosis-related risk lncRNAs and constructed a predictive model of digestive tract cancers, which could eventually guide doctors to make better clinical decisions.
Materials and methods
TCGA and GEO data
We downloaded RNA sequencing (RNA-seq), expression files, and mutation files from the Cancer Genome Atlas (TCGA) database (https://portal.gdc.cancer.gov/repository), including 375 tumor samples and 32 normal samples in STAD, 163 tumor samples and 11 normal samples in ESCA, 480 tumor samples and 41 normal samples in COAD, and 167 tumor samples and 10 normal samples in READ. These data were arranged in accordance with TCGA procedures and combined into transcripts per million data (TPM). GSE40967, GSE53622, and GSE84437 from the Gene Expression Omnibus (GEO) database (https://www.ncbi.nlm.nih.gov/geo/) were downloaded and used as an external validation cohort.
Identification of cuproptosis-related lncRNAs
A list of 16 cuproptosis regulators was retrieved from the lipoylated TCA cycle pathway of copper-induced cell death (FDX1, LIPTI, LIAS, DLD, MTF1, GLS, CDKN2A, DLAT, PDHA1, PDHB, DBT, GCSH, and DLST)  and copper transport protein (SLC31A1, ATP7A, and ATP7B) [19, 20].
The co-expression analysis between lncRNAs and cuproptosis-related genes was then carried out (|cor|> 0.4 and p < 0.001). The "DEseq2" package was used to identify lncRNAs that were differentially expressed between tumor and normal samples. The following requirements were met by differentially expressed lncRNAs: p < 0.05 and (log2FC|> 1 (FC, fold change). A Sankey diagram was mapped to assess the degree of association between genes involved in cuproptosis and lncRNAs.
Cuproptosis-related lncRNAs signature for ATM prognosis
To create and validate the cuproptosis-associated lncRNAs signatures, the overall patients were randomly divided into either the training cohort or the test cohort in a 7:3 ratio. Univariate Cox regression analysis was initially applied to identify the risk lncRNAs. The LASSO regression analysis based on tenfold cross-validation was then used to reduce the overfitting effect. Finally, the multivariate Cox regression analysis was used to identify the best prognostic signs.
We then created a risk score algorithm for ATM by calculating the relevant coefficients for the risk lncRNAs. The risk score for each patient was calculated using the following formula: Risk score = ∑ coefi*αi, where αi and coefi denoted the expression level of each prognostic lncRNA and its accompanying coefficient.
Validation of the lncRNA risk model
Based on the median risk score cutoff, patients in each cohort were divided into low-risk and high-risk groups. Using the "survminer" R package, the Kaplan–Meier survival analysis was used to compare the overall survival (OS) between two risk cohorts. The receiver operating characteristic (ROC) curve was constructed to evaluate the prognostic accuracy of our risk signatures. A principal component analysis (PCA) analysis was used to examine the distribution of high-risk and low-risk groupings. The prognostic signatures were analyzed both within the testing cohort and across the full cohort to determine the model's viability. The risk score and clinicopathologic parameters were subjected to univariate and multivariate Cox regression analysis to ascertain the independence of the cuproptosis-related lncRNAs signature (gender, age, grade, and TNM stage). To determine whether the signature maintained its predictive power in patient subgroups, stratified analysis was performed in the end.
Establishment and assessment of the nomogram
The signatures and clinical data were incorporated into the suggested prediction nomogram. Then, ROC curves were employed to assess the predictive ability of the nomogram at 1-, 3-, and 5-year. The 1-, 3-, and 5-year calibration plots (1000 bootstrap resamples) were displayed to compare the projected overall survival with what was observed in the research. The 45-degree line was shown to be the best prediction.
Functional and pathway enrichment
The "limma" package was used to find the differentially expressed genes (DEGs) between low- and high-risk groups. The following requirements were met by DEGs: false discovery rate (FDR) < 0.05 and |log2FC|≥ 1. The "clusterProfiler" package was used to conduct functional and pathway enrichment studies using the Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases . Additionally, the top six important pathways in the high-risk and low-risk subgroups were found using the gene set enrichment analysis (GSEA) .
We used seven algorithms to investigate the relationship between the risk score and tumor-infiltrating immune cells, namely, CIBERSORT , CIBERSORT-ABS , EPIC , MCPcounter , quanTIseq , TIMER , and xCell . The normalized enrichment score (NES) was utilized to produce individual enrichment scores for immunological pathways using single-sample GSEA (ssGSEA). The level of coordinated regulation of the genes within a sample was indicated by each ssGSEA enrichment score. Next, the immune checkpoint gene expression was investigated. Finally, the tumor immune dysfunction and exclusion (TIDE) algorithm was used to detect T cell malfunction signatures and signatures that excluded T cell infiltration into tumors to predict the therapeutic response of immune checkpoint blockades (ICBs) .
Tumor mutation burden
Additionally, the TCGA Somatic Mutation Database was used to retrieve the data on somatic mutations for the ATM samples. We examined the tumor mutational burden (TMB) in both groups using the "maftools" package. The whole population was separated into high-TMB and low-TMB subsets by median TMB, and the Kaplan–Meier survival curve was presented for each group.
R software (version 4.2.1) and Strawberry Perl (version 5.3.1) were used to perform all statistical analyses. P < 0.05 was regarded as statistical significance.
Identification of cuproptosis-regulated lncRNAs
The workflow of this project was presented in Fig. 1. From the TCGA database, RNA-sequencing information and clinical annotation for ATM were retrieved. In a recent publication, 16 cuproptosis regulators were identified in the lipoylated TCA cycle pathway of copper-induced cell death (Additional file 1: Table S1). The Ensembl gene annotation dataset discovered 16,901 lncRNAs and 19,962 mRNAs altogether. Based on the Pearson correlation analysis (|cor|> 0.4 and p < 0.001), 1211 cuproptosis-related lncRNAs were discovered. The co-expression network between the cuproptosis-associated lncRNAs and cuproptosis genes was then presented by the Sankey plot (Fig. 2A). Then, using the criterion of p < 0.05 and (log2FC|> 1, 174 differentially expressed lncRNAs between normal and cancer samples were chosen for additional study (Fig. 2B). 118 of these genes showed an upregulation, while 56 showed a downregulation.
We randomly divided all ATM cases into the training set and the internal validation set at a 7:3 ratio. The training set was used to create the model, and the internal validation set was used to validate the model. The chi-square test revealed that the demographic and clinicopathologic characteristics of the two groups were comparable (Table 1). First, 138 lncRNAs that were significantly linked with the OS (p < 0.05) were originally screened using a univariate Cox proportional hazard regression analysis. Then 16 lncRNAs were retrieved by LASSO regression analysis (Fig. 2C, D). Finally, using the multivariate Cox regression model analysis, seven lncRNAs (AC016737.1, AC048344.4, AL590483.4, GK-IT1, LINC01555, SETBP1-DT, and SNHG4) were obtained for creating the ideal prognostic signature (Fig. 2E). The risk score was calculated as follows: risk score = (0.3505 × AC016737.1 expression) + (0.1843 × AC048344.4 expression) + (− 0.3853 × AL590483.4 expression) + (0.1398 × GK-IT1 expression) + (− 0.3871 × LINC01555 expression) + (0.1955 × SETBP1-DT expression) + (− 0.1491 × SNHG4 expression). The heatmap showed a close correlation between the seven risk lncRNAs and cuproptosis genes (Fig. 2F).
In the training, internal validation, and overall sets, the samples were regrouped into high-risk and low-risk groups based on the median risk scores (Fig. 3). The distribution of risk scores, survival time patterns, survival status, and the associated expression of seven risk lncRNAs were validated in the three groups. For all three analysis groups, the same trend results were observed. The Kaplan–Meier curves showed better survival in the low-risk group than in the high-risk group (all p < 0.001) (Fig. 3A–C). The risk curves and scatterplots showed that samples in high-risk groups had significantly higher mortality rates (Fig. 3D–I). The heatmap showed that three protective lncRNAs were noticeably downregulated whereas four risk lncRNAs were noticeably increased in the high-risk group (Fig. 3J–L). These all suggested that the risk prediction model demonstrated a good capacity for prediction.
Assessment of the risk model
To determine if the risk score may function as an independent prognostic factor for ATM, univariate and multivariate Cox regression analyses were used. Age, gender, grade, stage, and risk score were all positively correlated with the prognosis of ATM in the entire sample, according to the results of the univariate Cox regression analysis (p < 0.001). (Fig. 4A; Additional file 2: Table S2). Age, stage, and risk score were also shown to be independent prognostic factors in ATM by the multivariate Cox regression analysis (p < 0.05) (Fig. 4B; Additional file 2: Table S2). In the univariate and multivariate analyses, the HR values for risk scores were 1.732 (1.559–1.925) and 1.314 (1.092–1.582).
The predictive power of the risk signature for the OS was evaluated using the ROC curve. At 1-, 3-, and 5-year, this risk score's predictive ability was strong (Fig. 4C). When compared to other clinicopathologic variables, the AUC associated with the risk score was the highest (Fig. 4D). Additionally, we investigated how the risk model affected various subgroups using Kaplan–Meier curves, which showed good performance in most subgroups (p < 0.05) (Additional file 5: Fig. S1). Above all, our findings showed that the risk score based on the signatures of the seven risk lncRNAs was useful for prognostic analysis.
We created a nomogram based on the training cohort to compute the overall survival rate of 1-, 3-, and 5-year using the risk score in combination with clinicopathological variables such as age, gender, grade, and stage (Fig. 5A). The nomogram's AUC values for the 1-, 3-, and 5-year periods were 0.728, 0.765, and 0.741, respectively, demonstrating its strong predictive power (Fig. 5B). The calibration plots showed good agreement for the accuracy of 1-, 3-, and 5-year overall survival prediction (Fig. 5C). The nomogram displayed a greater AUC compared to the single parameter in the model. Then the model was verified by using samples from the GEO cohort. The results indicated ideal predictive ability in external validation (Fig. 5D, E).
PCA and biological pathways analyses
PCA was used to analyze the aggregation characteristics of the low-risk and high-risk sets. The outcomes presented that risk lncRNAs had superior classification capacity than all genes, cuproptosis regulators, and cuproptosis-related lncRNAs (Fig. 6A–C).
DEGs with the cut-off criteria of log2|FC|> 1 and FDR < 0.05 were chosen to further investigate the variations in biological processes and signaling pathways in the two groups. The GO analysis indicated that DEGs were involved in epidermal cell differentiation, anterior/posterior pattern specification, epidermis development, embryonic organ development, regionalization, and pattern specification process in the biological process (BP) category. The apical part of cell, cornified envelope, apical plasma membrane, Golgi lumen, anchored component of membrane, and keratin filament were enriched in DEGs at the cell component (CC) category. Besides, DEGs were mainly correlated with the bile acid binding, oxidoreductase activity, structural constituent of skin epidermis, extracellular matrix structural constituent, and DNA-binding transcription activator activity for the molecular function (MF) category (Fig. 6D, E; Additional file 3: Table S3).
In the KEGG analysis, these DEGs presented more enrichment in signaling pathways regulating pluripotency of stem cells, pancreatic secretion, staphylococcus aureus infection, steroid hormone biosynthesis, and protein digestion and absorption (Fig. 6F, G; Additional file 4: Table S4).
To compare the variations in biological processes and pathways between the high-risk and low-risk categories, we further conducted GSEA analyses (Additional file 5: Figs. S2, S3). The findings showed that the low-risk group had higher levels of base excision repair, citrate cycle, TCA cycle, oxidative phosphorylation, peroxisome, and ribosome. In the high-risk group, linoleic acid metabolism was enhanced.
Correlation analysis between risk scores and gene mutations
The somatic mutations between the two groups were contrasted. TP53, TTN, APC, MUC16, SYNE1, KRAS, LRP1B, PIK3CA, FAT4, and CSMD3 were the ten most frequently altered genes. The low-risk set had more frequent TP53, APC, and KRAS mutations (Fig. 7A, B). And the alternation of APC mutation frequency was the most significant (from 13% in the high-risk group to 73% in the low-risk group). Detailed information on somatic mutation was presented in Fig. 7C and D. Then the relationship between TMB and risk sets was investigated. The result indicated no significant difference in TMB in low-risk and high-risk sets (Fig. 7E). The ATM patients were then separated into low-mutation and high-mutation categories by median TMB. According to the Kaplan–Meier curves, the high-mutation samples presented better survival than the low-mutation group (p < 0.001) (Fig. 7F). When TMB and risk scores were combined to evaluate the prognosis, we discovered that patients with higher TMB in the low-risk fraction presented the best prognosis, whereas patients with lower TMB in the high-risk subset had the worst survival rate (Fig. 7G).
Immune landscape of ATM patients
Immune cell infiltration research revealed a link between risk scores and tumor-infiltrating immune cells. Immune cells were found to be closely related to high-risk ratings on various algorithms (Fig. 8A). According to the ssGSEA data, the high-risk group was abundant in immune cells such as aDCs, B cells, CD8 + T cells, DCs, macrophage, mast cells, neutrophils, NK cells, T helper cells, Tfh, Th1 cells, TIL, and Treg (Fig. 8B, C). Furthermore, immunological scores revealed that the immune function in the high-risk group was enriched, including APC co-inhibition, check-point, cytolytic activity, inflammation-promoting, T cell co-inhibition, type I IFN response, and type II IFN response (Fig. 8D).
Given the importance of ICIs in tumor treatment, we investigated immune checkpoint genes in both groups, indicating that major checkpoint gene expression differed between the two groups. And more immune checkpoint expression was observed in the high-risk set, such as CD27, CD40, CD86, LAG3, and NRP1 (Fig. 8E). TIDE scores in the high-risk group were considerably higher than in the low-risk group. This suggested that TIDE might be used to assess ATM patients' susceptibility to ICB therapy (Fig. 8F). Above all, these findings revealed that high-risk patients might be more sensitive to immunotherapy.
Despite a decrease in ATM morbidity, it continues to be the world's leading cause of mortality from malignant tumors today . Thus, research into carcinogenesis and tumor growth pathways remains critical. However, most studies focus on a specific cancer type while providing little concrete information about the homogeneity of cancers. The common mechanisms of digestive tract tumorigenicity have been explored recently and present encouraging results [30, 31]. So, this study tried another view of the homogeneity of cancer development. According to Science, cuproptosis was a novel type of cell death, which played a critical role in the occurrence and development of ATM . And the prognostic significance of cuproptosis-related lncRNAs in ATM hadn’t been investigated before. By using bioinformatics analysis, we identified novel biomarkers associated with clinical traits and common regulatory mechanisms of digestive tract cancers.
Many ATM patients are initially diagnosed in the middle or advanced stages. As a result, early diagnosis biomarkers and effective therapy techniques are essential for ATM patients. Numerous studies have shown that lncRNAs have an important role in the early detection of ATM. Li et al. conducted a meta-analysis of 40 original research articles involving 6,772 individuals, indicating that serum or plasma lncRNAs had high sensitivity and specificity in identifying gastric cancer . Cheng et al. discovered that the lncRNA LINC00662 affected CLDN8/IL22 co-expression and stimulated the ERK signaling pathway to enhance colon cancer growth and metastasis . Huang et al. demonstrated that the lncRNA-encoded peptide HOXB-AS3 inhibited the growth of colon cancer cells . Roohinejad et al. found that PVT1 and CCAT1 lncRNAs were great markers for the early diagnosis of ESCC, which played a critical role in cancer cell growth and regulation .
Cuproptosis, a novel mode of cell death, was recently described. It was produced by excessive copper binding to lipoylated components of the tricarboxylic acid (TCA) cycle, leading to toxic protein stress and cell death [3, 36]. Researchers discovered that cuproptosis might play an important role in tumor proliferation, metastasis, and angiogenesis . Nonetheless, the prognostic significance of cuproptosis-related lncRNAs in ATM is little understood. The goal of this work was to create a new cuproptosis-related lncRNAs profile to predict survival and tumor immunity in ATM patients.
We started by downloading clinical data, transcriptome sequencing data, and survival data on ATM patients from the TCGA and GEO databases. Then seven risk lncRNAs were identified and used for signature construction. AC016737.1, GK-IT1, LINC01555, SETBP1-DT, and SNHG4 were proven to play critical roles in multiple malignancies [37,38,39,40,41]. The genes AC048344.4 and AL590483.4 were undocumented, and it was currently unknown what the underlying mechanism of these lncRNAs in ATM was. These discovered lncRNAs were intriguing targets for cancer therapy and might very well contribute to the mechanism of ATM.
The risk value produced by the model, just as some recognized prognostic indicators like pathologic stages, could be used to independently predict the prognosis of ATM patients. Additionally, the risk model was more effective in predicting patient outcomes because it comprised only seven identified lncRNAs. The nomogram was subsequently approved as a technique for predicting the 1-, 3-, and 5-year OS of ATM patients. The congruence between the predictions made by the nomogram and the actual results was shown via calibration curves. Statistical investigation revealed that our prognostic signature was highly accurate and sensitive.
TMB has been shown to be an accurate predictor of the efficacy of immunotherapy [42, 43]. Then we analyzed the TMB landscape in both groups, and a higher level of TMB was discovered in the high-risk cohort. The results indicated that our signature might be used to identify individuals who might benefit from immunotherapy, potentially improving treatment results and reducing the risk of serious immune-related side events.
Numerous studies have shown that the tumor immunological microenvironment is critical in the formation and progression of ATM [44, 45]. Between the high-risk and low-risk groups in our investigation, the ssGSEA algorithm found a substantial difference in immune cell infiltration. These findings showed that cuproptosis was closely related to immune infiltration and the tumor-immune microenvironment in patients with ATM. A higher proportion of immune cells, such as aDCs, B cells, CD8 + T cells, DCs, macrophages, mast cells, neutrophils, NK cells, T helper cells, Tfh, Th1 cells, TIL, and Treg, were infiltrated in the high-risk fraction compared to the low-risk group. And patients in the low-risk group survived longer than those in the high-risk fraction. The findings suggested that worse survival outcomes for ATM patients were predicted by the enrichment of these immune cells.
Immune checkpoint inhibitors have ushered in a revolutionary age of cancer immunotherapy, with the potential to improve the treatment results of cancer patients . Therefore, we investigated the differences in immune checkpoint gene expression between the high-risk and low-risk groups. According to the findings, immunological checkpoints such as CD27, CD40, CD86, LAG3, and NRP1, were more active in the high-risk group. Previous studies had partly explained the role of these immune checkpoints in ATM patients. For example, Rhyner et al. reported that the LAG3 expression on tumor-infiltrating lymphocytes was significantly associated with prognosis. LAG3 testing might aid in predicting outcomes for colon cancer patients and might help to find those who would benefit from adjuvant chemotherapy . Additionally, gastric cancer and esophageal cancer were tightly linked to LAG3, which was thought to be a prospective therapeutic target for antitumor therapy [44, 48]. Above all, our signature suggested that for ATM patients at higher risk, medicines targeting these immune checkpoints would offer a viable therapy option.
Several studies indicated the role of copper in ATM. For instance, disulfiram (DSF)/Cu-induced formation of reactive oxygen species (ROS) resulted in the growth inhibition of GC cells via glycolysis . Besides, Hu et al. found that copper could induce autophagic cell death by targeting ULK1 in colorectal cancer . Based on the GO and KEGG analysis, we reasonably guessed that cuproptosis might be involved in ATM progression by influencing the activity of apical plasma membrane and D-binding transcription activator. And signaling pathways regulating pluripotency of stem cells and metabolism of xenobiotics by cytochrome P450 might be involved in pathway mechanism in ATM development. According to the GSEA analysis, the signaling pathway of the citrate cycle TCA cycle was enriched in the low-risk group. Cuproptosis occurs when copper binds to lipoylated enzymes in the TCA cycle, causing protein aggregation, proteotoxic stress, and cell death. As a result, the TCA cycle might be a critical route in ATM copper-dependent cell death. These findings suggested inhibiting the TCA cycle pathway could have anti-cancer effects in ATM. Nonetheless, these findings required additional investigation.
The advancement of interaction prediction research in various fields of computational biology has provided valuable insights into genetic markers and molecular mechanisms [50, 51]. At present, the interactions between lncRNA and miRNA are mainly obtained through biological experiments, but such experiments are often time-consuming and labor-intensive, it is necessary to design a computational method that can predict the interactions between lncRNA and miRNA. Wang et al. proposed a method based on graph convolutional neural and conditional random field for predicting lncRNA–miRNA interactions, which had an AUC value of 0.947 and presented higher prediction accuracy than the other methods . Zhang et al. used network distance analysis to predict lncRNA–miRNA interaction, verifying the reliability of this method . Besides, Liu et al. established a novel matrix factorization model to predict lncRNA–miRNA interactions, and the model obtained reliable performance . Thus, the interaction prediction between cuproptosis-related lncRNAs and miRNA could be investigated by computational biology methods in the future.
Undoubtedly, our current research also included several limitations and flaws. First, the whole samples used for our signatures were obtained from the TCGA and GEO databases. Second, there weren’t enough clinical samples used to validate signatures. As a result, additional study in the following clinical stage is required. Finally, in vivo and in vitro investigations should be conducted to investigate the underlying processes of how these cuproptosis-related lncRNAs influence ATM.
In summary, we effectively developed a novel seven lncRNAs signature with excellent sensitivities and specificities to predict survival outcomes in patients with ATM. Furthermore, our research sheds light on how cuproptosis-related lncRNAs in ATM function at the molecular level. The signature might offer direction for the customized treatment of ATM patients as well as aid in evaluating the effectiveness of targeted therapies and immunotherapy.
Availability of data and materials
The public datasets were obtained from TCGA (https://portal.gdc.cancer.gov/) and GEO (https://www.ncbi.nlm.nih.gov/geo/). GEO Accession Numbers: GSE40967, GSE53622, and GSE84437.
Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018;68(6):394–424.
Arnold M, Abnet CC, Neale RE, Vignat J, Giovannucci EL, McGlynn KA, et al. Global burden of 5 major types of gastrointestinal cancer. Gastroenterology. 2020;159(1):335-49.e15.
Tsvetkov P, Coy S, Petrova B, Dreishpoon M, Verma A, Abdusamad M, et al. Copper induces cell death by targeting lipoylated TCA cycle proteins. Science. 2022;375(6586):1254–61.
Hanahan D, Weinberg RA. Hallmarks of cancer: the next generation. Cell. 2011;144(5):646–74.
Denoyer D, Masaldan S, La Fontaine S, Cater MA. Targeting copper in cancer therapy: “Copper That Cancer.” Metallomics. 2015;7(11):1459–76.
Pinho J, da Silva I, Amaral J, Rodrigues C, Casini A, Soveral G, et al. Therapeutic potential of a copper complex loaded in pH-sensitive long circulating liposomes for colon cancer management. Int J Pharm. 2021;599:120463.
Hughes R, Elliott R, Li X, Munro A, Makda A, Carter R, et al. Multiparametric high-content cell painting identifies copper ionophores as selective modulators of esophageal cancer phenotypes. ACS Chem Biol. 2022;17(7):1876–89.
Du C, Guan X, Liu Y, Xu Z, Du X, Li B, et al. Disulfiram/copper induces antitumor activity against gastric cancer cells in vitro and in vivo by inhibiting S6K1 and c-Myc. Cancer Chemother Pharmacol. 2022;89(4):451–8.
Batista PJ, Chang HY. Long noncoding RNAs: cellular address codes in development and disease. Cell. 2013;152(6):1298–307.
Huarte M. The emerging role of lncRNAs in cancer. Nat Med. 2015;21(11):1253–61.
Quinn JJ, Chang HY. Unique features of long non-coding RNA biogenesis and function. Nat Rev Genet. 2016;17(1):47–62.
Zhang L, Xu X, Su X. Noncoding RNAs in cancer immunity: functions, regulatory mechanisms, and clinical application. Mol Cancer. 2020;19(1):48.
Zhang Y, Mao Q, Xia Q, Cheng J, Huang Z, Li Y, et al. Noncoding RNAs link metabolic reprogramming to immune microenvironment in cancers. J Hematol Oncol. 2021;14(1):169.
Xu R, Wu X, Du A, Zhao Q, Huang H. Identification of cuproptosis-related long non-coding ribonucleic acid signature as a novel prognosis model for colon cancer. Am J Cancer Res. 2022;12(11):5241–54.
Tu H, Zhang Q, Xue L, Bao J. Cuproptosis-related lncRNA gene signature establishes a prognostic model of gastric adenocarcinoma and evaluate the effect of antineoplastic drugs. Genes. 2022;13(12):2214.
Sun K, Zu C, Wu X, Wang Q, Hua P, Zhang Y, et al. Identification of lncRNA and mRNA regulatory networks associated with gastric cancer progression. Front Oncol. 2023;13:1140460.
Hu J, Qian Y, Peng L, Ma L, Qiu T, Liu Y, et al. Long noncoding RNA EGFR-AS1 promotes cell proliferation by increasing EGFR mRNA stability in gastric cancer. Cell Physiol Biochem Int J Exp Cell Physiol Biochem Pharmacol. 2018;49(1):322–34.
Hao Z, Liang P, He C, Sha S, Yang Z, Liu Y, et al. Prognostic risk assessment model and drug sensitivity analysis of colon adenocarcinoma (COAD) based on immune-related lncRNA pairs. BMC Bioinform. 2022;23(1):435.
Graden JA, Winge DR. Copper-mediated repression of the activation domain in the yeast Mac1p transcription factor. Proc Natl Acad Sci U S A. 1997;94(11):5550–5.
Lukanovic D, Herzog M, Kobal B, Cerne K. The contribution of copper efflux transporters ATP7A and ATP7B to chemoresistance and personalized medicine in ovarian cancer. Biomed Pharmacother. 2020;129:110401.
Minoru K, Yoko S, Masayuki K, Miho F, Mao T. KEGG as a reference resource for gene and protein annotation. Nucleic Acids Res. 2016;D1:D457–62.
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005;102(43):15545–50.
Newman AM, Liu CL, Green MR, Gentles AJ, Feng W, Xu Y, et al. Robust enumeration of cell subsets from tissue expression profiles. Nat Methods. 2015;12(5):453–7.
Racle J, de Jonge K, Baumgaertner P, Speiser DE, Gfeller D. Simultaneous enumeration of cancer and immune cell types from bulk tumor gene expression data. Elife. 2017;6:e26476.
Becht E, Giraldo NA, Lacroix L, Buttard B, Elarouci N, Petitprez F, et al. Estimating the population abundance of tissue-infiltrating immune and stromal cell populations using gene expression. Genome Biol. 2016;17(1):218.
Finotello F, Mayer C, Plattner C, Laschober G, Rieder D, Hackl H, et al. Molecular and pharmacological modulators of the tumor immune contexture revealed by deconvolution of RNA-seq data. Genome Med. 2019;11(1):34.
Li T, Fu J, Zeng Z, Cohen D, Li J, Chen Q, et al. TIMER2.0 for analysis of tumor-infiltrating immune cells. Nucleic Acids Res. 2020;48(W1):W509–14.
Aran D, Hu Z, Butte AJ. xCell: digitally portraying the tissue cellular heterogeneity landscape. Genome Biol. 2017;18(1):220.
Jiang P, Gu S, Pan D, Fu J, Sahu A, Hu X, et al. Signatures of T cell dysfunction and exclusion predict cancer immunotherapy response. Nat Med. 2018;24(10):1550–8.
Lu YC, Shi JQ, Zhang ZX, Zhou JY, Zhou HK, Feng YC, et al. Transcriptome based system biology exploration reveals homogeneous tumorigenicity of alimentary tract malignancy. Front Oncol. 2020;10:580276.
Yang S, Liu T, Cheng Y, Bai Y, Liang G. Immune cell infiltration as a biomarker for the diagnosis and prognosis of digestive system cancer. Cancer Sci. 2019;110(12):3639–49.
Li J, Zhang Y, Xu Q, Zhang Y, Bei S, Ding Y, et al. Diagnostic value of circulating lncRNAs for gastric cancer: a systematic review and meta-analysis. Front Oncol. 2022;12:1058028.
Cheng B, Rong A, Zhou Q, Li W. LncRNA LINC00662 promotes colon cancer tumor growth and metastasis by competitively binding with miR-340-5p to regulate CLDN8/IL22 co-expression and activating ERK signaling pathway. J Exp Clin Cancer Res. 2020;39(1):5.
Huang JZ, Chen M, Chen D, Gao XC, Zhu S, Huang H, et al. A peptide encoded by a putative lncRNA HOXB-AS3 suppresses colon cancer growth. Mol Cell. 2017;68(1):171-84.e6.
Roohinejad Z, Bahramian S, Shamsabadi F, Sahebi R, Amini A, Sabour D, et al. Upregulation of the c-MYC oncogene and adjacent long noncoding RNAs PVT1 and CCAT1 in esophageal squamous cell carcinoma. BMC Cancer. 2023;23(1):34.
Li SR, Bu LL, Cai L. Cuproptosis: lipoylated TCA cycle proteins-mediated novel cell death pathway. Signal Transduct Target Ther. 2022;7(1):158.
Zhang S, Li X, Tang C, Kuang W. Inflammation-related long non-coding RNA signature predicts the prognosis of gastric carcinoma. Front Genet. 2021;12:736766.
Yang X, Zeng T, Liu Z, He W, Hu M, Tang T, et al. Long noncoding RNA GK-IT1 promotes esophageal squamous cell carcinoma by regulating MAPK1 phosphorylation. Cancer Med. 2022;11(23):4555–74.
Li D, Shen Y, Ren H, Wang L, Yang J, Wang Y. Repression of linc01555 up-regulates angiomotin-p130 via the microRNA-122-5p/clic1 axis to impact vasculogenic mimicry-mediated chemotherapy resistance in small cell lung cancer. Cell Cycle (Georgetown, Tex). 2023;22(2):255–68.
Feng L, Yang J, Zhang W, Wang X, Li L, Peng M, et al. Prognostic significance and identification of basement membrane-associated lncRNA in bladder cancer. Front Oncol. 2022;12:994703.
Liu C, Zhao S, Lv ZX, Zhao XJ. Promoting action of long non-coding RNA small nucleolar RNA host gene 4 in ovarian cancer. Acta Biochim Pol. 2023;70(1):59–68.
Yarchoan M, Hopkins A, Jaffee EM. Tumor mutational burden and response rate to PD-1 inhibition. N Engl J Med. 2017;377(25):2500–1.
Eder T, Hess AK, Konschak R, Stromberger C, Johrens K, Fleischer V, et al. Interference of tumour mutational burden with outcome of patients with head and neck cancer treated with definitive chemoradiation: a multicentre retrospective study of the German Cancer Consortium Radiation Oncology Group. Eur J Cancer. 2019;116:67–76.
Ambrosio M, Spagnoli L, Perotti B, Petrelli F, Caini S, Saieva C, et al. Paving the path for immune enhancing nutrition in colon cancer: modulation of tumor microenvironment and optimization of outcomes and costs. Cancers. 2023;15(2):437.
Mo S, Shen X, Wang Y, Liu Y, Sugasawa T, Yang Z, et al. Systematic single-cell dissecting reveals heterogeneous oncofetal reprogramming in the tumor microenvironment of gastric cancer. Hum Cell. 2023;36(2):689–701.
Ramos-Casals M, Brahmer JR, Callahan MK, Flores-Chávez A, Keegan N, Khamashta MA, et al. Immune-related adverse events of checkpoint inhibitors. Nat Rev Dis Primers. 2020;6(1):38.
Rhyner Agocs G, Assarzadegan N, Kirsch R, Dawson H, Galván J, Lugli A, et al. LAG-3 expression predicts outcome in stage II colon cancer. J Pers Med. 2021;11(8):749.
Guo Y, Chu H, Xu J. Research progress of immune heckpoint LAG-3 in gastric cancer: a narrative review. Eur Rev Med Pharmacol Sci. 2023;27(1):248–55.
Hu Y, Qian Y, Wei J, Jin T, Kong X, Cao H, et al. The disulfiram/copper complex induces autophagic cell death in colorectal cancer by targeting ULK1. Front Pharmacol. 2021;12:752825.
Sun F, Sun J, Zhao Q. A deep learning method for predicting metabolite-disease associations via graph neural network. Brief Bioinform. 2022;23(4):bbac266.
Wang T, Sun J, Zhao Q. Investigating cardiotoxicity related with hERG channel blockers using molecular fingerprints and graph attention mechanism. Comput Biol Med. 2023;153:106464.
Wang W, Zhang L, Sun J, Zhao Q, Shuai J. Predicting the potential human lncRNA–miRNA interactions based on graph convolution network with conditional random field. Brief Bioinform. 2022;23(6):bbac463.
Zhang L, Yang P, Feng H, Zhao Q, Liu H. Using network distance analysis to predict lncRNA–miRNA interactions. Interdiscip Sci Comput Life Sci. 2021;13(3):535–45.
Liu H, Ren G, Chen H, Liu Q, Yang Y, Zhao Q. Predicting lncRNA–miRNA interactions based on logistic matrix factorization with neighborhood regularized. Knowl-Based Syst. 2020;191:105261.
This study was supported by the Science and Technology Program of traditional Chinese Medicine in Zhejiang Province (2021ZB208) and the Zhejiang Medical and Health Science and Technology Plan Project (2021KY927).
Ethics approval and consent to participate
Since the TCGA and GEO databases could be accessed by anybody and provides patient data without requiring personal identity, informed permission and ethics approval were not necessary.
Consent for publication
The study's authors affirmed that there were no financial or commercial ties that might be viewed as having a potential competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional file 1
. Cuproptosis genes.
Additional file 2
. Univariate and multivariate Cox regression analysis in TCGA-ATM dataset.
Additional file 3
. GO analysis of the DEGs.
Additional file 4
. KEGG analysis of the DEGs.
Additional file 5
. Supplementary Figures.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Xie, Y., Song, X., Du, D. et al. Identification of cuproptosis-related lncRNAs to predict prognosis and immune infiltration characteristics in alimentary tract malignancies. BMC Bioinformatics 24, 184 (2023). https://doi.org/10.1186/s12859-023-05314-z
- Long non-coding RNA
- Immune infiltration
- Alimentary tract malignancies