- Research
- Open access
- Published:
Modelling cell type-specific lncRNA regulatory network in autism with Cycle
BMC Bioinformatics volume 25, Article number: 307 (2024)
Abstract
Background
Autism spectrum disorder (ASD) is a class of complex neurodevelopment disorders with high genetic heterogeneity. Long non-coding RNAs (lncRNAs) are vital regulators that perform specific functions within diverse cell types and play pivotal roles in neurological diseases including ASD. Therefore, exploring lncRNA regulation would contribute to deciphering ASD molecular mechanisms. Existing computational methods utilize bulk transcriptomics data to identify lncRNA regulation in all of samples, which could reveal the commonalities of lncRNA regulation in ASD, but ignore the specificity of lncRNA regulation across various cell types.
Results
Here, we present Cycle (Cell type-specific lncRNA regulatory network) to construct the landscape of cell type-specific lncRNA regulation in ASD. We have found that each ASD cell type is unique in lncRNA regulation, and more than one-third and all cell type-specific lncRNA regulatory networks are characterized as scale-free and small-world, respectively. Across 17 ASD cell types, we have discovered 19 rewired and 11 stable modules, along with eight rewired and three stable hubs within the constructed cell type-specific lncRNA regulatory networks. Enrichment analysis reveals that the discovered rewired and stable modules and hubs are closely related to ASD. Furthermore, more similar ASD cell types tend to be connected with higher strength in the constructed cell similarity network. Finally, the comparison results demonstrate that Cycle is a potential method for uncovering cell type-specific lncRNA regulation.
Conclusion
Overall, these results illustrate that Cycle is a promising method to model the landscape of cell type-specific lncRNA regulation, and provides insights into understanding the heterogeneity of lncRNA regulation between various ASD cell types.
Background
Autism spectrum disorder (ASD) refers to a collection of neurodevelopmental disorders exhibiting profound genetic diversity and complexity [1, 2]. Since childhood, ASD individuals have a wide range of difficulties and deficiencies in social interaction and language communication [3]. Despite striking progress in studying ASD has demonstrated that ASD possesses strong genetic heterogeneity and numerous molecules participate in regulating a series of complex biological processes, including neuronal activity [4] and immune response [2], an understanding of the pathobiology of ASD is still largely unclear. Unlocking the underlying pathogenesis of ASD at the molecular regulatory level holds profound implications in early detection and personalized treatment.
Long non-coding RNAs (lncRNAs) comprise a category of non-coding RNAs that are typically longer than 200 nucleotides, which act as regulators to make significant contributions to neurological diseases, e.g. ASD [3, 5]. In the field of neurobiology, previous studies [3, 6] have revealed that numerous lncRNAs exert biological functions specific to cell types, including neuronal differentiation, synaptic development, and plasticity [7]. In addition, lncRNA regulation also exhibits to be tissue-specific [8], and cell developmental-stage specific [9]. Due to the heterogeneity and complexity in the development of ASD, studying cell type-specific or dynamic lncRNA regulation could provide a new perspective for discovering potential therapeutic strategies for ASD.
Recently, computational methods are promising ways to decipher the function of lncRNAs in modulating ASD-related biological processes. By using bulk transcriptomics data, computational methods for identifying lncRNA regulation can be grouped into three primary categories: sequence-based methods that rely on nucleic acid sequence characteristics, expression-based methods focusing on variations in lncRNA expression levels, and integration-based methods that combine multiple sources of data. Sequence-based methods calculate the binding energy of RNA base pairs to infer lncRNA-target pairs. A prime example is LncTar [10], which utilizes the nearest neighbour thermodynamic model to compute the binding free energy of lncRNA-RNA pairs. Expression-based methods encompass a diverse range of approaches, including statistical methods [11, 12], deep learning methods [13, 14], and causal inference approaches [15]. These methods utilize gene expression profiles to derive and establish lncRNA-target correlation or causality pairs. Alternatively, integration-based methods [16, 17] combine different types of data (e.g., sequence information and expression profiles), thereby enhancing the precision and reliability of lncRNA target prediction. The major limitation of the above methods using bulk transcriptomics data is that they ignore the heterogeneity of lncRNA regulation across various samples (cell lines or tissues). As single-cell and single-nucleus RNA sequencing technology continues to evolve, inferring lncRNA regulation with single-cell or cell type resolution opens a way to explore lncRNA regulation specific to unique cells or cell types in ASD. Regarding cell-specific gene regulation, CSN (Cell-Specific Network) method [18] pioneers the construction of cell-specific networks using single-cell transcriptome data. Subsequently, as an improvement of CSN, c-CSN [19], loc-CSN [20], and p-CSN [21] are also presented to infer conditional, local, and partial cell-specific networks, respectively. Specifically, for exploring cell-specific miRNA regulation, CSmiR [22] has also been developed to investigate single-cell level modulation of miRNA expression. In terms of regulation specific to individual cell types, scHumanNet [23] aims to generate specialized gene regulatory networks (GRNs) for individual cell types by leveraging the information contained in the HumanNet reference interactome [24] and single-cell expression data. Recently, scMTNI [25] integrates single-cell multi-omics datasets to build GRNs specific to cell types across cell lineages. However, these cell-specific or cell type-specific regulation approaches primarily prioritize transcription factor or miRNA regulation over lncRNA regulation. To infer lncRNA regulation specific to biological conditions, CDSlncR [9] could infer lncRNA regulatory networks corresponding to distinct developmental states of the brain neocortex. Given that the pathogenesis of ASD involves a series of cell types and biological processes regulated by lncRNAs, thus it is crucial to study cell type-specific lncRNA regulation in ASD.
To explore the dynamic lncRNA regulation across various ASD cell types, we present a novel method, Cycle (Cell type-specific lncRNA regulatory network), to model cell type-specific lncRNA regulatory networks in ASD. Cycle has two main contributions as follows. Firstly, instead of considering all types of interactions among genes (lncRNAs and mRNAs), Cycle concentrates on identifying interactions between lncRNAs and mRNAs. Secondly, taking the diversity and specificity of cells and cell types into consideration, Cycle identifies lncRNA regulatory networks specific to each cell type.
We have applied Cycle into single-nucleus RNA-sequencing (snRNA-seq) data of ASD brain tissues [26] for modelling the landscape of cell type-specific lncRNA regulation in ASD. Our research has found that each ASD cell type is unique in lncRNA regulation. Notably, over one-third of the cell type-specific lncRNA regulatory networks are scale-free, and all of them exhibit to be small-world. Among 17 ASD cell types, we have inferred 19 rewired modules and 11 stable modules, along with eight rewired hubs and three stable hubs based on the identified cell type-specific lncRNA regulatory networks. Importantly, the discovered rewired and stable modules, and stable hubs are closely associated with ASD. Additionally, ASD cell types that are more similar tend to be strongly connected in the constructed cell similarity network. Finally, our comparison results suggest that Cycle is a promising approach in elucidating cell type-specific lncRNA regulation.
Methods
The flowchart of Cycle
Cycle includes three main components (Fig. 1). Firstly, Cycle conducts data pre-processing, including gene annotation, feature selection, and data splitting to acquire highly expressed lncRNAs and mRNAs in 17 ASD cell types. For each ASD cell type, Cycle further identifies lncRNA regulatory networks specific to it. In total, 17 cell type-specific lncRNA regulatory networks are modelled. Based on the constructed cell type-specific lncRNA regulatory networks, Cycle further infers the rewired and stable modules and hubs. Finally, Cycle performs four types of downstream analyses, including network topological analysis, uniqueness analysis, cell similarity network construction, and enrichment analysis. The details of each component will be described in the following.
Single-nucleus RNA-sequencing data in ASD
The raw snRNA-seq data of ASD [26] is obtained from the Sequence Read Archive (SRA) with accession number PRJNA434002 (https://ncbi.nlm.nih.gov/bioproject/434002), and the analyzed data is from https://autism.cells.ucsc.edu. As a pre-processing step, we obtain the matched lncRNA and mRNA expression data by utilizing gene annotation information from HGNC (HUGO Gene Nomenclature Committee, https://www.genenames.org/), and select lncRNAs and mRNAs whose expression levels are higher than the average expression level across all cells. In total, we have retained 813 lncRNAs and 5,133 mRNAs highly expressed in 52,003 ASD cells. Similar to [26], we annotate the 52,003 ASD cells into 17 cell types based on the expression of known cell type markers. These 17 cell types include oligodendrocyte precursor cells (OPC), oligodendrocytes, microglia cells, fibrous astrocytes (ASTFB), protoplasmic astrocytes (ASTPP), layer 2/3 excitatory neurons (L2/3), layer 4 excitatory neurons (L4), layer 5/6 corticofugal projection neurons (L5/6), layer 5/6 cortico-cortical projection neurons (L5/6-CC), SV2C interneurons (IN-SV2C), somatostatin interneurons (IN-SST), VIP interneurons (IN-VIP), parvalbumin interneurons (IN-PV), endothelial cells, NRGN-expressing neurons (NeuNRGN-I), NRGN-expressing neurons (NeuNRGN-II), and maturing neurons (Neu-mat). Various cell types play different roles in the brain region (see Additional file 1 for the detailed information of 17 ASD cell types).
Identification of cell type-specific lncRNA regulatory networks
For each cell type, modelling cell type-specific networks is grounded upon the identified cell-specific regulatory networks. Hence, Cycle is firstly used to identify cell-specific lncRNA regulatory networks. Here, Cycle adapts CSN [18] with local strategy [20] to quantitatively estimate the correlation strength of lncRNA-mRNA relationship pairs in each cell.
In cell k, \(l_{k}\) and \(m_{k}\) are the expression values of lncRNA \(lncR_{l}\) and mRNA \(mR_{m}\), respectively, and \(\rho_{lm}^{\left( k \right)}\) is calculated as the interaction strength between \(lncR_{l}\) and \(mR_{m}\) in the following:
where N is the number of cells for ASD snRNA-seq data, \(n_{l}^{\left( k \right)}\) and \(n_{m}^{\left( k \right)}\) are the neighbourhood number of \(l_{k}\) and \(m_{k}\) in the bins of cell k for \(lncR_{l}\) and \(mR_{m}\), respectively, and \(n_{lm}^{\left( k \right)}\) is the neighbourhood number of \(\left( {l_{k} ,m_{k} } \right)\) in the interaction bin of cell k.
Owing to the specificity and heterogeneity of cells, self-adaptive window size \(B_{l}^{\left( k \right)}\) and \(B_{m}^{\left( k \right)}\) of bins in cell \(k\) are iteratively generated based on local standard deviations as follows:
where initial \(B_{l} \left( 0 \right)\) and \(B_{m} \left( 0 \right)\) are a quantile range, and \(t\) starts from \(1,2,...\), until convergence is achieved. If the convergence is not achieved during the iterations, \(B_{l}^{\left( k \right)} \left( 1 \right)\) and \(B_{m}^{\left( k \right)} \left( 1 \right)\) will be adopted as window sizes in practice for cell \(k\).
where \(\mu_{lm}^{\left( k \right)} = 0\) represents the mean value of \(\rho_{lm}^{\left( k \right)}\), and \(\sigma_{lm}^{\left( k \right)} = \sqrt {\frac{{n_{l}^{\left( k \right)} n_{m}^{\left( k \right)} \left( {n - n_{l}^{\left( k \right)} } \right)\left( {n - n_{m}^{\left( k \right)} } \right)}}{{n^{4} \left( {n - 1} \right)}}}\) denotes the standard deviation of \(\rho_{lm}^{\left( k \right)}\). Each \(z_{lm}^{\left( k \right)}\) value has a corresponding p-value, and a smaller p value (e.g., p < 0.01) indicates a higher strength between \(lncR_{l}\) and \(mR_{m}\) in cell k.
For each cell, we only focus on the lncRNA-mRNA interactions with statistical significance (e.g., p-value less than 0.01). If a significant lncRNA-mRNA interaction exists in more than 90% of total cells of a cell type, the lncRNA-mRNA interaction is regarded as one of a collection of lncRNA-mRNA interactions for the cell type. By integrating all of the lncRNA-mRNA interactions belonging to individual cell types, Cycle constructs 17 cell type-specific lncRNA-mRNA regulatory networks.
Network topological analysis
Topological analysis contributes to exploring the characteristics and organization of biological networks including lncRNA regulatory networks. Degree and density are two widely used metrics to characterize a biological network. If the node degree distribution of a cell type-specific lncRNA regulatory network adheres to a power-law distribution with a p value of a Kolmogorov–Smirnov test [27] larger than 0.05, the network tends to be a scale-free network. If the characteristic density of a cell type-specific lncRNA regulatory network is higher than that of its corresponding random networks at a significance level (e.g., 0.05), the network is regarded as a small-world network. Here, for each cell type-specific lncRNA regulatory network, we generate 1,000 random networks by randomizing the lncRNA-mRNA interactions. We utilize the Student's t-test for statistically quantifying the differences between the constructed cell type-specific lncRNA regulatory networks and their corresponding random networks. In this work, the igraph R package [28] is applied to analyze the topological attributes of the constructed cell type-specific lncRNA regulatory networks.
Hub lncRNA inference
Hub lncRNAs with high connectivity play key pivot roles in a cell type-specific lncRNA regulatory network. Rather than inferring hubs as those with a node degree exceeding a giving value, we assume that the node degree of lncRNAs follows the Poisson distribution [29,30,31]. For each lncRNA, we calculate the p value of it as follows:
where \(\lambda = np\), \(p = \frac{m}{{A_{n}^{2} }}\), \(n\) is the number of lncRNAs, m is the number of lncRNA-mRNA pairs in a lncRNA–mRNA regulatory network, and \(A_{n}^{2}\) is the number of all possible lncRNA-mRNA interactions. In this work, a lncRNA with a p value less than 0.05 is considered as a hub lncRNA.
Stable and dynamic analysis
We perform stable and dynamic analysis to reveal the commonality and heterogeneity of lncRNA regulation between different ASD cell types. Previous studies [15, 32] have shown that lncRNA regulation is 'on' in some biological conditions but is 'off' in other biological conditions. Here, lncRNA-mRNA interactions or hub lncRNAs existing in at least 90% of ASD cell types are considered as the stable lncRNA regulatory network or hub lncRNAs, and lncRNA-mRNA interactions or hub lncRNAs existing in only one ASD cell type are viewed as the rewired lncRNA regulatory network or hub lncRNAs. To further identify highly connected functional modules within the stable and rewired lncRNA regulatory networks, we have applied the Markov Cluster (MCL) algorithm [33] to discover the stable and rewired lncRNA regulatory modules. In each module, the number of genes (lncRNAs and mRNAs) should be at least three.
Uniqueness of cell type-specific lncRNA regulation
To know whether lncRNA regulation in each ASD cell type is unique or not, we calculate the differences of lncRNA-mRNA regulatory networks (or hub lncRNAs) across ASD cell types. Before computing differences between each pair of ASD cell types, we need to calculate the similarity of lncRNA-mRNA regulatory networks (or hub lncRNAs) across ASD cell types. To assess the similarity of lncRNA-mRNA regulatory networks (or hub lncRNAs) between two ASD cell types, we employ the Simpson model [34] to analyze lncRNA-mRNA interactions (or hub lncRNAs) obtained from ASD cell types \(i\) and \(j\), respectively. This approach generates a similarity value \(sim_{ij}\) that measures the similarity between cell type-specific lncRNA regulatory networks (or hub lncRNAs). The dissimilarity or difference \(dif_{ij}\) between cell type-specific lncRNA regulatory networks (or hub lncRNAs) is then computed as described below.
where \(LR_{i}\) and \(LR_{j}\) are lncRNA-mRNA interactions or hub lncRNAs existing in ASD cell types \(i\) and \(j\), \(\left| {LR_{i} } \right| \cap \left| {LR_{j} } \right|\) represents the intersection number of lncRNA-mRNA interactions or hub lncRNAs between \(LR_{i}\) and \(LR_{j}\), and \(min\left( {\left| {LR_{i} } \right|,\left| {LR_{j} } \right|} \right)\) is the smaller number of lncRNA-mRNA interactions or hub lncRNAs between \(LR_{i}\) and \(LR_{j}\). A larger value of \(dif_{ij}\) denotes a higher uniqueness between ASD cell types \(i\) and \(j\).
Enrichment analysis
To understand the potential biological functions of the stable and rewired lncRNA-mRNA regulatory modules, we conduct enrichment analysis with miRspongeR [35] and clusterProfiler [36] R packages. The databases used for functional enrichment analysis include Gene Ontology (GO) [37], Kyoto Encyclopedia of Genes and Genomes (KEGG) [38], and Reactome Pathway database (Reactome) [39]. Additionally, three disease databases including Disease Ontology (DO) [40], DisGeNET [41], and Network of Cancer Genes (NCG) [42] are also considered for disease enrichment analysis. With regard to hub lncRNAs, we employ RNAenrich [43], a powerful comprehensive web server for ncRNA enrichment, to explore potential pathways, biological processes, and diseases in which they participate. In this work, the enriched KEGG, GO, Reactome, DO, DisGeNET, or NCG term with an adjusted p value < 0.05 (adjusted by the Benjamini–Hochberg approach) is viewed as a significant term.
Results
In this section, we show the application of Cycle in uncovering cell type-specific lncRNA regulation in ASD. The ASD dataset used and code for the reproducibility of the analysis are available at https://github.com/chenchenxiong/Cycle.
The landscape of cell type-specific lncRNA regulation in ASD
To investigate the lncRNA regulation across 17 ASD cell types, we have identified the landscape of cell type-specific lncRNA regulation by following the workflow of Cycle (Fig. 1). We have discovered that the number of lncRNA-mRNA interactions and hub lncRNAs tends to be various across 17 ASD cell types (Fig. 2a). In the case of lncRNA-mRNA interactions, L4 and microglia cells obtain the largest and least number of interactions, respectively. In the case of hub lncRNAs, L2/3 and ASTFB have the largest and least number of hubs, respectively. Network topological analysis displays that 6 out of 17 (~ 35.29%) cell type-specific lncRNA-mRNA regulatory networks adhere to a power law distribution, and all cell type-specific lncRNA-mRNA networks display higher densities compared to their corresponding random networks (Fig. 2b and Additional file 2). These results indicate that over one-third of these cell type-specific lncRNA regulatory networks tend to be scale-free, and all of these cell type-specific lncRNA regulatory networks exhibit to be small-world.
Each ASD cell type is unique in lncRNA regulation
To understand whether the identified lncRNA regulation for each ASD cell type is unique, we calculate the dissimilarity or difference between each pair of ASD cell types. We have found that the identified lncRNA regulatory networks and hub lncRNAs between any pairs of 17 ASD cell types are various, indicating the uniqueness of each cell type (Fig. 3a and b). For the identified lncRNA regulatory networks, nearly half of pairs between 17 cell types (~ 46.32%) have a difference value with more than 0.500. Specifically, the highest difference value (0.951) between 17 cell types is between microglia and NeuNRGN-II (Fig. 3a). For hub lncRNAs, more than one-third pairs between 17 cell types (~ 36.76%) have a difference value with more than 0.500, and the highest difference value (0.844) between 17 cell types also exists between L5/6-CC and Neu-mat (Fig. 3b). These results have suggested that each ASD cell type is unique in lncRNA regulatory networks and hub lncRNAs.
In addition, we have further explored stable and rewired lncRNA-mRNA regulatory networks and hubs across various cell types. As a result, we have found that 115,108 lncRNA-mRNA interactions and eight hub lncRNAs (CPVL-AS2, LINC00343, LINC01202, LINC01619, LINC01811, LINC02301, LINC03013, and SYNPO2L-AS1) only exist in one cell type, and 63 lncRNA-mRNA interactions and three hub lncRNAs (ANKRD17-DT, LINC01572, and MIRLET7BHG) exist in at least 90% cell types (Fig. 3c). In total, we have obtained 115,108 rewired interactions, 63 stable interactions, eight rewired hubs, and three stable hubs across 17 ASD cell types. Overall, the number of rewired interactions or hubs is larger than that of stable interactions or hubs, indicating that lncRNA regulation tends to be dynamic across ASD cell types.
Rewired and stable lncRNA regulatory modules and hub lncRNAs are closely associated with ASD
To identify highly connected functional modules within the identified stable and rewired lncRNA regulatory networks, we have discovered 19 rewired and 11 stable lncRNA regulatory modules by applying MCL algorithm [33]. To reveal the biological significance of the identified rewired and stable modules, we further conduct functional and disease enrichment analysis of them. Enrichment analysis results indicate that most of the rewired and stable lncRNA regulatory modules are functional and significantly enriched in ASD-related biological processes and pathways (Tables 1 and 2, Additional file 3). For example, a large number of enriched terms, including neurodevelopment [44], synaptic transmission [45], cellular signalling [46], and immune response [47] are closely related to the pathogenesis of ASD.
Based on the rewired and stable lncRNA-mRNA regulatory networks, we further infer eight rewired hubs and three stable hubs, and conduct enrichment analysis of them with RNAenrich [43]. Enrichment analysis reveals that the rewired hubs are not significantly enriched in any functional terms, but the stable hub lncRNAs are significantly enriched in 184 KEGG, 3074 GO, and 364 Reactome terms. Particularly, several GO, KEGG or Reactome terms are closely associated with ASD (Table 3, Additional file 3).
Cell similarity network
To understand the similarity of each pair of ASD cell types, we have further constructed a cell similarity network by using the inferred cell type-specific lncRNA-mRNA interactions and hub lncRNAs in each ASD cell type. In this work, if the similarity value of a cell–cell pair is larger than the median value of similarity, the cell–cell pair is considered to be a link in the cell similarity network. As a result, we have found that L4 is similar with the largest number of other ASD cell types, while ASTFB is similar with the least number of other ASD cell types (Fig. 4).
In comparison with the other method
In this section, we compare Cycle with the other three methods, including CDSlncR [9], LncRNA2Target v3.0 [82], and WGCNA [83] in the identification of cell type-specific lncRNA regulation. CDSlncR [9] is the first method to investigate cell type-specific lncRNA regulation. LncRNA2Target [82] predicts lncRNA-mRNA interactions by analyzing the knockdown and overexpression of lncRNAs. WGCNA (Weighted Gene Co-expression Network Analysis) [83] is a co-expression-based prediction method that can be utilized to identify lncRNA-mRNA interactions. In the WGCNA analysis, we set the scale-free topology model fit index (\(R^{2}\)) to 0.85, and filter out lncRNA-mRNA pairs whose topological overlap measure (TOM) similarity values are greater than the median TOM similarity value of all lncRNA-mRNA pairs within each cell type. To ensure fairness, the p value cutoff of CDSlncR, LncRNA2Target, and Cycle is set to be the same. We focus on comparing the number of validated lncRNA-mRNA interactions predicted by Cycle, CDSlncR [9], LncRNA2Target [82], and WGCNA [83]. These experimentally validated lncRNA-mRNA interactions are collected from NPInter v5.0 [84], LncTarD v2.0 [85] and LncRNA2Target v3.0 [82]. For each ASD cell type, Cycle performs the best due to obtaining the largest number of validated lncRNA-mRNA interactions (Fig. 5). The comparison result indicates that Cycle is a promising approach to model cell type-specific lncRNA regulation in ASD.
Discussion
ASD is a set of complex neurodevelopmental disorders that manifest with varying symptoms among individuals. Exploring the regulatory mechanisms of lncRNAs within and between different ASD cell types helps to elucidate the etiology and ontogeny of ASD. In this work, we develop a novel approach (Cycle) to model cell type-specific lncRNA regulatory networks in ASD. For each ASD cell type, we have shown that the lncRNA regulation tends to be unique. The rewired and stable lncRNA regulatory modules and hub lncRNAs are significantly enriched in several ASD-related terms or pathways. In addition, cell similarity networks can help to know which cell types are similar to the least or largest number of other ASD cell types. In comparison with other methods, Cycle performs the best in inferring cell type-specific lncRNA regulation.
In future, Cycle can be further improved in the following four aspects. Firstly, Cycle currently focuses on lncRNA regulation specific to ASD cell types. In future, it is necessary to study other condition-specific lncRNA regulation, e.g., sex-specific or region-specific lncRNA regulation. Secondly, Cycle only infers the association/correlation rather than causal relationships between lncRNAs and mRNAs. In future, we will conduct cell type-specific lncRNA causal regulation research. Thirdly, the enrichment analysis highly depends on the incomplete annotated databases and prior knowledge. Consequently, the enrichment analysis results are biased, which is a common issue of existing computational methods, including Cycle. To alleviate this problem, it is necessary to integrate more functional annotation information in future. Finally, the competing endogenous RNA (ceRNA) hypothesis [86] suggests that lncRNAs can modulate gene expression by acting as ceRNAs, thus it is strongly needed to identify cell type-specific lncRNA-related ceRNA networks for comprehensively understanding lncRNA regulation.
Conclusion
Taken together, Cycle is useful for modelling the landscape of cell types-specific lncRNA regulation in ASD and contributes to elucidating the heterogeneity of lncRNA regulation underlying various ASD cell types.
Availability of data and materials
Cycle is released under the GPL-3.0 License, and is freely available at https://github.com/chenchenxiong/Cycle. The raw snRNA-seq data of ASD [26] is accessed at Sequence Read Archive with accession number PRJNA434002 (https://ncbi.nlm.nih.gov/bioproject/434002), and the analyzed snRNA-seq data is from https://autism.cells.ucsc.edu. The lists of all the data used in this study are available in the Additional files
Abbreviations
- CSN:
-
Cell-specific network
- ceRNA:
-
Competing endogenous RNA
- DO:
-
Disease ontology
- GO:
-
Gene ontology
- HGNC:
-
HUGO gene nomenclature committee
- KEGG:
-
Kyoto encyclopedia of genes and genomes
- L2/3:
-
Layer 2/3 excitatory neurons
- L4:
-
Layer four excitatory neurons
- L5/6:
-
Layer 5/6 corticofugal projection neurons
- L5/6-CC:
-
Layer 5/6 cortico-cortical projection neurons
- lncRNAs:
-
Long non-coding RNAs
- MCL:
-
Markov cluster
- Neu-mat:
-
Maturing neurons
- Neu-NRGN-I:
-
NRGN-expressing neurons (type I)
- Neu-NRGN-II:
-
NRGN-expressing neurons (type II)
- OPC:
-
Oligodendrocyte precursor cells
- IN-PV:
-
Parvalbumin interneurons
- ASTFB:
-
Fibrous astrocytes
- ASTPP:
-
Protoplasmic astrocytes
- Reactome:
-
Reactome pathway database
- SRA:
-
Sequence read archive
- snRNA-seq:
-
Single-nucleus RNA-sequencing
- IN-SST:
-
Somatostatin interneurons
- IN-SV2C:
-
SV2C interneurons
- IN-VIP:
-
VIP interneurons
- NCG:
-
Network of cancer genes
References
Lai MC, Lombardo MV, Baron-Cohen S. Autism. Lancet. 2014;383:896–910.
Lord C, Brugha TS, Charman T, Cusack J, Dumas G, Frazier T, et al. Autism spectrum disorder. Nat Rev Dis Primers. 2020;6:5.
Parikshak NN, Swarup V, Belgard TG, Irimia M, Ramaswami G, Gandal MJ, et al. Genome-wide changes in lncRNA, splicing, and regional gene expression patterns in autism. Nature. 2016;540:423–7.
Paulsen B, Velasco S, Kedaigle AJ, Pigoni M, Quadrato G, Deo AJ, et al. Autism genes converge on asynchronous development of shared neuron classes. Nature. 2022;602:268–73.
Mattick JS, Amaral PP, Carninci P, Carpenter S, Chang HY, Chen LL, et al. Long non-coding RNAs: definitions, functions, challenges and recommendations. Nat Rev Mol Cell Biol. 2023;24:430–47.
Soutschek M, Schratt G. Non-coding RNA in the wiring and remodeling of neural circuits. Neuron. 2023;111:2140–54.
Samaddar S, Banerjee S. Far from the nuclear crowd: cytoplasmic lncRNA and their implications in synaptic plasticity and memory. Neurobiol Learn Mem. 2021;185: 107522.
Mattioli K, Volders PJ, Gerhardinger C, Lee JC, Maass PG, Melé M, et al. High-throughput functional analysis of lncRNA core promoters elucidates rules governing tissue specificity. Genome Res. 2019;29:344–55.
Huang M, Ma J, Zhang J. Inferring cell developmental stage-specific lncRNA regulation in the developing human neocortex with CDSlncR. Front Mol Neurosci. 2023;15:1037565.
Li J, Ma W, Zeng P, Wang J, Geng B, Yang J, et al. LncTar: a tool for predicting the RNA targets of long noncoding RNAs. Brief Bioinform. 2015;16:806–12.
Liao Q, Liu C, Yuan X, Kang S, Miao R, Xiao H, et al. Large-scale prediction of long non-coding RNA functions in a coding-non-coding gene co-expression network. Nucleic Acids Res. 2011;39:3864–78.
Xu X, Yu J, Huang J, Wang R, Kuang X, Dang L, et al. Comprehensive analysis of lncRNA-mRNA co-expression networks in HPV-driven cervical cancer reveals the pivotal function of LINC00511-PGK1 in tumorigenesis. Comput Biol Med. 2023;159: 106943.
Zhao T, Hu Y, Peng J, Cheng L. DeepLGP: a novel deep learning method for prioritizing lncRNA target genes. Bioinformatics. 2020;36:4466–72.
Kim Y, Lee M. Deep learning approaches for lncRNA-mediated mechanisms: a comprehensive review of recent developments. Int J Mol Sci. 2023;24:10299.
Zhang J, Le TD, Liu L, Li J. Inferring and analyzing module-specific lncRNA-mRNA causal regulatory networks in human cancer. Brief Bioinform. 2019;20:1403–19.
Lv L, Wei M, Lin P, Chen Z, Gong P, Quan Z, et al. Integrated mRNA and lncRNA expression profiling for exploring metastatic biomarkers of human intrahepatic cholangiocarcinoma. Am J Cancer Res. 2017;7:688–99.
Zheng Y, Luo H, Teng X, Hao X, Yan X, Tang Y, et al. NPInter v5.0: ncRNA interaction database in a new era. Nucleic Acids Res. 2023;51:D232-96.
Dai H, Li L, Zeng T, Chen L. Cell-specific network constructed by single-cell RNA sequencing data. Nucleic Acids Res. 2019;47: e62.
Li L, Dai H, Fang Z, Chen L. c-CSN: single-cell RNA sequencing data analysis by conditional cell-specific network. Genom Proteom Bioinform. 2021;19:319–29.
Wang X, Choi D, Roeder K. Constructing local cell-specific networks from single-cell data. Proc Natl Acad Sci U S A. 2021;118: e2113178118.
Wang Y, Xuan C, Wu H, Zhang B, Ding T, Gao J. P-CSN: single-cell RNA sequencing data analysis by partial cell-specific network. Briefings Bioinform. 2023;24:bbad180.
Zhang J, Liu L, Xu T, Zhang W, Zhao C, Li S, et al. Exploring cell-specific miRNA regulation with single-cell miRNA-mRNA co-sequencing data. BMC Bioinform. 2021;22:578.
Cha J, Yu J, Cho J-W, Hemberg M, Lee I. scHumanNet: a single-cell network analysis platform for the study of cell-type specificity of disease genes. Nucleic Acids Res. 2022;51: e8.
Kim C, Baek S, Cha J, Yang S, Kim E, Marcotte EM, et al. HumanNet v3: an improved database of human gene networks for disease research. Nucleic Acids Res. 2022;50:D632–9.
Zhang S, Pyne S, Pietrzak S, Halberg S, McCalla SG, Siahpirani AF, et al. Inference of cell type-specific gene regulatory networks on cell lineages from single cell omic datasets. Nat Commun. 2023;14:3064.
Velmeshev D, Schirmer L, Jung D, Haeussler M, Perez Y, Mayer S, et al. Single-cell genomics identifies cell type-specific molecular changes in autism. Science. 2019;364:685–9.
Bartels RH, Horn SD, Liebetrau AM, Harris WL. A computational investigation of conover’s kolmogorov-smirnov test for discrete distributions. J Stat Comput Simul. 1978;7:151–61.
Gabor C, Tamas N, Vincent T, Szabolcs H. The igraph software package for complex network research. InterJournal. 2006;Complex Systems:1695.
Jiang W, Li X, Rao S, Wang L, Du L, Li C, et al. Constructing disease-specific gene networks using pair-wise relevance metric: application to colon cancer identifies interleukin 8, desmin and enolase 1 as the central elements. BMC Syst Biol. 2008;2:72.
Zhang J, Le Duy T, Liu L, He J, Li J. Identifying miRNA synergistic regulatory networks in heterogeneous human data via network motifs. Mol Biosyst. 2016;12:454–63.
Barabási AL, Oltvai ZN. Network biology: understanding the cell’s functional organization. Nat Rev Genet. 2004;5:101–13.
Peng WX, Koirala P, Mo YY. LncRNA-mediated regulation of cell signaling in cancer. Oncogene. 2017;36:5661–7.
Enright AJ, Van Dongen S, Ouzounis CA. An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res. 2002;30:1575–84.
Zhang J, Liu L, Li J, Le T. LncmiRSRN: identification and analysis of long non-coding RNA related miRNA sponge regulatory network in human cancer. Bioinform (Oxford, England). 2018;34:4232–40.
Zhang J, Liu L, Xu T, Xie Y, Zhao C, Li J, et al. miRspongeR: an R/Bioconductor package for the identification and analysis of miRNA sponge interaction networks and modules. BMC Bioinform. 2019;20:235.
Yu G, Wang L, Han Y, He Q. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS. 2012;16:284–7.
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, et al. Gene ontology: tool for the unification of biology. Gene Ontol Consort Nat Genet. 2000;25:25–9.
Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28:27–30.
Fabregat A, Jupe S, Matthews L, Sidiropoulos K, Gillespie M, Garapati P, et al. The reactome pathway knowledgebase. Nucleic Acids Res. 2018;46:D649–55.
Schriml LM, Munro JB, Schor M, Olley D, McCracken C, Felix V, et al. The human disease ontology 2022 update. Nucleic Acids Res. 2022;50:D1255–61.
Piñero J, RamÃrez-Anguita JM, Saüch-Pitarch J, Ronzano F, Centeno E, Sanz F, et al. The DisGeNET knowledge platform for disease genomics: 2019 update. Nucleic Acids Res. 2020;48:D845–55.
Repana D, Nulsen J, Dressler L, Bortolomeazzi M, Venkata SK, Tourna A, et al. The network of cancer genes (NCG): a comprehensive catalogue of known and candidate cancer genes from cancer sequencing screens. Genome Biol. 2019;20:1.
Zhang S, Amahong K, Zhang Y, Hu X, Huang S, Lu M, et al. RNAenrich: a web server for non-coding RNA enrichment. Bioinformatics. 2023;39:btad421.
Willsey HR, Willsey AJ, Wang B, State MW. Genomics, convergent neuroscience and progress in understanding autism spectrum disorder. Nat Rev Neurosci. 2022;23:323–41.
Sacai H, Sakoori K, Konno K, Nagahama K, Suzuki H, Watanabe T, et al. Autism spectrum disorder-like behavior caused by reduced excitatory synaptic transmission in pyramidal neurons of mouse prefrontal cortex. Nat Commun. 2020;11:5140.
Jiang C, Lin L, Long S, Ke X, Fukunaga K, Lu Y, et al. Signalling pathways in autism spectrum disorder: mechanisms and therapeutic implications. Sig Transduct Target Ther. 2022;7:1–36.
Meltzer A, Van de Water J. The role of the immune system in autism spectrum disorder. Neuropsychopharmacol. 2017;42:284–98.
Hu Y, Ehli EA, Boomsma DI. MicroRNAs as biomarkers for psychiatric disorders with a focus on autism spectrum disorder: current progress in genetic association studies, expression profiling, and translational research. Autism Res. 2017;10:1184–203.
Sharma A, Mehan S. Targeting PI3K-AKT/mTOR signaling in the prevention of autism. Neurochem Int. 2021;147: 105067.
Vithayathil J, Pucilowska J, Landreth GE. Chapter 3 - ERK/MAPK signaling and autism spectrum disorders. In: Shekhar A, editor. Progress in Brain Research. Elsevier; 2018. p. 63–112.
Gamal El-Din TM, Lantin T, Tschumi CW, Juarez B, Quinlan M, Hayano JH, et al. Autism-associated mutations in KV7 channels induce gating pore current. Proc Natl Acad Sci. 2021;118: e2112666118.
Wang Z, Fan F, Li Z, Ye F, Wang Q, Gao R, et al. Structural insights into the functional mechanism of the ubiquitin ligase E6AP. Nat Commun. 2024;15:3531.
Fatemi SH, Folsom TD. Autism spectrum disorders and ataxia. In: Manto M, Schmahmann JD, Rossi F, Gruol DL, Koibuchi N, editors. Handbook of the cerebellum and cerebellar disorders. Dordrecht: Springer, Netherlands; 2013. p. 1895–906.
Wegiel J, Flory M, Kaczmarski W, Brown WT, Chadman K, Wisniewski T, et al. Partial agenesis and hypoplasia of the corpus callosum in idiopathic autism. J Neuropathol Exp Neurol. 2017;76:225–37.
Bloch-Gallego E, Anderson DI. Key role of Rho GTPases in motor disorders associated with neurodevelopmental pathologies. Mol Psychiatry. 2023;28:118–26.
Shen L, Lin Y, Sun Z, Yuan X, Chen L, Shen B. Knowledge-guided bioinformatics model for identifying autism spectrum disorder diagnostic microRNA biomarkers. Sci Rep. 2016;6:39663.
Zhong C, Rando J, Patti MA, Braun JM, Chen A, Xu Y, et al. Gestational thyroid hormones and autism-related traits in the EARLI and HOME studies. Autism Res. 2024;17:716–27.
Rani N, Nowakowski TJ, Zhou H, Godshalk SE, Lisi V, Kriegstein AR, et al. A primate lncRNA mediates notch signaling during neuronal development by sequestering miRNA. Neuron. 2016;90:1174–88.
Ballester-Navarro P, MartÃnez-Madrid MJ, Javaloyes-SanchÃs A, Belda-Cantó C, Aguilar V, Inda M-M, et al. Interplay of circadian clock and melatonin pathway gene variants in adults with autism, intellectual disability and sleep problems. Res Autism Spectr Disorders. 2021;81: 101715.
Fombonne E. Epidemiology of pervasive developmental disorders. Pediatr Res. 2009;65:591–8.
Verhagen MG, Pasterkamp RJ. Chapter 5 - Axon guidance: semaphorin/neuropilin/plexin signaling. In: Rubenstein J, Rakic P, Chen B, Kwan KY, Kolodkin A, Anton E, editors. Cellular migration and formation of axons and dendrites (Second Edition). Academic Press; 2020. p. 109–22.
Bagni C, Zukin RS. A synaptic perspective of fragile x syndrome and autism spectrum disorders. Neuron. 2019;101:1070–88.
Percy AK. Rett syndrome: exploring the autism link. Arch Neurol. 2011;68:985–9.
Trillingsgaard A, ØStergaard JR. Autism in Angelman syndrome: an exploration of comorbidity. Autism. 2004;8:163–74.
Rosina E, Battan B, Siracusano M, Di Criscio L, Hollis F, Pacini L, et al. Disruption of mTOR and MAPK pathways correlates with severity in idiopathic autism. Transl Psychiatry. 2019;9:1–10.
Hudson CC, Hall L, Harkness KL. Prevalence of depressive disorders in individuals with autism spectrum disorder: a meta-analysis. J Abnorm Child Psychol. 2019;47:165–75.
Chang Q, Yang H, Wang M, Wei H, Hu F. Role of microtubule-associated protein in autism spectrum disorder. Neurosci Bull. 2018;34:1119–26.
White SW, Oswald D, Ollendick T, Scahill L. Anxiety in children and adolescents with autism spectrum disorders. Clin Psychol Rev. 2009;29:216–29.
Brown DG, Murphy M, Cadeddu R, Bell R, Weis A, Chiaro T, et al. Colitis reduces active social engagement in mice and is ameliorated by supplementation with human microbiota members. Nat Commun. 2024;15:2769.
Lingampelly SS, Naviaux JC, Heuer LS, Monk JM, Li K, Wang L, et al. Metabolic network analysis of pre-ASD newborns and 5-year-old children with autism spectrum disorder. Commun Biol. 2024;7:1–22.
Foerster S, Floriddia EM, van Bruggen D, Kukanja P, Hervé B, Cheng S, et al. Developmental origin of oligodendrocytes determines their function in the adult brain. Nat Neurosci. 2024;27:1–10.
McFadden K, Minshew NJ. Evidence for dysregulation of axonal growth and guidance in the etiology of ASD. Front Hum Neurosci. 2013;7:671.
Wilfert AB, Turner TN, Murali SC, Hsieh P, Sulovari A, Wang T, et al. Recent ultra-rare inherited variants implicate new autism candidate risk genes. Nat Genet. 2021;53:1125–34.
Nisar S, Bhat AA, Masoodi T, Hashem S, Akhtar S, Ali TA, et al. Genetics of glutamate and its receptors in autism spectrum disorder. Mol Psychiatry. 2022;27:2380–92.
Gao J, Pickett HA. Targeting telomeres: advances in telomere maintenance mechanism-specific cancer therapies. Nat Rev Cancer. 2022;22:515–32.
Amestoy A, Baudrillard C, Briot K, Pizano A, Bouvard M, Lai M-C. Steroid hormone pathways, vitamin D and autism: a systematic review. J Neural Transm (Vienna). 2023;130:207–41.
Sans N, Ezan J, Moreau MM, Montcouquiol M. Chapter 13 - planar cell polarity gene mutations in autism spectrum disorder, intellectual disabilities, and related deletion/duplication syndromes. In: Sala C, Verpelli C, editors. Neuronal and synaptic dysfunction in autism spectrum disorder and intellectual disability. San Diego: Academic Press; 2016. p. 189–219.
Khera R, Mehan S, Kumar S, Sethi P, Bhalla S, Prajapati A. Role of JAK-STAT and PPAR-gamma signalling modulators in the prevention of autism and neurological dysfunctions. Mol Neurobiol. 2022;59:3888–912.
Malik M, Tauqeer Z, Sheikh AM, Wen G, Nagori A, Yang K, et al. NF-κB signaling in the brain of autistic subjects. Mediators Inflamm. 2011;2011: 785265.
Rademacher S, Eickholt BJ. PTEN in autism and neurodevelopmental disorders. Cold Spring Harb Perspect Med. 2019;9: a036780.
Pourtavakoli A, Ghafouri-Fard S. Calcium signaling in neurodevelopment and pathophysiology of autism spectrum disorders. Mol Biol Rep. 2022;49:10811–23.
Cheng L, Wang P, Tian R, Wang S, Guo Q, Luo M, et al. LncRNA2Target v2.0: a comprehensive database for target genes of lncRNAs in human and mouse. Nucleic Acids Res. 2019;47:D140-4.
Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinform. 2008;9:559.
Zheng Y, Luo H, Teng X, Hao X, Yan X, Tang Y, et al. NPInter v5.0: ncRNA interaction database in a new era. Nucleic Acids Res. 2022;51:D232-9.
Zhao H, Yin X, Xu H, Liu K, Liu W, Wang L, et al. LncTarD 2.0: an updated comprehensive database for experimentally-supported functional lncRNA-target regulations in human diseases. Nucleic Acids Res. 2023;51:D199-207.
Salmena L, Poliseno L, Tay Y, Kats L, Pandolfi PP. A ceRNA hypothesis: the Rosetta stone of a hidden RNA language? Cell. 2011;146:353–8.
Acknowledgements
We would like to thank the reviewers for their valuable comments, which helped improve the work substantially.
Funding
This work has been supported by the National Natural Science Foundation of China (Grant Number: 61963001), the Yunnan Xingdian Talents Support Plan—Young Talents Program, the Applied Basic Research Foundation of Science and Technology of Yunnan Province (Grant Number: 202101BA070001-221), and the Doctoral Scientific Research Foundation of Dali University.
Author information
Authors and Affiliations
Contributions
CX and JZ conceived the idea of this work. HY, XW, and CZ refined the idea. CX and JZ designed and performed the experiments. MZ, HY, XW, and CZ participated in the design of the study and performed the statistical analysis. CX, HY, XW, CZ, and JZ drafted the manuscript. All authors revised the manuscript. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Xiong, C., Zhang, M., Yang, H. et al. Modelling cell type-specific lncRNA regulatory network in autism with Cycle. BMC Bioinformatics 25, 307 (2024). https://doi.org/10.1186/s12859-024-05933-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12859-024-05933-0