Dynamically characterizing individual clinical change by the steady state of disease-associated pathway

Sun, Shaoyan; Yu, Xiangtian; Sun, Fengnan; Tang, Ying; Zhao, Juan; Zeng, Tao

doi:10.1186/s12859-019-3271-x

Volume 20 Supplement 25

Proceedings of the 2018 International Conference on Intelligent Computing (ICIC 2018) and Intelligent Computing and Biomedical Informatics (ICBI) 2018 conference: bioinformatics

Research
Open access
Published: 24 December 2019

Dynamically characterizing individual clinical change by the steady state of disease-associated pathway

Shaoyan Sun¹^na1,
Xiangtian Yu^2,3^na1,
Fengnan Sun⁴,
Ying Tang³,
Juan Zhao³ &
…
Tao Zeng^3,5

BMC Bioinformatics volume 20, Article number: 697 (2019) Cite this article

1202 Accesses
1 Citations
1 Altmetric
Metrics details

Abstract

Background

Along with the development of precision medicine, individual heterogeneity is attracting more and more attentions in clinical research and application. Although the biomolecular reaction seems to be some various when different individuals suffer a same disease (e.g. virus infection), the final pathogen outcomes of individuals always can be mainly described by two categories in clinics, i.e. symptomatic and asymptomatic. Thus, it is still a great challenge to characterize the individual specific intrinsic regulatory convergence during dynamic gene regulation and expression. Except for individual heterogeneity, the sampling time also increase the expression diversity, so that, the capture of similar steady biological state is a key to characterize individual dynamic biological processes.

Results

Assuming the similar biological functions (e.g. pathways) should be suitable to detect consistent functions rather than chaotic genes, we design and implement a new computational framework (ABP: Attractor analysis of Boolean network of Pathway). ABP aims to identify the dynamic phenotype associated pathways in a state-transition manner, using the network attractor to model and quantify the steady pathway states characterizing the final steady biological sate of individuals (e.g. normal or disease). By analyzing multiple temporal gene expression datasets of virus infections, ABP has shown its effectiveness on identifying key pathways associated with phenotype change; inferring the consensus functional cascade among key pathways; and grouping pathway activity states corresponding to disease states.

Conclusions

Collectively, ABP can detect key pathways and infer their consensus functional cascade during dynamical process (e.g. virus infection), and can also categorize individuals with disease state well, which is helpful for disease classification and prediction.

Introduction

The general gene-set or gene module can be used to find combinatory biomarkers or signatures to indicate particular phenotypic change of a biological system, e.g. the disease development or worsening [1,2,3]. However, such ab initio predictions usually have less interpretability in biological researches [4]. Different from these conventional methods, the pathway centered analysis can provide a new way to balance the discovery of interpretable biological functions and the detection of new pathway elements. Thus, these approaches could highlight the conditional importance of prior-known pathways [5] in a phenotype change/ transition [6], and also uncover its new candidate underlying functions [7].

Biological pathway consists of a set of interactive genes or other biomolecules, which is well-known to execute a series of functional cascades for particular cellular response/outcome [8]. Nowadays, there are many carefully curated pathways available to represent creditable functional compositions and interactions [9], which are expected to targetedly capture the permutation of established biological functions involved in the phenotype changes [10, 11].

Indeed, pathway centered models and methods have been widely applied in diverse biological and clinical studies. For instance, the well-known gene set enrichment analysis (GSEA) [12] can recognize dys-regulated pathway according to the measurement of status change of a pathway. Similarly, many pathway-level aggregation methods have been designed to investigate the biological signatures of different phenotypes on the pathway activity level [13]. PAGODA can reveal multiple overlapping aspects of transcriptional heterogeneity for coordinated variability amongst tested cells, by evaluating biological pathways as persistent cell-type specific features or transient processes [14]. The mendelian randomization-based pathway enrichment analysis (MRPEA), has developed a pathway association analysis method for correcting the genetic confounding effects of environmental exposures during the genetic studies of human complex diseases [15]. Pathifier is a principle curve based algorithm to infer pathway deregulation scores for each tumor sample on the basis of expression data by transforming gene-level information into pathway-level information [16]. A silico Pathway Activation Network Decomposition Analysis (iPANDA) is designed for robust biomarker identification from gene expression data, which estimates the pathway activation scores based on the degree of differential gene expression and pathway topology decomposition [17]. There are many further improvements on these pathway-centered computational approaches and their applications by machine learning ideas and technologies [18,19,20]. A network-based pathway-expanding approach is applied to take the topological structures of biological networks into account [21]; especially, the interaction between internal and external genes of the pathway and between pathways, can accurately and reliably identify significant pathways related to the corresponding disease [22]. And a multiple kernel learning has been proposed to feature selection by separately per data type and by pathway membership, and this maximizes the amount of information used to build effective prognostic prediction due to its usage of all available data [23].

Actually, there exists some pathway-related studies based on the time-course experiment and data, e.g. a pathway-based phylogenetic approach [24], a web-based software program Network Painter [25], and a new Wnt signaling time-fit model [26]. However, there is still urgent requirement on the computational characterization of interactive pathways and their phenotype-associated states, which should provide a new viewpoint of network of networks for a biological system [27], and should also supply new strategy to state prediction for the phenotype change of a biological system.

Here, as shown in Fig. 1, we present a composite computational framework (ABP: Attractor analysis of Boolean network of Pathway) to investigate the dynamical biological process (e.g. state transition of biological systems). ABP reconstructs the Boolean network (BN) of pathways by the pathway activity profiles, and then applies the BN attractors to indicate the steady pathway states, which can quantitatively characterize the biological system status corresponding to different conditions or phenotypes (e.g. normal and disease). By wide evaluation on time-course gene expression datasets of virus infections, ABP has shown its efficiency and accuracy on identifying key phenotype-associated pathways subjected to virus infection; and rebuilding these key pathways’ consensus functional cascade during individual specific virus infection; and especially categorizing individuals with disease state well, which should be helpful for disease classification and prediction in personalized medicine.

Methods

Our proposed ABP framework consists of a few main steps and parameters as illustrated in Fig. 1:

(1)
Normalization of gene expression data produced by high-throughput technologies, e.g. RNA sequencing or microarray;
(2)
Quantification of pathway activity according to pathway measuring approaches, i.e. GSVA using the expression data of each sample and gene-sets from KEGG database;
(3)
Selection of key phenotype-associated pathways through differential significance test (P < 0.05), where the number of finally selected pathways is dependent on the maximal number of significantly dys-regulated pathways at different time points;
(4)
Extraction of individual specific time-course pathway activity profile (i.e. every individual would have multiple samples collected at consecutive time points), thus, the organization of pathway activity profile is a cubic dataset, including key pathways, individuals, and time points; of note, the number of key pathways should be less than the number of time points, which is required by the follow-up BN construction;
(5)
Reconstruction of individual specific Boolean network based on corresponding time-course pathway activity profile, which is calculated by BoolNet method, using binarizeTimeSeries function with “scanStatistic” method, windowSize with intervals from 0.1 to 0.2, sign.level with intervals from 0.05 to 0.15; and applying reconstructNetwork function with “bestfit” method and default arguments.
(6)
Extraction of individual specific attractors corresponding to each Boolean network, which represent the individual specific steady pathway state so as to distinguish dissimilar phenotypes by unsupervised (i.e. HCL) or supervised (i.e. SVM with radial kernel) methods.

These key steps applied in above framework will be introduced in details in below.

The calculation of pathway activity scores

Due to the reduced costs of high-throughput technologies, omics data is being produced under more complex experimental designs with multiple phenotypic and/or clinical samples. The Gene Set Variation Analysis (GSVA) [28] allows to assess the underlying pathway activity variation by transforming the gene by sample matrix into a gene set by sample matrix, which just provide the relative enrichment (scores) of pathways across the samples to support follow-up traditional analytical methods in a pathway focused manner, rather than the only qualitative enrichment with respect to a phenotype change. In this study, we apply GSVA to obtain the activity scores of KEGG pathways at multiple time points, which will be further used in following network and dynamic analysis on pathway level rather than gene level.

Notably, several approaches have proposed some pathway-level aggregation methods, but, it still remains unclear how they compare with one another due to limited evaluations. One recent benchmarking investigation has pointed that there is actually necessity for further improving pathway-level aggregation [29]. As known, there were remarkable outcome differences for different pathway tools even when the same data input. For example, the results of a tested approach were typically consistent across different datasets in cancer studies, yet different between methods [30].

Thus, any alternative kinds of activity score can also be used in this step if necessary in particular analysis routines, which would supply tunable analysis strategies dependent on researcher’s objective and experience.

The rank of dys-regulated pathways

With an assumption of that the subjects / individuals with the same clinical symptoms should have the similar molecular / pathway reaction mechanism, the pathway activity may be various in individuals, but the final steady state of a network among pathways should be similar among the same symptom group or distinguishing between different symptom groups. Thus, we rank the differentially activated pathways between different phenotypic individual groups at each time point, which are dependent on the difference test of the activity scores by Wilcoxon rank sum test.

The selection of key pathways during dynamical biological process

Then, according to the number of high-ranked dys-regulated pathways at each time point, we determine one critical time point, and the pathways significantly enriched (e.g. observed) at this time point are key pathways. These selected key pathways and their activities across consecutive time points can be regarded as the state indices which could trace the active pathway during a dynamical process, even in each subject. Noted, the final state or converged state (rather than currently observed state) of one pathway is characterized and quantified by the steady sate of this pathway in a subject, and this steady sate is modeled by network attractor and deduced from following Boolean network analysis.

The re-construction of Boolean network among key pathways

To characterize the dynamical features of a biological system among above filtered key pathways, a network of pathways is constructed based on Boolean network model. At the starting point for a Boolean Network Model, N kinds of interacting nodes (i.e. pathways or meta-genes) are collected, the states of which are modeled as either “on” (active, or up-regulated) or “off” (inactive, or down-regulated). Then, at any given time, the system of such N key pathways is just in a system- or network-state. Along with a dynamical process, the system will change from one state to another depending on the interactions between these pathways. Thus, from any start state, the system is assumed to happen a confirmed sequence of state changes and ends up in a stable state (known as an attractor), and this sequence or trajectory of such system state change is defined as a Boolean process corresponding to discretized time-course data.

In practice, the Boolean network model is applied to convert the gene or pathway expression snapshots over a time course into a Boolean form by noting which genes or pathways are active or not. The dynamics of a Boolean network (BN) model (simply using the current state to determine the next state) can be described as follows:

$$ {\mathrm{s}}_{\mathrm{i}}\left(\mathrm{t}+1\right)=\left\{\begin{array}{c}1\ \sum \limits_{\mathrm{j}}{\mathrm{a}}_{\mathrm{j}\mathrm{i}}{\mathrm{s}}_{\mathrm{j}}\left(\mathrm{t}\right)>0\\ {}0\ \sum \limits_{\mathrm{j}}{\mathrm{a}}_{\mathrm{j}\mathrm{i}}{\mathrm{s}}_{\mathrm{j}}\left(\mathrm{t}\right)<0\\ {}{\mathrm{s}}_{\mathrm{i}}\left(\mathrm{t}\right)\ \sum \limits_{\mathrm{j}}{\mathrm{a}}_{\mathrm{j}\mathrm{i}}{\mathrm{s}}_{\mathrm{j}}\left(\mathrm{t}\right)=0\end{array}\right. $$

where, the s_i(t) represents the state of some variable (e.g. a node in network) i at time point t; a_ji points the influence weight of another variable j for this variable i; thus, the state of variable i at time point t + 1 would be determined by all other variables’ states at the time point ahead, i.e. at time point t.

In this study, the BoolNet package is used for the construction and evaluation of Boolean networks, which has been successfully applied in many biological and biomedical researches [31]. In particular, BoolNet is developed for the reconstruction and analysis of binary gene-regulatory networks: (i) the Network reconstruct function is used for inferring Boolean networks from temporal gene expression or pathway activity profiles by popular reconstruction algorithms; and (ii) the binarize Time Series function is used for binarizing the real-valued time series from these reconstruction algorithms. Noted, the tuning parameters used in BoolNet are searched by grid method and determined by the follow-up state clustering performance. Of course, some alternative Boolean network analysis methods are worthy of future benchmark investigations.

The biological state clustering based on attractors of temporal network of key pathways

As assumed, the attractors represent a final steady state of a biological system, e.g. the success-infection or failure-infection states during virus infection. Obviously, these final states are expected to classify or even predict the particular phenotypes of individuals. Here, it is relevant to evaluate the indicative power of those state features rather than expression features, so that, the clustering of those states and expressions would be applicable and comparable, e.g. hierarchical clustering of pathway states and gene expressions respectively. However, in math terms, a state of an attractor is just a binary vector (attractor with simple structure) or a group of vectors (attractor with loop structure), so the conventional clustering method is not applied before we can supply the distance matrix among states. To generally measure the distance between two states (attractors), the canonical-correlation analysis (CCA) is used to infer the association information from cross-covariance matrices, which can indicate the association (distance) between two groups of vectors, i.e. two attractors.

Given two matrix X and Y, canonical-correlation analysis is to seek two vectors a and b to maximize the correlation between a ′ X and b ′ Y, i.e.

$$ \underset{\mathrm{a},\mathrm{b}}{\max}\uprho \left({\mathrm{a}}^{\prime}\mathrm{X},\mathrm{b}^{\prime}\mathrm{Y}\right)=\underset{\mathrm{a},\mathrm{b}}{\max}\frac{\mathrm{a}^{\prime }{\Sigma}_{\mathrm{XY}}\mathrm{b}}{\sqrt{\mathrm{a}^{\prime }{\Sigma}_{\mathrm{XX}}\mathrm{a}}\sqrt{\mathrm{b}^{\prime }{\Sigma}_{\mathrm{YY}}\mathrm{b}}} $$

where, Σ_XX = Cov(X, X), Σ_YY = Cov(Y, Y), Σ_XY = Cov(X, Y), and Cov(., .) is covariance matrix.

Then, the distance matrix converted from CCA matrix between multiple attractors corresponding to multiple individual Boolean networks, is used to build the hierarchical tree. Two clusters are divided, expecting one corresponds to Sx group and the other one corresponds to Asx group. And the clustering accuracy, defined as the application efficiency, is evaluated by the index Acc [32]. Given samples in Sx and Asx groups that are:

$$ {\left\{{\mathrm{S}}_{\mathrm{i}}\right\}}_{\mathrm{i}=1}^2 $$

where the state-based individuals clustering also provide two individual clusters:

$$ {\left\{{\mathrm{C}}_{\mathrm{j}}\right\}}_{\mathrm{j}=1}^2 $$

Then, the application efficiency of network reconstruction is calculated as:

$$ \mathrm{Acc}=\frac{\underset{\uptau \left(\left[1,2\right]\right)}{\min}\sum \limits_{\mathrm{j}=1}^2\mid {\mathrm{C}}_{\mathrm{j}}\sqcap {\mathrm{S}}_{\uptau \left(\mathrm{j}\right)}\mid }{\sum \limits_{\mathrm{i}=1}^2\mid {\mathrm{S}}_{\mathrm{i}}\mid } $$

Besides, the performance measurement of SVM adopts the AUC with random 5-fold cross-validation, which is used to evaluate the robustness of SVM features (i.e. pathway states as indicators).

Results and discussion

Instruction of datasets and experimental steps

To assess ABP, we collected and analyzed serially sampled gene expression data from a challenge study [27], noted as Rhinovirus UVA data. This data mainly contains 20 human volunteers (subjects) inoculated with live human rhinovirus (HRV), and each subject was serially sampled for a few days quantifying temporal whole blood gene expression by Affymetrix GeneChips technology, clinical symptom scores self-reported over 8–10 symptoms, and viral shedding from periodic nasopharyngeal titrations [33]. Each subject has different samples on 15 time points, one time point before viral Inoculum and the other 14 time points after inoculation. Each subject was designated as a symptomatic subject (Sx) or an asymptomatic subject (Asx) by a modified Jackson score based on these clinical symptoms self-reported [33]. The analyzed HRV UVA subjects were divided in to Sx group (10 individuals) and Asx group (6 individuals). Thus, actually, there were (10 + 6)*15 = 240 gene expression profiles used in this data study.

Based on the pathway activities across multiple samples, we determined the critical time point and corresponding selected pathways. Seeing Fig. 2, the number of differentially activated pathways achieved maximum at 12_th time point (i.e. 48 h after inoculation), and 13 pathways significantly changed at this time point suggest a critical point between two groups of subjects. These selected pathways in Table 1 were applied as the activity indices aiming to trace the state change of pathways across consecutive time points in each subject, and the final state of pathways was determined by the steady sate of a Boolean network corresponding to each subject. Noted, at 14 _th time point, there were also nine dys-regulated pathways observed, only two of which were also observed at 12 _th time point (i.e. Fatty acid degradation and Glycine, serine and threonine metabolism), and the remains had less significance relevant to the studied disease. Thus, the selected 13 pathways at 12_th time point should be more pathogen informative in following analysis.

Table 1 The key pathways detected in HRV dataset

Full size table

Key pathways associated with individual phenotypes

Rhinovirus (RV) leads the majority of common colds, and also causes exacerbations in patients with asthma and chronic obstructive pulmonary disease. In the detected key pathways by ABP, many signaling pathways were efficiently detected and could be productive entry pathways for HRV. By examining the effects of tiotropium on RV infection and RV infection-induced airway inflammation, RV14 titres, RNA and cytokine concentrations would be reduced, along with the reduction of the expression of intercellular adhesion molecule and the number of cellular acidic endosomes modulating airway inflammation in rhinovirus infection [34]. Influenza virus or HRV infections of the upper airway can cause colds and the flu, and they can also activate exacerbations of lower airway diseases so as to induce chronic obstructive pulmonary disease and asthma, where a systems approach has identified the temporally changing patterns of host gene expression from these viruses [35], and a transmembrane protein RNASEK is needed for the replication of HRV, influenza A virus, and dengue virus [36]. In addition, a global study has also confirmed that RSV is an important respiratory pathogen in the elderly, and preventative measures such as vaccination could decrease severe respiratory illnesses and complications in the elderly [37]. Effective drugs and modification of pre-existing drugs or regimens would be improve to increase effectiveness of antiviral therapy, Pleconaril has some activity against enteroviruses and some efficacy against rhinoviruses in ongoing trials [38], e.g. it has been suggested that quercetin inhibits RV endocytosis and replication in airway epithelial cells at multiple stages of the RV life cycle [39].

In addition, a few our detected biological processes and functions are also associated with virus infection [40,41,42], such as HRV infection, as discovered in Table 1 (Noted, the metagene id in this table will be used to label the corresponding pathway in following figures for convenience). For example, the transient receptor potential (TRP) channel family are potential candidates for sensing physical and chemical stimuli, and TRP channels +may be novel therapeutic targets for controlling virus-induced cough [43]; Amoebiasis, the condition of harbouring the protozoan parasite Entamoebahistolytica, is a major health problem throughout the world [44], and parasite virulence can implicate multiple amoebic and host factors through complex host-parasite interactions [45]; and HCV infection is able to induce autophagy and downstream UPR molecules regulating key autophagic gene expression, which just can similarly promote human rhinovirus infection via the autophagic pathway [46, 47].

Temporal module network charactering the dynamical process of individual phenotype appearance

For each subject / individual, the topological structure of his / her Boolean network is displayed in Fig. 3. Obviously, these structures can’t classify the Sx and Asx groups directly, although there were some group specific edges observed in the networks. For instance, the regulation association between “Hepatitis C” and “Influenza A” pathways, has many observations in the Sx-individual specific networks (5 in 10) rather than in the Asx-individual specific networks (0 in 6), which is represented by the edge (meta-)Gene12 - > (meta-)Gene13; or the regulation association between “Amoebiasis” and “Glycine, serine and threonine metabolism” pathways, also has some observations in the Sx-individual specific networks (4 in 10) but none in the Asx-individual specific networks (0 in 6), which is represented by the edge (meta-)Gene11 - > (meta-)Gene2. Besides, there is a cross-reactivity between hepatitis C virus and Influenza A virus reported, and the host responses to an infectious agent can be influenced by such cross-reactive memory cells [48]. Thus, the “Hepatitis C” and “Influenza A” pathways, along with their involved genes / proteins, and even their interaction could all play important roles in HRV infection through a transferable manner.

Attractor of module network indicating the pathways states corresponding to phenotypes

In addition to the aforementioned comparison and discussion about topological structures of individual specific BN, we further compared the individual specific final states of BN represented by the network attractors of networks. As illustrated in Fig. 4, there were several obvious common pathways observed in active states for Sx or Asx subjects, e.g. Hepatitis C and Influenza A, indicating the shared molecular mechanism suffering from virus infection. And most other pathways will be active or not for different subjects, but not distinguish for Sx and Asx groups separately. By contrast, integrating all pathways’ states, a simple clustering (i.e. HCL) can distinguish the Sx and Asx individuals with an accuracy larger than 80%, and would be more efficient than that based on topological structures in a parameter space, as displayed in Fig. 5a. Next, the AUC of SVM with 5-fold cross-validation had also been used to evaluate the robustness of our adopted network attractor based classification model. The results in Fig. 5b illustrated again the classification model trained from pathways’ states can achieve higher AUC than the classification model trained from pathways’ topological structures. In addition, in a bootstrap manner, by removing the samples at each time point, the attractor based model had been re-built and its performances kept well (i.e. with robustness) as shown in Fig. 5c. These results supported again the merit of this systems biology research, which provided discriminative systematical features (i.e. network features) to characterize the phenotype diversity, and also revealed the interpretable mechanisms underlying individual specific phenotypic changes. It should be noted that, although there are only 20 individuals in this data, the number of samples is actually larger than 200. And in the efficiency evaluation by hierarchical clustering or SVM, there are indeed 13 features used, which is less than the number of observations of model. Thus, the performance obtained here is not trivial. Of course, the evaluation on more individuals with multiple temporal samples should be further carried on in future work.

Replicate discovery on independent dataset

To further validate the robustness of ABP, we used another gene expression data also inoculated with live human rhinovirus (HRV) [27], and this dataset was referred to Rhinovirus Duke data. This new dataset includes serially sampled gene expression data from 19 human volunteers (subjects). Each subject has different samples in 19 time points, one time point before the viral Inoculum and the remaining 18 time points behind inoculation. That means there are actually 19*19 = 361 expression profiles here. According to a modified Jackson score computed from the self-reported clinical symptoms, the analyzed HRV Duke subjects were also divided in to Sx group (9 individuals) and Asx group (3 individuals) and others [27]. We used the same key pathways with an assumption that the pathway reaction mechanism would be similar in the same disease, and more importantly this can also validate key pathways in an independent test. From the results on this dataset, we actually observed again that Boolean network model can mimic well a dynamical process that the biological system changes from one state to another. Especially, the steady state of the network attractors can distinguish Ax and Asx individuals efficiently (Fig. 6), which was also robust on parameters than that based on network structures. Notably, in the unsupervised evaluation as clustering in Fig. 6a, the steady state of the network attractors obviously outperformed the structure of network. Meanwhile, in the supervised evaluation as classification in Fig. 6b, the steady state of the network attractors tended to have satisfactory AUC performance (e.g. > 0.8) on larger parameter scopes than the structure of network, although the median performance of the structure of network seemed to be higher possibly caused by sample unbalance.

Conclusions

In this study, we designed and implemented a computational framework ABP to investigate the dynamic of biological process for individual comparisons, especially on the network of pathways rather than genes. Based on the quantification of pathways activity (i.e. estimated activity score based on expression), the Boolean networks are reconstructed for each individual, so that the important network attractor can be obtained and applied to infer the final stable biological system state of individuals after particular phenotypic change (e.g. virus infection). Noted, some other pathway score like Pathifier has also been tested, however, the results show it is weak to obtain reasonable Boolean network, which would be caused by these scores being limited into a numeric region that would greatly affect the following network inference. Thus, the optimized combination of pathways activity estimation and Boolean network construction should be deserved to future study. Actually, both the structure and the attractor of the Boolean network can reveal indicative features (e.g. pathway associations or pathway activations) distinguishing Sx individuals or its sub-group from the Asx individuals. They would provide candidate disease identification or prediction approaches when there are enough individuals with temporal samples available [49].

Indeed, our proposed ABP has shown its efficiency and accuracy on identifying phenotype-associated pathways. These identified pathways could explain the phenotypic change in a state-transition manner, which is strongly supported by wide analysis and evaluation on multiple temporal gene expression datasets of HRV infections. A future direction is to further investigate how to build attractor-focused disease prediction model [50], and especially to release the model implementation as a web service for wide bioinformatics and biomedical study and application.

Availability of data and materials

The R codes are available at https://github.com/ztpub/Attractor-analysis-of-Boolean-network-of-Pathway.

Abbreviations

ABP:: Attractor analysis of Boolean network of Pathway
AUC:: Area Under Curve
BN:: Boolean network
CCA:: Canonical-correlation analysis
GSEA:: Gene set enrichment analysis
GSVA:: Gene Set Variation Analysis
HCL:: Hierarchical clustering
HCV:: Hepatitis C virus
HRV:: Human rhinovirus
iPANDA:: A silico Pathway Activation Network Decomposition Analysis
KEGG:: Kyoto Encyclopedia of Genes and Genomes
MRPEA:: Mendelian randomization-based pathway enrichment analysis
PAGODA:: Pathway and gene set over-dispersion analysis
RV:: Rhinovirus
SVM:: Support Vector Machine

References

Creixell P, et al. Pathway and network analysis of cancer genomes. Nat Methods. 2015;12(7):615–21.
Article CAS PubMed PubMed Central Google Scholar
Chen P, et al. Discovery of relationships between long non-coding RNAs and genes in human diseases based on tensor completion. IEEE Access. 2019;6:59152–62.
Google Scholar
Deng S, et al. Predicting hub genes associated with cervical cancer through gene co-expression networks. IEEE/ACM Trans Comput Biol Bioinform. 2016;13(1):27–35.
Article CAS PubMed Google Scholar
Alcaraz N, et al. De novo pathway-based biomarker identification. Nucleic Acids Res. 2017;45(16):e151.
Article PubMed PubMed Central CAS Google Scholar
Vafai SB, Mootha VK. Medicine. A common pathway for a rare disease? Science. 2013;342(6165):1453–4.
Article CAS PubMed Google Scholar
Yimlamai D, et al. Hippo pathway activity influences liver cell fate. Cell. 2014;157(6):1324–38.
Article CAS PubMed PubMed Central Google Scholar
Conlon P, et al. Single-cell dynamics and variability of MAPK activity in a yeast differentiation pathway. Proc Natl Acad Sci U S A. 2016;113(40):E5896–905.
Article CAS PubMed PubMed Central Google Scholar
Yu X, et al. Integrative enrichment analysis: a new computational method to detect dysregulated pathways in heterogeneous samples. BMC Genomics. 2015;16:918.
Article PubMed PubMed Central CAS Google Scholar
Cerami EG, et al. Pathway commons, a web resource for biological pathway data. Nucleic Acids Res. 2011;39(Database issue):D685–90.
Article CAS PubMed Google Scholar
Hollander D, et al. A network-based analysis of colon cancer splicing changes reveals a tumorigenesis-favoring regulatory pathway emanating from ELK1. Genome Res. 2016;26(4):541–53.
Article CAS PubMed PubMed Central Google Scholar
Deng S, et al. Identifying stages of kidney renal cell carcinoma by combining gene expression and DNA methylation data. IEEE/ACM Trans Comput Biol Bioinform. 2017;14(5):1147–53.
Article PubMed Google Scholar
Subramanian A, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005;102(43):15545–50.
Article CAS PubMed PubMed Central Google Scholar
Ihnatova I, Budinska E. ToPASeq: an R package for topology-based pathway analysis of microarray and RNA-Seq data. BMC Bioinformatics. 2015;16:350.
Article PubMed PubMed Central CAS Google Scholar
Fan J, et al. Characterizing transcriptional heterogeneity through pathway and gene set overdispersion analysis. Nat Methods. 2016;13(3):241–4.
Article CAS PubMed PubMed Central Google Scholar
Fan Q, et al. GWAS summary-based pathway analysis correcting for the genetic confounding impact of environmental exposures. Brief Bioinform. 2018;19(5):725–30.
Drier Y, et al. Pathway-based personalized analysis of cancer. Proc Natl Acad Sci U S A. 2013;110(16):6388–93.
Article CAS PubMed PubMed Central Google Scholar
Ozerov IV, et al. In silico pathway activation network decomposition analysis (iPANDA) as a method for biomarker development. Nat Commun. 2016;7:13427.
Article CAS PubMed PubMed Central Google Scholar
Palaniappan SK, et al. Abstracting the dynamics of biological pathways using information theory: a case study of apoptosis pathway. Bioinformatics. 2017;33(13):1980–6.
Article CAS PubMed Google Scholar
Zhang H, et al. DiscMLA: an efficient discriminative motif learning algorithm over high-throughput datasets. IEEE/ACM Trans Comput Biol Bioinform. 2018;15(6):1810–20.
Article CAS PubMed Google Scholar
Huang DS, Zheng C. Independent component analysis based penalized discriminant method for tumor classification using gene expression data. Bioinformatics. 2006;22(15):1855–62.
Article CAS PubMed Google Scholar
Bao W, et al. Novel human microbe-disease association prediction using network consistency projection. BMC Bioinformatics. 2017;18(S16):543.
Article PubMed PubMed Central CAS Google Scholar
Zhang Q, et al. A network-based pathway-expanding approach for pathway analysis. BMC Bioinformatics. 2016;17(Suppl 17):536.
Article PubMed PubMed Central Google Scholar
Seoane JA, et al. A pathway-based data integration framework for prediction of disease progression. Bioinformatics. 2014;30(6):838–45.
Article CAS PubMed Google Scholar
Schraiber JG, et al. Inferring evolutionary histories of pathway regulation from transcriptional profiling data. PLoS Comput Biol. 2013;9(10):e1003255.
Article PubMed PubMed Central Google Scholar
Karr JR, et al. NetworkPainter: dynamic intracellular pathway animation in Cytobank. BMC Bioinformatics. 2015;16:172.
Article PubMed PubMed Central Google Scholar
MacLean AL, et al. Parameter-free methods distinguish Wnt pathway models and guide design of experiments. Proc Natl Acad Sci U S A. 2015;112(9):2652–7.
Article CAS PubMed PubMed Central Google Scholar
Zeng T, et al. Prediction of dynamical drug sensitivity and resistance by module network rewiring-analysis based on transcriptional profiling. Drug Resist Updat. 2014;17(3):64–76.
Article PubMed Google Scholar
Hanzelmann S, Castelo R, Guinney J. GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinformatics. 2013;14:7.
Article PubMed PubMed Central Google Scholar
Hwang S. Comparison and evaluation of pathway-level aggregation methods of gene expression data. BMC Genomics. 2012;13(Suppl 7):S26.
Article PubMed PubMed Central Google Scholar
Jaakkola MK, Elo LL. Empirical comparison of structure-based pathway methods. Brief Bioinform. 2016;17(2):336–45.
Article CAS PubMed Google Scholar
Mussel C, Hopfensitz M, Kestler HA. BoolNet--an R package for generation, reconstruction and analysis of Boolean networks. Bioinformatics. 2010;26(10):1378–80.
Article PubMed CAS Google Scholar
Yu X, et al. Unravelling personalized dysfunctional gene network of complex diseases based on differential network model. J Transl Med. 2015;13:189.
Article PubMed PubMed Central Google Scholar
Liu TY, et al. An individualized predictor of health and disease using paired reference and target samples. BMC Bioinformatics. 2016;17:47.
Article PubMed PubMed Central CAS Google Scholar
Yamaya M, et al. Inhibitory effects of tiotropium on rhinovirus infection in human airway epithelial cells. Eur Respir J. 2012;40(1):122–32.
Article CAS PubMed Google Scholar
Kim TK, et al. A systems approach to understanding human rhinovirus and influenza virus infection. Virology. 2015;486:146–57.
Article CAS PubMed Google Scholar
Perreira JM, et al. RNASEK is a V-ATPase-associated factor required for endocytosis and the replication of rhinovirus, influenza a virus, and dengue virus. Cell Rep. 2015;12(5):850–63.
Article CAS PubMed Google Scholar
Falsey AR, et al. Respiratory syncytial virus and other respiratory viral infections in older adults with moderate to severe influenza-like illness. J Infect Dis. 2014;209(12):1873–81.
Article PubMed Google Scholar
Rawlinson WD. Antiviral agents for influenza, hepatitis C and herpesvirus, enterovirus and rhinovirus infections. Med J Aust. 2001;175(2):112–6.
Article CAS PubMed Google Scholar
Ganesan S, et al. Quercetin inhibits rhinovirus replication in vitro and in vivo. Antivir Res. 2012;94(3):258–71.
Article CAS PubMed Google Scholar
Wang X, et al. Visual detection of the human metapneumovirus using reverse transcription loop-mediated isothermal amplification with hydroxynaphthol blue dye. Virol J. 2012;9:138.
Article CAS PubMed PubMed Central Google Scholar
Venkatesan A, Sharma R, Dasgupta A. Cell cycle regulation of hepatitis C and encephalomyocarditis virus internal ribosome entry site-mediated translation in human embryonic kidney 293 cells. Virus Res. 2003;94(2):85–95.
Article CAS PubMed Google Scholar
Fabian P, et al. An optimized method to detect influenza virus and human rhinovirus from exhaled breath and the airborne environment. J Environ Monit. 2009;11(2):314–7.
Article CAS PubMed Google Scholar
Abdullah H, et al. Rhinovirus upregulates transient receptor potential channels in a human neuronal cell line: implications for respiratory virus-induced cough reflex sensitivity. Thorax. 2014;69(1):46–54.
Article CAS PubMed Google Scholar
Chayen A, Avron B. Entamoeba histolytica, antigens and amoebiasis. Curr Top Microbiol Immunol. 1985;120:19–41.
CAS PubMed Google Scholar
Faust DM, Guillen N. Virulence and virulence factors in Entamoeba histolytica, the agent of human amoebiasis. Microbes Infect. 2012;14(15):1428–41.
Article PubMed Google Scholar
Wu Q, et al. Interleukin-1 receptor-associated kinase M (IRAK-M) promotes human rhinovirus infection in lung epithelial cells via the autophagic pathway. Virology. 2013;446(1–2):199–206.
Article CAS PubMed Google Scholar
Wang J, et al. Hepatitis C virus core protein activates autophagy through EIF2AK3 and ATF6 UPR pathway-mediated MAP 1LC3B and ATG12 expression. Autophagy. 2014;10(5):766–84.
Article CAS PubMed PubMed Central Google Scholar
Wedemeyer H, et al. Cross-reactivity between hepatitis C virus and influenza a virus determinant-specific cytotoxic T cells. J Virol. 2001;75(23):11392–400.
Article CAS PubMed PubMed Central Google Scholar
Yu XT, Zeng T. Integrative analysis of Omics big data. Methods Mol Biol. 2018;1754:109–35.
Article CAS PubMed Google Scholar
Zeng T, et al. Big-data-based edge biomarkers: study on dynamical drug sensitivity and resistance in individuals. Brief Bioinform. 2016;17(4):576–92.
Article PubMed Google Scholar

Download references

Acknowledgments

Not applicable.

About this supplement

This article has been published as part of BMC Bioinformatics Volume 20 Supplement 25, 2019: Proceedings of the 2018 International Conference on Intelligent Computing (ICIC 2018) and Intelligent Computing and Biomedical Informatics (ICBI) 2018 conference: bioinformatics. The full contents of the supplement are available online at https://bmcbioinformatics.biomedcentral.com/articles/supplements/volume-20-supplement-25.

Funding

This paper was supported by the National Natural Science Foundation of China (Nos. 11901272, 61803360, 11871456). Publication costs are funded by National Natural Science Foundation of China (No. 11901272).

Author information

Shaoyan Sun and Xiangtian Yu contributed equally to this work.

Authors and Affiliations

School of Mathematics and Statistics Science, Ludong University, Yantai, 264025, China
Shaoyan Sun
Shanghai Jiao Tong University Affiliated Sixth People’s Hospital, Shanghai, 200233, China
Xiangtian Yu
Key Laboratory of Systems Biology, Institute of Biochemistry and Cell Biology, Chinese Academy Science, Shanghai, 200031, China
Xiangtian Yu, Ying Tang, Juan Zhao & Tao Zeng
Medical Laboratory, Yantaishan Hospital, Yantai, 264001, China
Fengnan Sun
Shanghai Research Center for Brain Science and Brain-Inspired Intelligence, Shanghai, 201210, China
Tao Zeng

Authors

Shaoyan Sun
View author publications
You can also search for this author in PubMed Google Scholar
Xiangtian Yu
View author publications
You can also search for this author in PubMed Google Scholar
Fengnan Sun
View author publications
You can also search for this author in PubMed Google Scholar
Ying Tang
View author publications
You can also search for this author in PubMed Google Scholar
Juan Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Tao Zeng
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

SS and XY performed the data analysis and wrote the manuscript; SS and TZ designed this study and revised the manuscript; FS, YT and JZ contributed to data interpretation. All authors have read and approved the final manuscript.

Corresponding authors

Correspondence to Shaoyan Sun or Tao Zeng.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Sun, S., Yu, X., Sun, F. et al. Dynamically characterizing individual clinical change by the steady state of disease-associated pathway. BMC Bioinformatics 20 (Suppl 25), 697 (2019). https://doi.org/10.1186/s12859-019-3271-x

Download citation

Published: 24 December 2019
DOI: https://doi.org/10.1186/s12859-019-3271-x

Proceedings of the 2018 International Conference on Intelligent Computing (ICIC 2018) and Intelligent Computing and Biomedical Informatics (ICBI) 2018 conference: bioinformatics

Dynamically characterizing individual clinical change by the steady state of disease-associated pathway

Abstract

Background

Results

Conclusions

Introduction

Methods

The calculation of pathway activity scores

The rank of dys-regulated pathways

The selection of key pathways during dynamical biological process

The re-construction of Boolean network among key pathways

The biological state clustering based on attractors of temporal network of key pathways

Results and discussion

Instruction of datasets and experimental steps

Key pathways associated with individual phenotypes

Temporal module network charactering the dynamical process of individual phenotype appearance

Attractor of module network indicating the pathways states corresponding to phenotypes

Replicate discovery on independent dataset

Conclusions

Availability of data and materials

Abbreviations

References

Acknowledgments

About this supplement

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

BMC Bioinformatics

Contact us