- Software
- Open access
- Published:
TreeQNet: a webserver for Treatment evaluation with Quantified Network
BMC Bioinformatics volume 23, Article number: 473 (2022)
Abstract
Background
Personalized therapy has been at the forefront of cancer care, making cancer treatment more effective. Since cancer patients respond individually to drug therapy, predicting the sensitivity of each patient to specific drugs is very helpful to apply therapeutic agents. Traditional methods focus on node (molecular) information but ignore relevant interactions among different nodes, which has very limited application in complex situations, such as cancer drug responses in real clinical practice.
Results
Treatment evaluation with Quantified Network (TreeQNet) is a webserver which could predict sensitivity to drugs for patients through the innovative use of proteomic and phosphoproteomic network from tumor tissues.
Conclusion
TreeQNet service: http://bioinfo.ustc.edu.cn/. TreeQNet source code: https://github.com/Really00/treeqnet-web-front/.
Background
Cancer is one of the most common diseases threatening human health in the world. Drug treatment is most closely related to the specific origin of cancer and the location of gene mutation. Driven by the development of integrated omics technology, it becomes possible that select the most effective drug for each individual patient [1]. The main challenge of personalized medicine is to identify biomarkers for different cancers. Over the last decade, several methods have been developed to infer the potential relationships between cell line and drug [2]. Existing methods mainly rely on regression, classification or multi-kernel learning to predict drug response, but these methods are only based on single-molecule or static networks. Because the genetic backgrounds of within-class samples are heterogeneous and biochemical reactions are stochastic [3], single molecule biomarkers identified in differentially expressed molecules are limited in many cases. On the other hand, biological functions are often achieved through a set of collaborative molecules or a network of interacting molecules. Genes perform their functions by interacting with other molecules, and these interactions can be abstracted as edges of biomolecular networks. Therefore, it makes sense to introduce edge-based biomarkers [4]. Signaling pathways, protein complexes and sub-networks have greater discriminatory power than individual genes in distinguishing disease phenotypes [5]. In summary, a differentially expressed interaction or network can provide more details about human pathogenic states than a traditional single differentially expressed gene.
In a biological system, it is the interactions (regulations) or edges among molecules rather than single molecules that facilitate a biological function or signal transduction involved in diseases. Complex diseases are generally resulted from the failure of relevant systems instead of single molecules, which should be investigated in a dynamic and network manner. Thus, it is important to identify network signatures that are associated with complex diseases for early diagnosis and clinical prognosis. Edge biomarker represents one of the network signatures that are sensitive to network perturbation in diseases. Specifically, the “node” represents each single molecule, and the “edge” indicates the interaction or correlation between each pair of genes or molecules in the biological network. The “edge biomarker” is a more sensitive type of biomarker or signature that could reflect the perturbed networks in disease treatment. Cancer, as a complex disease caused by related systems, is more appropriate to use dynamic and intermolecular networks [6].
So if effective edge markers can be identified from the omics data may add new insights to cancer drug prediction based on the EdgeBiomarker algorithm proposed by Zhang et al. [4]. Combining the results of first-line drug trials in colorectal cancer using the miniPDX model [7], we innovatively use the proteomic and phosphoproteomic information of colorectal cancer patients to extract the kinase-substrate edges with the best classification ability as edge markers and construct a predictive model for patient sensitivity to three drugs, Afatinib, Gefitinib and Regorafenib [8]. Here, we develop TreeQNet, a web-based service for use by a broad range of researchers who hope to assist patients with their medication at the clinical level. TreeQNet service not only provides a new perspective for the study of colorectal cancer, but also for the study of other cancers.
Methods and implementation
Web server architecture
The TreeQNet web server is implemented using Model-View-View Model (MVVM) architecture model, which separates the user interface from the operation logic to provide users with a better experience. Javascript and Hypertext Markup Language (HTML) are the main languages for web services. The front-end web is based on HTML and Cascading Style Sheets (CSS). View layer is constructed based on Vue framework 2.5 which makes the page render and respond very quickly. Vue achieves efficient bidirectional data binding and flexible component system through MVVM idea, and provides a flexible file upload configuration. We develop Kexpress framework 1.0 based on express at the back-end.
Functionally, the server includes edge feature selection and drug sensitivity prediction. Model algorithm implemented with R seamlessly integrates with Kexpress framework shown in Fig. 1. The built-in data in the data layer includes the node data needed to predict the patients’ sensitivity to three drugs, edge data needed to predict sensitivity to three drugs, sensitivity results to three drugs in a miniPDX model from colorectal cancer patients, classification markers for colorectal cancer patients (1 indicates primary tumor, 2 indicates metastatic tumor). The upload data in the data layer is composed of protein expression data and phosphorylation site expression data uploaded by the users. In protein expression data file, the first column is the protein gene name of the patient to be predicted (no duplicate values) and the second column is the protein expression value of the patient to be predicted (after normalization, no missing values). This is similar for the phosphorylation site expression data file. Both files must be in csv format. Data processing mainly includes screening differentially correlated gene pairs (DCPs) by Pearson correlation coefficient (PCC) from 31 samples in the database as well as through the EdgeBiomarker algorithm for converting node data to edge data. In the process of EdgeBiomarker algorithm, the training set and testing set are divided by randomly sampling the dataset 100 times. Then the parameter alpha of the elastic network is searched in fixed steps by cross-validation method. Next, the parameter lambda corresponding to each alpha is then calculated and the best lambda is chosen for each alpha. Finally, the mean square error (MSE) is calculated and the features corresponding to the smallest MSE are then chosen as the edge biomarker. Model construction consists of converting all 31 samples in the database and the sample to be diagnosed into edge data, filtering edge features by the built-in kinase-substrate edge features in the system, further screening the edge features by correlation of the edge features and drug sensitivity as well as constructing an elastic network from these 31 processed samples. Ultimately, drug sensitivity prediction constructs a prediction model for the sensitivity of patients to three drugs, Afatinib, Regorafenib, and Gefitinib, which can realize the prediction of drug sensitivity in new patients.
TreeQNet workflow
TreeQNet provides guidance on drug selection for individual patient, creating opportunities for personalized treatments of patients. TreeQNet can predict drug sensitivity based on kinase-substrate network and the whole algorithm flow is shown in Fig. 2. We first used EdgeBiomarker algorithm to construct kinase-substrate networks. After that, the edge strength between the kinase and the phosphorylated substrate is calculated based on PhosphoSitePlus [9] or NetworKIN 3.0 [10]. The edge transformations are made for the kinase and phosphorylated substrate data and the calculation method is shown in formula 1.
where \(x_{u, j, k}\) represents the expression value of the u-th kinase of the j-th sample in the k-th state. \(x_{v, k, k}\) is the expression value of the vth substrate of the jth sample in the kth state, k is the state of the sample, set 1 as the primary tumor, and set 2 as the metastatic tumor. \(\mu _{u, k}\) is the mean expression value of kinase u. \(\mu _{v, k}\) is the mean expression value of substrate v. \(\sigma _{u, k}\) is the standard deviation of the expression value of kinase u. \(\sigma _{v, k}\) is the standard deviation of the expression value of substrate v.
These edge features are stored in the system as built-in data. Subsequently, we predict drug sensitivity based on elastic net regression model. Elastic net use 1696 edge strength features. Users can upload protein expression values and phosphorylation site expression values of patient tumor tissue and predict tumor cell growth inhibitions (TCGIs) for afatinib, gefitinib, and regorafenib.
Example case
To help users quickly get start with TreeQNet and show the usefulness of this tool, we provide 13 example cases for user reference. Users can download sample data in the database by “Download Example Data” button. The sample data contain protein expression value and phosphorylation site expression value data of 13 CRC patients. We measured the drug response effects of each tumor for each drug by tumor cell growth inhibition (TCGI). As shown in Fig. 3, the predicted results are obtained: Afatinib R-squared 0.75, p value 0.002, Gefitinib R-squared 0.61, p value 0.026 and Regorafenib R-squared 0.75, p value 0.003. The observed TCGI ratio varied a lot due to the limited number of samples. The prediction model to some extent could alleviate the influence of outliers and thus the predicted TCGIs showed narrower scales than the observed ones. While, the high correlation between predicted and observed ones suggested the good performance of our prediction model. In addition, two external colorectal cancer samples were used for testing (Fig. 3), other tumor and drug models will continue to be added and enhanced. Additional detailed user guides are available in the Additional file 1.
Conclusion
TreeQNet is a user-friendly web server for treatment evaluation of cancer drugs that facilitates the precise treatment and evaluation for cancer patients and effectively selects the most suitable targeted therapy for patients without druggable mutations. For researchers, TreeQNet assists them to predict drug response through network analysis combined with proteomic and phosphoproteomic data. In addition, TreeQNet can help researchers establish an accurate index to determine the suitable drugs for a given tumor type. TreeQNet server accessible to users via web browsers is convenient for researchers to process data quickly, and provides a visual preview of the results.
Availability of data and materials
TreeQNet web interface is freely available at http://bioinfo.ustc.edu.cn/. The source code can be found at https://github.com/Really00/treeqnet-web-front/.
Abbreviations
- TreeQNet:
-
Treatment evaluation with Quantified Network
- MVVM:
-
Model-view-view model
- HTML:
-
Hypertext markup language
- TCGIs:
-
Tumor cell growth inhibitions
- CRC:
-
Colorectal cancer
References
Eisenstein M. Personalized medicine: special treatment. Nature. 2014;513(7517):8–9.
Geeleher P, Cox NJ, Huang RS. Clinical drug response can be predicted using baseline gene expression levels and in vitro drug sensitivity in cell lines. Genome Biol. 2014;15(3):1–12.
Raser JM, O’shea EK. Noise in gene expression: origins, consequences, and control. Science. 2005;309(5743):2010–3.
Zhang W, Zeng T, Chen L. Edgemarker: identifying differentially correlated molecule pairs as edge-biomarkers. J Theor Biol. 2014;362:35–43.
Zeng T, Zhang W, Yu X, Liu X, Li M, Liu R, Chen L. Edge biomarkers for classification and prediction of phenotypes. Sci China Life Sci. 2014;57(11):1103–14.
Hood L, Flores M. A personal view on systems medicine and the emergence of proactive p4 medicine: predictive, preventive, personalized and participatory. New Biotechnol. 2012;29(6):613–24.
Zhang F, Wang W, Long Y, Liu H, Cheng J, Guo L, Li R, Meng C, Yu S, Zhao Q, et al. Characterization of drug responses of mini patient-derived xenografts in mice for predicting cancer patient clinical therapeutic response. Cancer Commun. 2018;38(1):1–12.
Li C, Sun Y-D, Yu G-Y, Cui J-R, Lou Z, Zhang H, Huang Y, Bai C-G, Deng L-L, Liu P, et al. Integrated omics of metastatic colorectal cancer. Cancer Cell. 2020;38(5):734–47.
Hornbeck PV, Zhang B, Murray B, Kornhauser JM, Latham V, Skrzypek E. Phosphositeplus, 2014: mutations, ptms and recalibrations. Nucleic Acids Res. 2015;43(D1):512–20.
Horn H, Schoof EM, Kim J, Robin X, Miller ML, Diella F, Palma A, Cesareni G, Jensen LJ, Linding R. Kinomexplorer: an integrated platform for kinome biology studies. Nat Methods. 2014;11(6):603–4.
Acknowledgements
Not applicable.
Funding
This work has been supported by the National Key Technologies R &D Program [2017YFA0505502] and the Strategic Priority Research Program of the Chinese Academy of Sciences (CAS) (XDB38000000). The funders had no role in the design of the study and collection, analysis, and interpretation of data, decision to publish, or preparation of the manuscript.
Author information
Authors and Affiliations
Contributions
RZ, HZ, and JW contributed to the conception and design of the study, theoretical framework, the interpretation and discussion of the findings. ZL, YH and YS wrote and tested the code of TreeQNet. ZL wrote the drafting of the manuscript. ZL and YH supported the development of TreeQNet, and contributed to drafting and critical revision of the manuscript. QL, YS, and CL curate data and test the web functionality. All authors have read and approved the final manuscript.
Corresponding authors
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Consent for publication
Not applicable.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Additional file 1.
Introduction to TreeQNet: The usage of the TreeQNet software is described. The format of the input data (including protein expression data and phosphorylation site expression data) is described. Relevant built-in data from the software is listed. We also outline the main calculation method and the results of typical run.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Li, Z., Huang, Y., Li, Q. et al. TreeQNet: a webserver for Treatment evaluation with Quantified Network. BMC Bioinformatics 23, 473 (2022). https://doi.org/10.1186/s12859-022-05024-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12859-022-05024-y