 Research
 Open Access
 Published:
Predicting synthetic lethal interactions in human cancers using graph regularized selfrepresentative matrix factorization
BMC Bioinformatics volume 20, Article number: 657 (2019)
Abstract
Background
Synthetic lethality has attracted a lot of attentions in cancer therapeutics due to its utility in identifying new anticancer drug targets. Identifying synthetic lethal (SL) interactions is the key step towards the exploration of synthetic lethality in cancer treatment. However, biological experiments are faced with many challenges when identifying synthetic lethal interactions. Thus, it is necessary to develop computational methods which could serve as useful complements to biological experiments.
Results
In this paper, we propose a novel graph regularized selfrepresentative matrix factorization (GRSMF) algorithm for synthetic lethal interaction prediction. GRSMF first learns the selfrepresentations from the known SL interactions and further integrates the functional similarities among genes derived from Gene Ontology (GO). It can then effectively predict potential SL interactions by leveraging the information provided by known SL interactions and functional annotations of genes. Extensive experiments on the synthetic lethal interaction data downloaded from SynLethDB database demonstrate the superiority of our GRSMF in predicting potential synthetic lethal interactions, compared with other competing methods. Moreover, case studies of novel interactions are conducted in this paper for further evaluating the effectiveness of GRSMF in synthetic lethal interaction prediction.
Conclusions
In this paper, we demonstrate that by adaptively exploiting the selfrepresentation of original SL interaction data, and utilizing functional similarities among genes to enhance the learning of selfrepresentation matrix, our GRSMF could predict potential SL interactions more accurately than other stateoftheart SL interaction prediction methods.
Background
Cancers are complex diseases that caused by the defects of multiple genes. Exploring the genetic interactions within cancer cells is important for understanding the mechanisms of cancers. Synthetic lethality, which is a kind of genetic interaction, has attracted a lot of attentions in cancer therapeutics due to its utility in identifying new anticancer drug targets [1, 2]. Synthetic lethality arises between two genes if the combination of deficiencies in the expression of these two genes causes cell death, whereas a deficiency in only one of these two genes will not affect the cell viability [3–6]. Thus, targeting a nonessential gene that has a synthetic lethal (SL) interaction with a tumourspecific mutated gene would be an effective cancer therapy [7], because only tumour cells which harbour this mutation would be killed. In recent years, highthroughput wetlab screenings such as chemical libraries [8], pooled RNA interference [9, 10] and CRISPRbased genome editing technology [11, 12] have been conducted for searching SL interactions. However, due to the limitations of wetlab screenings such as high cost, offtarget effects and unclear mechanisms [2], efficient computational methods are needed to serve as useful complements for wetlab screenings.
Recently, various computational algorithms have been proposed to predict SL interactions [13]. According to the principle of the model, existing methods can be roughly classified into three categories: knowledgebased methods, supervised machine learning methods and matrix factorization methods. Knowledgebased methods utilize the knowledge or hypotheses about SL interactions to predict potential SL interactions. For example, based on the assumption that SL interactions tend to take place between genes that are coexpressed, JerbyArnon et al. [4] developed a method named DAISY to predict SL interactions from short hairpin RNA (shRNA), somatic copy number alternation (SCNA) and gene expression profiles. Similarly, Sinha et al. [14] proposed a method named Mining Synthetic Lethals (MiSL) to identify mutationspecific SL interactions for specific cancers from pancancer human tumour data. However, knowledgebased methods rely heavily on the knowledge of other genomic data, and do not exploit the underlying mechanisms of known SL interactions [2]. Supervised machine learning methods utilize existing SL interactions to build up classification models which could be used to predict novel SL interactions. Based on available SL interactions of yeast, various classification models such as maximum likelihood estimation (MLE) [15], support vector machines (SVM) [15] and ensemble classifiers [16], have been developed for predicting SL interactions. Traditional supervised machine learning methods require both positive and negative training data being available for learning. However, for SL interaction prediction, there are only positive data, and no negative data is available. Matrix factorization methods have become popular for link prediction in recent years due to their utilities in capturing the underlying mechanisms of observed links and incorporating extra relevant information. For instance, Liu et al. [2] formulated SL interaction prediction in human cancer as a logistic matrix factorization problem and assigned higher importance weights for validated SL interaction pairs than unknown pairs. Furthermore, to promote the accuracy of predicted results, they incorporated proteinprotein interaction (PPI) similarity and Gene Ontology (GO) similarity into their model. However, the performance of matrix factorization methods depend on the choice of the dimensionality of the latent space which is usually previously unknown and hard to determine.
To address the above problems, in this paper, we introduce a novel graph regularized selfrepresentative matrix factorization (GRSMF) model for SL interaction prediction. Based on known SL interactions, our method focuses on learning a representation matrix from the original input data, which could capture the similarities between genes based on their SL interaction partners. Moreover, GRSMF also draws support from the function similarities among genes that derived from Gene Ontology (GO) annotations to enhance the prediction accuracy. Experiment results on SynLethDB dataset demonstrate that compared with other competing interaction prediction methods, our GRSMF model could achieve more accurate prediction results. Furthermore, case studies of predicting novel SL interactions also demonstrate the effectiveness of GRSMF in predicting SL interactions in human cancer.
Results
In this section, we demonstrate the performance of GRSMF on SynLethDB database [17]. We perform the sensitivity analysis for parameters in GRSMF to show their impact on prediction performance. Furthermore, we conduct case studies to show the top SL pairs identified by our method.
Experimental data
We test the performance of GRSMF and compare it with existing methods on SynLethDB database. We also use the GO similarity matrix as graph regularization term in GRSMF.
SynLethDB
Currently, SynLethDB is the most comprehensive database for human SL pairs. It collects SL pairs of human and other four model species from four different sources: (1) biochemical experiments, (2) related databases (Synlethality [18], Decipher https://decipher.sanger.ac.uk/, GenomeRNAi http://www.genomernai.org/[19], BioGRID https://thebiogrid.org/ [20]), (3) text mining [17] and (4) computational predicted method DAISY [4]. After removing the duplicate human SL pairs, we obtained 19,667 human SL interaction pairs involving 6,375 genes.
GO similarity
As demonstrated in [2], functional similarity among genes based on their GO annotations can promote the performance for SL interaction prediction. Therefore, we also utilize the GO similarities among genes for SL interaction prediction in our GRSMF method. We obtain the GO similarity matrix by using the same method presented in [21] and similarly we only consider the biological process (BP) terms in GO.
Experimental setting
In our experiments, we compare our proposed GRSMF against two existing methods, namely SL^{2}MF [2] and BLMNII [22]. We choose these two methods since they are the latest stateoftheart SL interaction prediction methods [2]. We also test the performance of GRSMF without the graph regularization and we denote this variant as SMF. These 3 methods are summarized as follows.

SL^{2}MF predicts the SL pairs based on logistic matrix factorization and it trains the model by assigning higher importance weights for known SL pairs than unknown pairs. The parameters c, λ and α are set to 50, 0.01 and 1.0 respectively. In addition, the number of nearest neighbors in GO similarity graph k_{1} is set to 150.

BLMNII was originally designed for drugtarget interaction prediction. It was applied for SL prediction in [2] and thus we also implemented it for comparison in this study. In BLMNII, we set the value of the linear combination weight in the range of {0, 0.1, 0.2, …, 1.0}.

SMF is a variant of GRSMF without the graph regularization. The parameter λ in SMF is set to 2^{−7}.
For GRSMF, it has two parameters λ and β and we adopt grid search to select the optimal values for them from the range { 2^{−8},2^{−7},…,2^{4},2^{5}}. In particular, λ and β are set to 2^{−7} and 2^{−5} respectively.
In addition, we adopt 5fold crossvalidation in our experiments for performance evaluation and comparison. The known SL pairs are equally split into 5 nonoverlapping subsets. We iteratively use 1 subset for testing and the remaining for training in 5fold cross validation and use the AUC score (i.e., area under the ROC curve) as our evaluation metric.
Experimental results based on 5fold cross validation
Figure 1 shows the performance of various methods on SynLethDB. As shown in Fig. 1, GRSMF achieves an AUC of 0.923 and significantly outperforms BLMNII (0.735), SL^{2}MF (0.847) and SMF (0.893). In particular, we can also have the following two observations based on the comparison in Fig. 1.
First, SMF (i.e., without graph regularization using GO similarity matrix) outperforms SL^{2}MF significantly, showing that selfrepresentative matrix factorization is very effective for SL prediction. Unlike SL^{2}MF that projects genes into a latent space and predict the SL interaction probability between two genes based on the inner product of their latent vectors, SMF adaptively learns the intrasimilarities among genes from the observed SL interactions, according to representation rule introduced in Eq. (1). Thus, SMF has dataadaptiveness and avoids the determination of some sensitive parameters such as the dimension of latent space. Second, graph regularization using GO similarity matrix can boost the prediction performance by 3% from 0.893 (SMF) to 0.923 (GRSMF), demonstrating that functional information of genes is very important for SL interaction prediction. This is also consistent with the results reported in [2]. For these two reasons, GRSMF achieves the highest performance and shows its effectiveness and superiority for SL prediction.
We analyze the running time of various methods. All the experiments are conducted on a laptop with 3.6 GHz CPU and 32G memory. The results are shown in Table 1. We can find from this table that our GRSMF takes the longest running time. However, predicting potential SL interactions based on computational methods is usually used as an offline process, where producing an accurate and reliable prediction result is the primary goal. Therefore, our method is still meaningful.
Effect of parameters
Recall that GRSMF has two parameters, namely, λ and β to control the influence of l_{2} regularization and graph regularization respectively. In this section, we study the impact of these parameters on GRSMF’s prediction performance.
In Fig. 2, we first fix β as 2 ^{−5} and set λ to different values. We can observe that the AUC scores of GRSMF under different λ values are quite close, indicating that our GRSMF model is not very sensitive to the parameter λ.
In Fig. 3, we fix λ as 2 ^{−7} and then investigate the impact of graph regularization using GO similarity. As shown in Fig. 3, large values of β usually lead to poor performance while small values (e.g., 2 ^{−6} and 2 ^{−5}) result in very good performance of GRSMF. Based on the results in Fig. 3, our overall conclusion is that SMF in GRSMF plays a critical role for SL interaction prediction, while graph regularization can further help to improve the prediction performance.
Case studies
We further take all the known SL pairs in SynLethDB as training data and apply GRSMF to predict novel SL pairs. In particular, we rank the unknown pairs in X in descending order based on their interaction scores predicted by GRSMF. Thus, the topranked pairs are more likely to be potential SL pairs. In Table 2, we show the top10 SL pairs as well as their scores predicted by GRSMF. Here, the score of each SL pair is obtained from the predicted label marix \(\hat {X}\), as described in Algorithm 1.
The gene XRCC2 is involved in the repair of DNA doublestrand breaks by homologous recombination. The gene IGHMBP2 also has the functions including DNA binding, DNA recombination, DNA repair and DNA replication. XRCC2 and IGHMBP2 (i.e., 5^{th} pair) have backup functions like DNA repair [23] and thus they have high likelihood to be a SL pair. In addition, MAPK12 is associated with breast cancer, while XRCC2 is part of BCDX2 complex, which acts downstream of BRCA2 recruitment and upstream of RAD51 recruitment [24]. It also makes sense that XRCC2 and MAPK12 (i.e., 4^{th} pair) is predicted as a potential SL pair.
It is wellknown that poly ADPribose polymerases (PARPs) form SL interactions with the two breast cancer genes BRCA1 and BRCA2 [25]. As shown in Table 2, there are 5 predicted SL pairs involving PARP2. In particular, the gene PMS2 interacts with both BRCA1 and BRCA2 [20], and GRSMF predicts that PARP2 and PMS2 have SL interactions (i.e., 6^{th} pair in Table 2).
Discussion
Synthetic lethality is a new angle for cancer therapeutics. Computational methods have been proposed to predict potential SL interactions, which can greatly reduce the costs of biological experiments. In this paper, we present a novel model, named graph regularized selfrepresentative matrix factorization (GRSMF) algorithm, to identify potential SL interactions among genes. Our model focuses on selfrepresentative matrix factorization and also integrates GO similarity matrix as graph regularization. Compared with previous matrix factorization models, we do not need to determine the dimensionality of the latent space and directly learn the similarities among genes based on observed SL interactions. Experiment results on SynLethDB database demonstrate that our GRSMF achieves better performance than other competing methods. Case studies on our predicted novel SL pairs show that our model can effectively identify some candidate SL pairs for further verification.
Conclusions
In conclusion, revealing the molecular mechanisms underlying cancers is essential to the treatment of cancers and development of new anticancer drugs. Predicting potential SL interactions via computational approaches not only helps to improve our understanding of the mechanisms underlying cancers, but also provides a effective way to aided anticancer therapies. In this study, we provide an efficient model to predict potential SL interactions. In contrast to most existing matrix factorization models, our model avoids the determination of some sensitive hyperparameters, which makes it easy to implement. Our model provides a promising strategy to predict potential SL interactions for further experimental verification, and contributes to the treatment of cancers. In the future, we plan to integrate biological knowledge such as pathways, protein domains, TCGA data, etc., to further improve our GRSMF model.
Methods
In this section, we first describe the notations and formulate the problem, then we propose a novel graph regularized selfrepresentative matrix factorization model and introduce a relaxed MajorizationMinimization algorithm to solve the optimization problem.
Notations and problem statement
In this paper, a set of genes is denoted by \(G=\{g_{i}\}_{i=1}^{n}\), where n is the number of genes. A binary association matrix \(X = [X_{ij}] \in {0, 1}^{n\times n}\) is used to describe the SL interactions among genes in G. If there exists a validated SL interaction between genes g_{i} and g_{j}, X_{ij} is set to 1; otherwise, X_{ij} is set to 0. Here, gene pair (g_{i},g_{j}) with X_{ij}=0 is referred to as “unknown pair”, since there is no clear evidence to demonstrate whether there is an SL interaction between genes g_{i} and g_{j} or not. Note that gene pairs (g_{i},g_{j}) and (g_{j},g_{i}) are treated as the same pair, and we set X_{ii}=0 for i=1,…,n. Therefore, X is a symmetric matrix with X_{ij}=X_{ji}.
Given a set of observed SL interactions, the problem of SL interaction prediction is to identify a set of gene pairs that are most likely to have SL interaction from the “unknown pairs”. This task can be achieved by ranking the candidate gene pairs according to the predicted SL propensities in descending orders, and selecting the topranked gene pairs as potential SL interactions. In this study, we propose a regularized selfrepresentative matrix factorization model to identify the SL interactions among genes in G. Furthermore, based on the functional similarities between genes, we also incorporate a neighborhood regularization in our model to enhance the accuracy of prediction.
Selfrepresentative matrix factorization model
The proposed model is developed based on selfrepresentative matrix factorization, which has been successfully used in subspace clustering [26]. In traditional selfrepresentative matrix factorization models, the objective is to exploit the representation of the original data X in which the data itself is treated as a dictionary, i.e., the input data X is selfrepresented by a linear combination of its columns as X≈XU, where U∈R^{n×n} denotes the coefficient matrix which is a meaningful representation of the columns of X. In this study, since the input data X is a symmetric matrix, the representations of its rows should be same as the representations of its columns. Thus, we propose a new model in which X is selfrepresented by linear combinations of its rows and columns as X≈U^{T}XU. For the ith gene g_{i}, U_{li} can be used to denote the probability of gene g_{i} being represented by gene g_{l}, which captures the similarity between genes g_{i} and g_{l} based on their SL interactions with other genes. To guarantee the probability property of U_{·i} (which denotes the ith column of U), we introduce constraints 0≤U_{li}≤1 and \(\sum _{l=1}^{n} U_{li} =1\) for i=1,…,n. To avoid the trivial solution that only one element in U_{·i} has value 1 while all the other elements being zeros, we also impose a constraint on U_{·i}. In particular, we choose the l_{2} norm \(\U_{\cdot i}\_{2}^{2} = \sum _{l=1}^{n} U_{li}^{2}\) due to its simplicity and effectiveness.
Hence, by taking into account the above constraints, we have the following regularized selfrepresentative matrix factorization model
where ∥·∥_{F} is the Frobenius norm and λ is a tuning parameter which controls the influence of the l_{2} regularization.
Graph regularization
In the above objective function (1), the representation matrix U is learned from the original data matrix X, which makes it sensitive to the input data X. If the input data only covers very few known SL interactions (which means most of the elements in X are zeros), it may be hard to learn a comprehensive representation matrix. Therefore, we would like to incorporate some prior information that can reflect the similarities among genes into our model. In particular, based on the assumption that genes with similar functions tend to have similar presentations, we take into account the Gene Ontology (GO) semantic similarities among genes to promote the determination of representation matrix and improve the prediction performance. Let S∈R^{n×n} denote the GO similarity matrix where S_{ij} describes the functional similarity between genes g_{i} and g_{j}. The value of S_{ij} is ranging from 0 to 1, where the larger the value of S_{ij}, the more similar the corresponding two genes. In this study, we adopt the method presented in [21] to calculate the Gene Ontology (GO) semantic similarities among genes. The graph regularization based on S is defined as follows
where Tr(·) denotes the trace of a matrix, D is a diagonal matrix with \(D_{ii} = \sum _{j=1}^{n} S_{ij}\) and L=D−S.
Graph regularized selfrepresentative matrix factorization model
By incorporating the above graph regularization term (2) into Eq. (1), the final objective optimization function of our graph regularized selfrepresentative matrix factorization (GRSMF) model is formulated as follows.
where the parameter β controls the effect of graph regularization.
Optimization algorithm
In this study, we solve the objective function (3) based on relaxed MajorizationMinimization [27] method. Particularly, the objective function in Eq. (3) is denoted as \(\mathcal {J}\) and ▽_{U} denotes the gradient of our objective function with respect to U.
Let \(\bigtriangledown ^{+}_{U} = 4(XU)(U^{T} X U) + 2 \lambda U+2 \beta (D U)\) and \(\bigtriangledown ^{}_{U}=4(XU) X + 2 \beta S U\) denote the positive and negative parts of ▽_{U} respectively. Then we have \(\bigtriangledown _{U} =\bigtriangledown _{U}^{+}  \bigtriangledown _{U}^{}\).
Due to the constraint \(\sum \limits _{l=1}^{n} U_{li} =1\) and 0≤U_{li}≤1, we obtain the following updating rule for U_{li}:
where a_{i} and b_{i} can be obtained by Eqs. 6 and 7 respectively.
The details of the optimization algorithm are described in Algorithm 1. U can be updated by Eq. (5). In this study, we stop the iteration until the relative change of objective function \(\mathcal {J}\) is less than 1e−4, i.e., \(\frac {\\mathcal {J}^{(t+1)}  \mathcal {J}^{(t)}\_{1}}{\\mathcal {J}^{(t)}\_{1}} < 1e4\), where \(\mathcal {J}^{(t)}\) denotes the value of objective function at tth iteration. Finally, the predicted label matrix \(\hat {X}\) can be computed by \({\hat {X} = U^{T} X U}\) when the algorithm arrives at the convergence conditions.
Availability of data and materials
All the experimental results and code can be downloaded from https://github.com/OylCityU/GRSMF.
Abbreviations
 AUC:

Area under the curve
 BLMNII:

Bipartite local model with neighborbased interactionprofile inferring
 BP:

Biological process
 GO:

Gene ontology
 GRSMF:

Graph regularized selfrepresentative matrix factorization
 MLE:

Maximum likelihood estimation
 PPI:

Proteinprotein interaction
 ROC:

Receiver operating characteristic
 SL:

Synthetic lethal
 SMF:

Selfrepresentative matrix factorization
References
 1
O’Neil NJ, Bailey ML, Hieter P. Synthetic lethality and cancer. Nat Rev Genet. 2017; 18(10):613.
 2
Liu Y, Wu M, Liu C, Li X, Zheng J. Sl 2 mf: Predicting synthetic lethality in human cancers via logistic matrix factorization. IEEE/ACM Trans Comput Biol Bioinforma. 2019:1–1. https://doi.org/10.1109/TCBB.2019.2909908.
 3
Hartwell LH, Szankasi P, Roberts CJ, Murray AW, Friend SH. Integrating genetic approaches into the discovery of anticancer drugs. Science. 1997; 278(5340):1064–8.
 4
JerbyArnon L, Pfetzer N, Waldman YY, McGarry L, James D, Shanks E, SeashoreLudlow B, Weinstock A, Geiger T, Clemons PA, et al.Predicting cancerspecific vulnerability via datadriven detection of synthetic lethality. Cell. 2014; 158(5):1199–209.
 5
Boone C, Bussey H, Andrews BJ. Exploring genetic interactions and networks with yeast. Nat Rev Genet. 2007; 8(6):437.
 6
Hartman JL, Garvik B, Hartwell L. Principles for the buffering of genetic variation. Science. 2001; 291(5506):1001–4.
 7
Nijman SM. Synthetic lethality: general principles, utility and detection using genetic screens in human cells. FEBS Lett. 2011; 585(1):1–6.
 8
Simons A, Dafni N, Dotan I, Oron Y, Canaani D. Establishment of a chemical synthetic lethality screen in cultured human cells. Genome Res. 2001; 11(2):266–73.
 9
Turner NC, Lord CJ, Iorns E, Brough R, Swift S, Elliott R, Rayter S, Tutt AN, Ashworth A. A synthetic lethal sirna screen identifying genes mediating sensitivity to a parp inhibitor. EMBO J. 2008; 27(9):1368–1377.
 10
Luo J, Emanuele MJ, Li D, Creighton CJ, Schlabach MR, Westbrook TF, Wong K. K., Elledge SJ. A genomewide rnai screen identifies multiple synthetic lethal interactions with the ras oncogene. Cell. 2009; 137(5):835–848.
 11
Du D, Roguev A, Gordon DE, Chen M, Chen SH, Shales M, Shen JP, Ideker T, Mali P, Qi LS, et al.Genetic interaction mapping in mammalian cells using crispr interference. Nat Methods. 2017; 14(6):577.
 12
Han K, Jeng EE, Hess GT, Morgens DW, Li A, Bassik MC. Synergistic drug combinations for cancer identified in a crispr screen for pairwise genetic interactions. Nat Biotechnol. 2017; 35(5):463.
 13
Folger O, Jerby L, Frezza C, Gottlieb E, Ruppin E, Shlomi T. Predicting selective drug targets in cancer through metabolic networks. Mol Syst Biol. 2011; 7(1):501.
 14
Sinha S, Thomas D, Chan S, Gao Y, Brunen D, Torabi D, Reinisch A, Hernandez D, Chan A, Rankin EB, et al.Systematic discovery of mutationspecific synthetic lethals by mining pancancer human primary tumor data. Nat Commun. 2017; 8:15580.
 15
Li B, Cao W, Zhou J, Luo F. Understanding and predicting synthetic lethal genetic interactions in saccharomyces cerevisiae using domain genetic interactions. BMC Syst Biol. 2011; 5(1):73.
 16
Wu M, Li X, Zhang F, Li X, Kwoh CK, Zheng J. In silico prediction of synthetic lethality by metaanalysis of genetic interactions, functions, and pathways in yeast and human cancer. Cancer Informat. 2014; 13:14026.
 17
Guo J, Liu H, Zheng J. Synlethdb: synthetic lethality database toward discovery of selective and sensitive anticancer drug targets. Nucleic Acids Res. 2015; 44(D1):1011–7.
 18
Li Xj, Mishra SK, Wu M, Zhang F, Zheng J. Synlethality: an integrative knowledge base of synthetic lethality towards discovery of selective anticancer therapies. BioMed Res Int. 2014; 2014. https://doi.org/10.1109/bibm.2013.6732609.
 19
Schmidt EE, Pelz O, Buhlmann S, Kerr G, Horn T, Boutros M. Genomernai: a database for cellbased and in vivo rnai phenotypes, 2013 update. Nucleic Acids Res. 2012; 41(D1):1021–6.
 20
ChatrAryamontri A, Oughtred R, Boucher L, Rust J, Chang C, Kolas NK, O’Donnell L, Oster S, Theesfeld C, Sellam A, et al.The biogrid interaction database: 2017 update. Nucleic Acids Res. 2017; 45(D1):369–79.
 21
Wang JZ, Du Z, Payattakool R, Yu PS, Chen CF. A new method to measure the semantic similarity of go terms. Bioinformatics. 2007; 23(10):1274–81.
 22
Mei JP, Kwoh CK, Yang P, Li XL, Zheng J. Drug–target interaction prediction by learning from local information and neighbors. Bioinformatics. 2012; 29(2):238–45.
 23
Ohali A, Avigad S, Zaizov R, Ophir R, HornSaban S, Cohen IJ, Meller I, Kollender Y, Issakov J, Yaniv I. Prediction of high risk ewing’s sarcoma by gene expression profiling. Oncogene. 2004; 23(55):8997.
 24
Haricharan S, Bainbridge MN, Scheet P, Brown PH. Somatic mutation load of estrogen receptorpositive breast tumors predicts overall survival: an analysis of genome sequence data. Breast Cancer Res Treat. 2014; 146(1):211–20.
 25
Chan DA, Giaccia AJ. Harnessing synthetic lethal interactions in anticancer drug discovery. Nat Rev Drug Disc. 2011; 10(5):351.
 26
Ma S, Zhang L, Hu W, Zhang Y, Wu J, Li X, et al.Selfrepresentative manifold concept factorization with adaptive neighbors for clustering. In: IJCAI: 2018. p. 2539–45. https://doi.org/10.24963/ijcai.2018/352.
 27
Yang Z, Oja E. Unified development of multiplicative algorithms for linear and quadratic nonnegative matrix factorization. IEEE Trans Neural Netw. 2011; 22(12):1878–91.
Acknowledgements
Not Applicable.
About this supplement
This article has been published as part of BMC Bioinformatics, Volume 20 Supplement 19, 2019: 18th International Conference on Bioinformatics. The full contents of the supplement are available at https://bmcbioinformatics.biomedcentral.com/articles/supplements/volume20supplement19.
Funding
The publication of this supplement was funded by the National Natural Science Foundation of China (61602309, 61871272 and 61575125), Guangdong Basic and Applied Basic Research Foundation [2019A1515011384], Shenzhen Fundamental Research Program under Grant JCYJ20170817095210760 and JCYJ20170302154328155.
Author information
Affiliations
Contributions
JH, MW and LOY conceived and designed the study, performed the statistical analysis and drafted the manuscript. ZZ conceived of the study, and participated in its design and coordination and helped to draft the manuscript. FL participated in the design of the study, performed the statistical analysis and helped to revise the manuscript. All authors read and approved the final manuscript.
Corresponding authors
Correspondence to Le OuYang or Zexuan Zhu.
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
About this article
Cite this article
Huang, J., Wu, M., Lu, F. et al. Predicting synthetic lethal interactions in human cancers using graph regularized selfrepresentative matrix factorization. BMC Bioinformatics 20, 657 (2019). https://doi.org/10.1186/s1285901931973
Received:
Accepted:
Published:
Keywords
 Synthetic lethality
 Graph regularization
 Matrix factorization