Performance of epistasis detection methods in semi-simulated GWAS

Chatelain, Clément; Durand, Guillermo; Thuillier, Vincent; Augé, Franck

doi:10.1186/s12859-018-2229-8

Research article
Open access
Published: 18 June 2018

Performance of epistasis detection methods in semi-simulated GWAS

Clément Chatelain ORCID: orcid.org/0000-0001-7265-6410¹,
Guillermo Durand²,
Vincent Thuillier³ &
…
Franck Augé¹

BMC Bioinformatics volume 19, Article number: 231 (2018) Cite this article

3416 Accesses
13 Citations
1 Altmetric
Metrics details

Abstract

Background

Part of the missing heritability in Genome Wide Association Studies (GWAS) is expected to be explained by interactions between genetic variants, also called epistasis. Various statistical methods have been developed to detect epistasis in case-control GWAS. These methods face major statistical challenges due to the number of tests required, the complexity of the Linkage Disequilibrium (LD) structure, and the lack of consensus regarding the definition of epistasis. Their limited impact in terms of uncovering new biological knowledge might be explained in part by the limited amount of experimental data available to validate their statistical performances in a realistic GWAS context. In this paper, we introduce a simulation pipeline for generating real scale GWAS data, including epistasis and realistic LD structure. We evaluate five exhaustive bivariate interaction methods, fastepi, GBOOST, SHEsisEpi, DSS, and IndOR. Two hundred thirty four different disease scenarios are considered in extensive simulations. We report the performances of each method in terms of false positive rate control, power, area under the ROC curve (AUC), and computation time using a GPU. Finally we compare the result of each methods on a real GWAS of type 2 diabetes from the Welcome Trust Case Control Consortium.

Results

GBOOST, SHEsisEpi and DSS allow a satisfactory control of the false positive rate. fastepi and IndOR present an increase in false positive rate in presence of LD between causal SNPs, with our definition of epistasis. DSS performs best in terms of power and AUC in most scenarios with no or weak LD between causal SNPs. All methods can exhaustively analyze a GWAS with 6.10⁵ SNPs and 15,000 samples in a couple of hours using a GPU.

Conclusion

This study confirms that computation time is no longer a limiting factor for performing an exhaustive search of epistasis in large GWAS. For this task, using DSS on SNP pairs with limited LD seems to be a good strategy to achieve the best statistical performance. A combination approach using both DSS and GBOOST is supported by the simulation results and the analysis of the WTCCC dataset demonstrated that this approach can detect distinct genes in epistasis. Finally, weak epistasis between common variants will be detectable with existing methods when GWAS of a few tens of thousands cases and controls are available.

Backgrounds

During the past decade, Genome-Wide Association Studies (GWAS) have focused on individual Single Nucleotide Polymorphisms (SNPs), looking at variants exhibiting independent, additive and cumulative effects on phenotypes. To date thousands of SNPs have been associated with diseases and other complex traits [1], but in most cases those variants independently explain only a small fraction of the estimated heritability [2, 3]. For example, in Crohn’s Disease, cumulative additive effects explain 10.6% of the variability while the estimated heritability is 53% in Type-2 diabetes, 4.7% for an estimated heritability of 26% and in Lupus, 6.6% for an estimated heritability of 44% [4]. The correct estimation of this missing heritability is still a subject of research interest [5] which may be due to a limitation of this additive model, confirming geneticist’s hypothesis that most phenotypes are not only driven by genetic variants acting independently, but that other phenomenons have to be taken into account including, but not limited to: epigenetics, environment interactions, and genetic interactions between loci (epistasis) [6]. Zuk et al. [7] evaluated that up to 80% of the missing heritability could be due to epistasis in some diseases. Genome Wide scan for epistasis are therefore seen as potentially a very fruitful approach to better understand the genetics of these diseases and to identify new therapeutic strategies.

A vast number of methods for detecting epistasis have been developed in recent years, ranging from exhaustive bivariate tests to machine learning methods (for recent reviews see [8, 9]). Exhaustive bivariate methods (referred to as bivariate methods hereafter) test all pairs of SNPs in a case control GWAS for epistasis that we define as departure from additivity on a logit scale [10]. These methods are adapted to a full analysis of GWAS data, because require less computation than other methods such as random forests and they might be less impacted by high dimensionality. Moreover bivariate methods do not introduce prior biological knowledge, such as network methods, which may limit the discovery of new biology. Finally, we note that epistasis definitions are not always equivalent to other types of approaches, such as Random Forest. The present paper focuses therefore on a selection of bivariate methods. The performance of other types of methods will be evaluated in future work. Bivariate methods face major technical challenges due to the vast number of combinations to be tested even for pairwise analysis (billions to trillions for a typical GWAS): controlling the number of false positive while maintaining sufficient power and ensuring computing time remains acceptable.

The direct evaluation of the statistical performance of these methods on real GWAS is not possible, as true epistatic interactions underlying complex diseases are still largely unknown. In contrast the power and false positive rate of these methods can be easily evaluated in simulated GWAS where the phenotype-genotype relationship is predefined. Evaluations using different types of simulations have been performed. Wang et al. [11] and Frost el al. [12] compared their methods on simulated datasets with independant SNPs. Emily [13], Goudey et al. [14], and Yu et al. [15] compared their methods to PLINK, χ², and BOOST on simulated contingency tables with two SNPs including linkage disequilibrium (LD). Wan et al. [16] compared BOOST and PLINK under H₀ using evolutionary simulations [17] to generate datasets of 38836 SNPs with simulated LD. Other studies also evaluated the performance of Gene-based bivariate methods [18, 19] or interaction tests for quantitative traits [20] in simulated GWAS. Some of these endeavours have in particular highlighted the influence of LD on the control of the type 1 error rates (see [20] for instance). However, the previous simulations present generally two important simplifications compared to real GWAS: (i) the absence of realistic, population specific LD pattern between simulated SNPs and (ii) the limited number of SNPs compared to a real GWAS. LD presents a complex structure at the genome-wide scale and accurate evaluation of the false positive rate of exhaustive bivariate methods would therefore require simulation of such realistic data. Moreover, most comparisons includes few scenarios, less than three methods and are performed in the context of papers introducing a new method to highlight its performances. There is therefore a lack of independent and exhaustive comparisons of these methods.

Several approaches have been implemented to simulate case control GWAS on a genome wide scale with realistic LD using a reference panel: HAPGEN2 [21], GenomeSIMLA [17], GWASIMULATOR [22], and waffect [23]. For instance Spencer et al. [24] evaluated the power of the univariate χ² test with various SNP chips on genome wide HAPGEN simulations. Perduca et al. [23] evaluated the power of univariate tests in PLINK on waffect simulations including more than 10⁵ SNPs. However, no study evaluating the performance of epistatic detection softwares with these approaches has been reported, even if all three approaches have the capacity to simulate epistatic interactions. Because GWASIMULATOR is not able to simulate epistasic interactions between SNPs on a same chromosome, and therefore in LD, we introduce a new simulation pipeline combining the approaches of GWASIMULATOR [22] and waffect [23].

The objective of the present work is to provide an evaluation of the performance of a set of representative epistasis detection methods in simulated GWAS with features close to a real GWAS, both in terms of size and LD pattern. Five methods are selected due to their ability to scale for exhaustive genome wide bivariate analysis with a GPu implementation, their performances in previous studies, their popularity, and their representation of different approaches (LD, regression, Haplotype, ROC curve). Fastepi [25] is implemented in the software PLINK (option –fast-epistasis) and used as a fast method to test for interactions. It is based on a 2×2 contingency table of allele counts and tests a SNP pair for epistasis by comparing their LD in cases and controls. SHEsisEpi [26] is another LD based method based on a 3×3 contingency table. Both methods use a χ² statistics with one degree of freedom and may therefore be more powerful than other approaches. IndOR [13] is also based on the correlation between two SNPs in cases and controls, inspired by a biological definition of epistasis, the effect of one gene masking the effect of another. GBOOST [27] compares two regression models with or without an interaction term using a likelihood ratio test to detect epistasis. GBOOST corresponds to Ficher’s definition of epistasis and has been used as benchmark method in several studies as discussed above. Finally DSS [14] is a model free approach based on the ROC curve to test the improvement of the discriminative power when using both SNPs together or independently. For each method the following performances are evaluated: power, false positive rate control, and computational performance. This article presents firstly a simulation pipeline for generating semi-simulated GWAS with realistic LD and epistasis, and then the performance comparison of the five methods. Finally the results of each method applied to a GWAS of Type 2 Diabetes (T2D) from the Welcome Trust Case Control Consortium [28] are presented.

Methods

Three-step GWAS simulation

We consider a set of template genotypes $\boldsymbol {X}_{i}\in \mathbb {R}^{p}$, i∈{1,..,n}, with $X_{ij_{c}}\in \{0,1,2\}$ representing for sample i the number of minor alleles at locus j_c of chromosome c∈{1,..,C}, with p_c the number of loci on chromosome c and $p={\sum \nolimits }_{c=1}^{C} p_{c}$ the total number of loci.

For step one a population of m individuals (m≫n) with genotype reproducing the LD structure of the template genotypes is simulated following the method of Li et al. [22]: for each simulated genotype k∈{1,..,m} and for each chromosome c, (i) a start locus d_c is selected uniformly at random, (ii) a (2l+1)-SNP haplotype [d_c−l,d_c+l]∈{0,1,2}^2l+1 is sampled uniformly from the template genotypes, (iii) the right part of the chromosome is generated by choosing the allele at locus d_c+i randomly among template genotypes corresponding to the simulated haplotype at loci [d_c−2l+i,d_c−1+i] for l<i≤p_c−d_c, (iv) similarly the left part of the chromosome is generated by choosing the alleles at loci d_c−i given the simulated haplotype at [d_c+1−i,d_c+2l−i] for l<i<d_c. In the following we use the European (EUR) samples from the 1000 Genomes Project phase 3 [29] as template genotypes and 129,238 SNP markers selected from the Affymetrix 500K chip to generate a population of 100,000 female samples. Markers with low MAF (< 0.05) or not in Equilibrium of Hardy Weinberg (p<10⁻³) in the template genotypes were excluded. This step allows the accurate replication of the LD structure of the EUR group for an accurate evaluation of the false positive rate in the present benchmark.

For step two we consider a set I of SNPs causal for a disease D, with a disease probability p_k=P(D|G_k) given a genotype G_k described with a logit model:

$$ \log\left(\frac{p_{k}}{1-p_{k}}\right) = \alpha + \underbrace{\sum\limits_{x\in I} \boldsymbol{\beta}_{x} \boldsymbol{1}_{x_{k}}}_{\textrm{main effect}} + \underbrace{\sum\limits_{(x,y)\in I^{2}} \boldsymbol{1}_{x_{k}}^{\mathsf{T}} \boldsymbol{\beta}_{x,y} \boldsymbol{1}_{y_{k}}}_{\textrm{$2^{nd}$ order epistasis}} $$

(1)

with α the intercept, $\boldsymbol {\beta }_{x}\in \mathbb {R}^{2}$ the main effect parameter vector for SNP x, $\boldsymbol {\beta }_{x,y} \in \mathbb {R}^{2 \times 2}$ the matrix of interaction coefficients between SNPs x and y, and $\boldsymbol {1}_{x_{k}} \in \{0,1\}^{2}$ the indicator vector for the value of SNP x in sample k. In the present work we restrict SNP interactions to the second order and to disease models with two causal SNPs (a,b). However the simulation procedure can be easily generalized to higher order interactions by adding the corresponding terms in Eq. 1. Each disease scenario is defined by the following parameters: (i) prevalence K=0.15, (ii) MAF f_a and f_b of each causal SNP, (iii) LD r² between SNP a and b, (iv) marginal risk ratios R_a=(r_a,r_a)^T and R_b=(r_b,r_b)^T (only dominant models were considered), and (v) epistasis model matrix ρ parametrized as departure from product relative risk, with a risk ratio given by $\boldsymbol {R}_{a}^{\mathsf {T}} \boldsymbol {\rho } \boldsymbol {R}_{b}$. The disease prevalence K=0.15 represents a high estimate for a complex disease such as diabetes (estimated K=0.12 according to the American Diabetes Association) or NAFLD (estimated K=0.13 [30]). The disease model is constructed as follows: (i) the SNP pair best satisfying the MAF and LD constraints is selected through an exhaustive genome-wide search, (ii) the disease model (Eq. 1) is solved numerically to satisfy the prevalence and risk ratio constraints. The 234 scenarios considered are summarized in Table 1.

Table 1 Two hundred thirty four disease scenarios considered in the simulations

Full size table

For step three n₁+n₀ cases and controls are selected at random without replacement from the m individuals with probability p_gwas=n₁p_k/(mK)+n₀(1−p_k)/(m−mK). Cases and controls are then affected using the backward sampling method waffect introduced by Perduca et al. [23]. This step is repeated for each disease scenario replicate. In the present work n₀=n₁=1000 and 500 replicates are generated per scenario. The number of replicates were chosen as a compromise between statistical accuracy and computation time, 2 months on a 30 Tesla K40 GPU cluster. The cohort size corresponds to a medium size GWAS and is sufficient to observe dominant-dominant epistasis effect with parameter ρ≈4 with a reasonable power. The relationship between the cohort size and the statistical power of each method for various scenarios is presented in the “Results” section.

Methods benchmark

The following bivariate methods are selected for comparison: SHEsisEpi [26], fastepi [25], IndOR [13], DSS [14], and GBOOST [27]. All analyses are performed on Nvidia Tesla K40 graphic cards. We use the GPU implementation provided by their respective authors for GBOOST and DSS, and a GPU implementation in the GWISFI platform [31] for the other methods. The 2χ² test [32] included in the GWISFI distribution is excluded from the analysis because the related score diverges for contingency tables with at least one empty cell. Numerical issues can occur as well for IndOR when inverting the covariance matrix V_Φ (see [13] for definition). This issue is treated by affecting a zero score to SNP pairs with non invertible V_Φ. The information gain [33] and epiblaster [34] methods implemented in the GWISFI distribution require computing empirical p-value and are not included in the benchmark.

For each replication of each scenario, all 5 methods are applied to detect the SNP pair in epistasis in the resulting GWAS. Each method outputs a list of SNP pairs ranked by their respective epistasis score which were converted to p-value using the asymptotic score distribution corresponding to each method. For a given p-value threshold p₀ the True Positive Rate (TPR) for a given method and scenario is defined at a SNP level as the probability that the causal pair (a,b) has an epistasis p-value p<p₀, and the False Positive Rate (FPR) as the mean number of non-causal pairs with epistasis p-value p<p₀ normalized by the number of SNP pairs. Grouping SNPs by LD blocks is a usual post-processing step in GWAS univariate analysis [35] and can be extended to the case of bivariate analysis. The LD blocks are defined a priori on the whole simulated population of size m using the plink options –blocks no-small-max-span –blocks-max-kb 500, which is based on the haplotype block definition of Haploview [36] suggested by Gabriel et al. [37]. SNPs that are not affected to a block via this procedure are affected to a single-SNP LD block. The whole set of SNPs map to 68819 LD blocks. The SNP pairs passing the epistasis p-value threshold are mapped to LD block pairs, with the causal LD block pair defined as the unique block pair (A,B) such that (a,b)∈(A,B) with (a,b) the causal SNP pair. TPR (resp. FPR) is defined at LD block level as the detection rate of the causal block pair (resp. the mean detection rate of the non causal block pairs).

The effect of the disease parameters on statistical power is evaluated by performing a canonical correlation analysis between (i) the power of the methods for each scenarios (normalized matrix P_ij for method i and scenarios j) and (ii) the value of the parameters for each scenarios (normalized matrix V_kj for parameter k and scenario j).

The effect of cohort size and MAF on statistical power is assessed in a separate set of simulations that consider only two SNPs. The disease penetrance P(D|G_k) for each 9 genotypes G_k∈{0,1,2}² is computed as in step two of the previous three step simulation (Eq. 1). The genotypes of the n₀ controls and n₁ cases are simulated using Bayes formula P(G_k|D)=P(D|G_k)P(G_k)/K, with the genotype frequency P(G_k) controlled by f_a=f_b=f and r². For the simulations with varying cohort size we consider f=0.3, r²∈{0,0.2}, dominant-dominant epistasis models (model M₁ in Table 1) with main effect (r_a,r_b)∈{(1,1),(1.5,1),(1.5,1.5)}, and GWAS with n₀=n₁=n, n∈{500,1000,2500,5000,10000}. For the simulation with varying MAF we consider n=1000. For a given scenario and epistasis parameter ρ we compute the statistical power in 1000 replicates with a p-value threshold p=0.05/(1.25×10¹⁰), corresponding to a threshold p=0.05 with a Bonferroni correction for a bivariate analysis of a GWAS with 5×10⁵ SNPs. For each scenario and method the epistasis parameter ρ giving a power 0.8 is identified using Brent’s algorithm [38]. For this set of simulations GBOOST and DSS were reimplemented in Python using the information provided by the authors in [16, 39] respectively for computational performances reasons.

Bivariate analysis of the WTCCC T2D

We applied each method on the WTCCC GWAS data on T2D [28], genotyped on the Affymetrix 5.0 platform. The standard Quality Control (QC) procedure suggested by the WTCCC is applied to the dataset except that SNPs with missing data are excluded because the bivariate methods implemented in the GWIS platform do not handle missing data. 363387 SNPs, 1953 cases and 2978 controls remain after QC. For each method interacting SNPs with FDR<0.05 are selected. Univariate association was performed using plink with the –fisher option. LD blocks are defined as in the simulations using the plink command –blocks no-small-max-span –blocks-max-kb 500 and SNPs are mapped to LD blocks in a n-1 relation. Each block is associated to a gene if the block region overlap with the gene region augmented by a 10kb margin. Consistently with the simulations two genes (A,B) are defined in epistasis if it exists two snps (a,b) detected in significant epistasis such that a∈A and b∈B).

Results

Type 1 error rate

The score distributions of each method in several M₀ simulations are represented in Fig. 1. In the absence of marginal effect the tail of the score distribution of all five methods is well described by their asymptotic distribution, even in presence of a realistic LD structure. We note that for high scores (> 50) the FPR of IndOR is underestimated by its asymptotic distribution. In the presence of a main effect on two SNPs in LD an important inflation of the far tail distribution is observed for IndOR and fastepi. The FPR for M₁ simulations with no main effect, i.e. the ratio of non-causal pairs detected, is depicted in Fig. 2 for a p-value threshold of 0.05 after Bonferroni correction and present a similar behaviour: a good, slightly conservative, control of the FPR is observed for DSS, GBOOST and SHEsisEpi, while a two- to five-folds FPR increase is observed for IndOR and fastepi in presence of two SNPs in LD and in epistasis. Similar conclusions can be made in the M₂, M₃ and M₄ models (Additional file 1: Figures). These findings suggest that grouping SNPs by LD blocks could improve the FPR control of IndOR and fastepi. The FPR at a SNP and LD block level are compared in Fig. 3 in the absence of epistasis and with main effect. In presence of SNPs in LD the FPR of IndOR and fastepi is only improved by one-fold at block level compared to the SNP level.

These observations indicate that analysis using IndOR and fastepi should focus on interactions between SNPs that are not in LD to ensure a good control of the false positive rate (under the definition of epistasis we assume in this work). Moreover caution should be taken when interpreting IndOR p-values for high scores. As mentioned in the “Methods” section, computing the IndOR score requires an inversion of the estimation of a variance-covariance matrix V_Φ (see [13] for definitions), which can be singular with a non zero probability for finite cohort sizes. In the authors R implementation and in the present work, this issue is treated by affecting zero to SNP pairs with non invertible V_Φ. However, given the number of tests performed in exhaustive bivariate analysis of GWAS, such a situation occurs at an important rate among SNPs with high scores and a deviation from the asymptotic $\chi ^{2}_{4}$ distribution in the far tail can be expected. This situation highlights the importance of evaluating the performance of statistical methods on GWAS simulations of real size to identify finite size effects affecting method performances.

Statistical power

The statistical power of each method is evaluated with a Bonferroni corrected p-value threshold of 0.05 as in the previous section. The relative power of each method is reported in Fig. 4: fastepi is the most powerful method in 61 scenarios, IndOR in 45, DSS in 38, GBOOST in 5, and SHEsisEpi in 5. Among methods with a good FDR control DSS is the most powerful in most scenarios (46 vs 41 for SHEsisEpi). However fastepi was the only method with a non null measured power for scenarios with strong LD r²=0.5. Interestingly DSS is the most powerful method in almost all scenarios with no LD. GBOOST was the most powerful method in 5 scenarios with no LD, while none of the remaining method were the most powerful in any of the scenarios. The influence of the disease parameters on the methods power is illustrated through a canonical correlation analysis in Fig. 5, where we focus on the two first components. Component 1 is anti-correlated to r², whereas component 2 is positively correlated to f and ρ. β_a and β_b are not correlated to components 1 or 2. The power of DSS and fastepi are similarly correlated to f and ρ (component 1) but the power of DSS is anticorrelated to r², whereas the power of fastepi is positively correlated to r² (as seen on Fig. 4). Conversely, the power of GBOOST and SHEsisEpi is not correlated to component 1 or component 2. The variations of power of GBOOST appear very similar to SHEsisEpi and the opposite to fastepi. This analysis also suggests as well that the variation of power of IndOR and DSS on one hand, and GBOOST, SHEsisEpi on the other are only weakly correlated. This observation is consistent across all models M₁- M₄ considered here (the separated canonical correlation analysis for each model are reported in Additional file 1). LD between the causal SNPs was the parameter most correlated to power variations, while the SNPs main effect coefficients r_a, r_b were only weakly correlated. fastepi is the only method that shows a clear increase of power with LD. Figure 6 depicts the difference of power at SNP and LD block level for each method and scenario (one point per method-scenario). Analyses located on the first diagonal provided equal power at both the SNP and block level, while those in the top left have a gain in power when performing the analysis at SNP level. With no LD (r²=0) the difference of power at SNP and LD block level is limited for all methods. For intermediate LD (r²=0.2) a strong increase of power at LD block level was measured for all methods in almost all scenarios. For strong LD (r²=0.5) the estimated power of DSS, GBOOST and IndOR was null for all scenarios at SNP and LD block level, the estimated power of SHEsisEpi is null for all scenarios at SNP level and increases up to 0.86 at LD block level, whilst fastepi presented similar power at both level.

For studies focusing on interaction between SNPs that are not in LD our results indicate that DSS will be the most powerful method, while maintaining good control of the FPR. GBOOST appears as to be a good complementary solution to DSS.

Impact of cohort size

Table 2 summarizes for cohort sizes ranging from n=n₀=n₁=500 to n=10,000 the smallest epistasis effects ρ_0.8 detectable at a power of 0.8 for each method, for dominant-dominant epistasis. For n=500 no method can detect epistasis effect ρ<5 between common variants (f>0.3), while for n=5000 IndOR, DSS, GBOOST and SHEsisEpi can detect epistasis effect ρ<2. DSS is the most powerful method in all simulations, except for small cohort sizes (n≤1000), small MAF (f=0.15), and one SNP with no main effect (r_a=1), were GBOOST is the most powerful. Results for all models and for LD values in {0.,0.2,0.5} are reported in the Additional file 1.

Table 2 Smallest epistasis effect ρ detectable with a power 0.8 for each method for various cohort size

Full size table

Impact of MAF

Figure 7 depicts the influence of MAF on the smallest epistasis effect ρ_0.8 detectable with a power of 0.8. Results are given for each method, and for the 4 disease models M₁ to M₄. DSS is the most powerful method in all scenarios considered, except for f<0.2 in the dominant-dominant (M₁) model and for f<0.3 in the multiplicative (M₃) model, where GBOOST is slightly more powerful. The good performance of GBOOST in the multiplicative model can be explained by the simulated model corresponding to the full logistic regression model used in the likelihood ratio test underlying GBOOST. For models M₁ and M3 all methods except DSS reach maximum power between f=0.15 and f=0.35 with a significant decrease in power in the vicinity of f=0.5. In M₁ models epistasis effect ρ<5 can be detected by DSS for f>0.125, by GBOOST for 0.125<f<0.3, by SHEsisEpi for 0.175<f<0.25, and by IndOR for 0.3<f<0.4. Fastepi is not able to detect epistasis effects ρ<10 and reachs a maximum power at f=0.175.

AUC comparison.

To have a global view of how the methods perform we compute the Receiver Operating Characteristic (ROC) curves, defined by the true positive detection rate of the causal pair of SNPs vs the false positive rate for various p-value thresholds. The area under the ROC curve (AUC) is represented for each method and each scenario in Fig. 8. This approach directly evaluates the capacity of each method to separate causal SNPs pairs, independent of the method used to compute the p-value. As found previously DSS, presents the highest AUC for most scenarios with no LD. fastepi is the only method to efficiently identify causal SNP pairs with high LD values (r²=0.5). IndOR has the lowest AUC in all scenarios. Similar conclusions are found at a block level.

Computational performance

Fastepi and SHEsisEpi are the two fastest methods (31s and 33s respectively), DSS has a intermediate speed (148s), and IndOR and GBOOST are the slowest methods (270s and 248s respectively). Table 3 reports measured and extrapolated computation time for 4 common SNP chips with up to 2.5M SNPs (Illumina Omni2.5-8) [40] and for cohort sizes up to 5 million (today GIANT is the largest GWAS identified and includes more than 300,000 patients [41]). As computation can be easily parallelized we can extrapolate that a giant GWAS with 5 million samples sequenced with an Omni2.5-8 Chip can be exhaustively analyzed in less than 20 days with DSS using a 30 Tesla K40 GPU cluster.

Table 3 Measured and extrapolated computation time of each method for various commercially available SNP chips and total number of samples in the GWAS

Full size table

Application to WTCCC T2D

Table 4 report the number of SNP pairs with FDR<0.05 for each method and the consensus between each two methods (off diagonal elements). IndOR detected 446 pairs in interactions but as observed above one has to consider that the current implementation might not allow for an accurate control of the false positive. GBOOST detected 230 pairs, DSS 119, fastepi 9, and SHEsisEpi none. The lowest number of SNPs detected by fastepi is coherent with the lowest power of the method observed in the simulations. While there is a relative large overlap between the pairs detected by GBOOST and IndOR, DSS detected a completely distinct set of pairs from the other methods.

Table 4 Number of SNP pairs with FDR<0.05 for each method (diagonal) and overlap between the pairs detected between two methods (off diagonal)

Full size table

Table 5 represents the overlap between the SNPs detected by epistasis tests and those detected by univariate association tests. The majority of SNPs detected in epistasis are not detected by univariate test (388 pairs over 654 have no detected main effect). IndOR is the only method to detect epistasis between SNPs having both main effects. We notice that in this situation our simulations indicate that the type 1 error rate might be incorrectly controlled by IndOR. None of the SNPs detected by DSS are detected by univariate test. Finally the majority of SNP pairs detected by gboost presents a main effect on one of the two SNPs.

Table 5 For each method number of SNP pairs (a,b) detected in interaction (FDR<0.05) such that (first line) both a and b are identified by univariate association test (FDR<0.05), (second line) one of a or b is identified, and (last line) neither a or b is identified

Full size table

These observations confirm that GBOOST and DSS detects different types of epistasis and could be combined for a potential increase in power.

Figure 9 presents the distribution of MAF for the SNPs detected by each method. For DSS the distribution of MAF is uniform between 0.1 and 0.5 which is coherent with the results of the simulations. The pairs detected by gboost and IndOR are present a larger proportion of common SNPs with MAF greater than 0.3. This observation is coherent with the results of the simulations in non-dominant models (M₂- M₄). Figure 10 depicts the distribution of LD of the SNP pairs detected by each method. Most SNP pairs are in linkage equilibrium of in weak LD (r²=0.05 to 0.01). GBOOST is the only method to detect a larger proportion of SNP pairs in weak LD.

The type of genomic regions including SNPs with interactions are reported in Table 6. IndOR is the only method to detect more interaction between protein coding regions. This observation might indicates that IndOR tends to detect interactions corresponding to a biological definition of epistasis as claimed by its authors [13]. GBOOST and DSS detect mostly epistasis between SNPs in intergenic regions. Among the 211 genes detected by the three methods DSS, GBOOST and IndOR only 2 are detected by univariate tests (Fig. 11).The Genes detected by DSS are distinct from those detected by other methods (10% of overlap only), which is consistent with the observation at SNP level. Only two genes are detected by univariate association tests. Interestingly each method detects distinct gene-gene interactions (see the epistasis networks in the Additional file 1).

Table 6 Number of SNP pair (a,b) identified on each type of locus pair for each method

Full size table

Discussion

Ten years after the first large scale GWAS from the Welcome Trust Case Control Consortium (WTCCC) [28] many results are available about the influence of single variants on various phenotypes. More than 35000 unique SNP-trait associations with p-values < 10⁻⁸ are reported in GWAS catalog [42]. Meta-analysis and replication studies [43] have identified and validated many susceptibility loci, for instance in type 2 diabetes [44], Alzheimer’s disease [45], cardiometabolic diseases [46], and ovarian cancer [47]. A large panel of tools are available to interpret the association at a biological level, these include fine mapping of GWAS signal, genome annotation, SNP prioritization tools, and gene set analysis tools such as DEPICT [41, 48]. Conversely results regarding genetic interactions, expected to explain an important proportion of the heritability of complex diseases [7], are far less established. In particular, accurate control of the power and type 1 error rate of the statistical tests of interaction in real GWAS has still to be improved. Indeed, recent studies on the WTCCC datasets have identified that significant differences exist in terms of stability and results overlap between epistasis detection methods [14, 49].

The objective of this work was to evaluate the performances of epistasis detection methods in quasi real GWAS conditions, where true causal interactions are known. We introduced a new GWAS simulation pipeline that combines the advantages of previous simulations approaches: generation of a large set of genetic samples with realistic LD [22], and simulation of a realistic GWAS cohort that can take into account disease models with epistasis between SNPs in LD using backward sampling [23]. We evaluated five epistasis detection methods using this pipeline: SHEsisEpi [26], fastepi [25], IndOR [13], DSS [14], and GBOOST [27]. In total 234 different disease models were considered when generating GWAS, with 1000 cases and controls and 129,238 SNPs reproducing the LD structure of the EUR group of the 1000 Genomes Project [29].

Our results indicate that using the asymptotic χ² distribution to compute the p-value of IndOR and fastepi does not enable an accurate control of the false positive rate in presence of causal SNPs in LD. With respect to IndOR the first reason for the apparent inaccurate control comes from the definition of epistasis. IndOR relies on a definition of epistasis based on Odds ratio independence [13] different from the statistical definition assumed in this paper. Only biological investigation can validate whether more true biological interactions are detected using one definition or the other. The second reason is numerical. Fixing an arbitrary score to SNP pairs with a singular V_ϕ matrix is expected to modify the score distribution. For GBOOST, DSS and SHEsisEpi the p-value computed from the asymptotic χ² distribution allows an acceptable control of the false positive rate, with Bonferroni found to be slightly conservative, due to the correlation between SNPs. When implementing epistasis detection methods it should be noted that contingency tables with empty cells occurs at an important rate: assuming independent SNPs with uniform MAF distribution between 0.05 and 0.5 the probability of having a bivariate contingency table with no double recessive genotype in a GWAS of 1000 samples is $\int _{0.05}^{0.5}\left (1-f^{4}\right)^{1000}/0.45 = 0.25$. Using a Axiom GW EU chip more than 10¹⁰ such contingency tables would be expected, for which the χ² approximation can be invalid, and numerical issues can appear as in the case of the 2χ² test [32] implemented in the GWISFI distribution.

DSS is the most powerful method in most scenarios with no LD between the causal SNPs, with GBOOST being a good complementary solution to DSS with close performances in terms of power. In the presence of LD between the causal SNPs, IndOR is the most powerful method in most scenarios. However, given the more important false positive rate of this method the false discovery rate might be higher. Given that the power of DSS and GBOOST are influenced differently by the disease model parameter, a combination approach combining both scores might improve the overall power. This strategy is to be supported by the results on the WTCCC data on T2D were DSS and GBOOST detected distinct SNPs and genes in epistasis.

DSS also has the largest AUC for almost all scenarios, which confirms the good performance of the method. IndOR had the lowest AUC in all scenarios, which can be explained in part, by the different epistasis definition underlying the method.

Similar results are obtained when grouping SNPs by LD blocks, with only a limited increase of power for SNP pairs that are not in LD, and an important increase of power for SNP pairs in LD.

We show that strong dominant-dominant (M₁) and multiplicative (M₃) epistasis effects expected to lie in the range 2<ρ<5 can be detected with sufficient power by all methods except fastepi in relatively small cohort size of 2000 cases and controls. However we show that such interaction will only be detected in a narrow MAF window, except for DSS which is the only method able to detect interacting SNPs with MAF ranging from 0.1 to close to 0.5. Detecting weak epistasis effects ρ<2 will require GWAS with more than 10,000 cases and controls, which may become available in the coming years.

We conclude that computation time is no longer a limiting issue to perform exhaustive bivariate interaction analysis of a large GWAS. Our results suggest that SNPs with no or weak LD (r²<0.2) should be analyzed with DSS complemented with GBOOST in order to achieve the best statistical performances. The analysis of SNPs in LD is more complicated, because fastepi is the best method identified in terms of AUC but does not allow for a good control of the FPR and also because most SNPs in LD are located in the a same functional DNA region. The biological interpretation of intra-region interaction can be more complicated than for intergenic interactions. A possible strategy could be, therefore, to focus only on SNP pairs with no LD which would give valuable information on the biology underlying the disease. If the missing heritability in GWAS can be explained by epistasis between common variants, the existing methods will be powerful enough to detect them in GWAS with as few as tens of thousands of cases plus controls, as is likely to be available in the coming years.

In terms of disease understanding and new therapeutic strategies, the most notable impact of these approaches might not be the additional heritability identified, but the insight into disease network biology. While univariate GWAS analysis can identify isolated candidate genes, epistasis methods additionally identify interaction networks between risk genes. Complex diseases such as Diabetes or NASH results from the dysregulation of complex genetic and metabolic networks. For at least a decade it has been argued that disease complexity and redundancy in biological pathways might be responsible for the limited effect of many monotherapies [50]. The recent success in cancer, HIV [51], and cardiology [52] multitarget therapies appears to confirm this idea, creating a high interest in multitarget discovery in other therapeutic area, which might be supported by epistasis analysis of GWAS.

Conclusion

This study confirms that computation time is no longer a limiting factor for performing an exhaustive search of epistasis in large GWAS. For this task, using DSS on SNP pairs with limited LD seems to be a good strategy to achieve the best statistical performance. A combination approach using both DSS and GBOOST is supported by the simulation results and the analysis of the WTCCC dataset demonstrated that this approach can detect distinct genes in epistasis. Finally, weak epistasis between common variants will be detectable with existing methods when GWAS of a few tens of thousands cases and controls are available. The major remaining challenges are (i) the lack of unique and well defined epistasis definition at a biological level, and (ii) the relatively limited amount of tools for translating statistical interactions at a SNP level into biological mechanisms. Several tools have recently emerged to fill this last gap. For instance Emily [18] and Stanislas et al. [19] have developed two methods for epistasis analysis at a gene level, Ma et al. [20], an epistasis analysis at a gene level for continuous phenotypes, and Lin et al. [53], introduced an epistasis test in meta-analysis. The present work could therefore be expanded to evaluate the performance of these new tools in a realistic framework. Other influencing factor should also be evaluated such as chip coverage and imputation, or the effect of population structure.

References

Visscher PM, Brown MA, McCarthy MI, Yang J. Five years of GWAS discovery. Am J Hum Genet. 2012; 90(1):7–24. https://doi.org/10.1016/j.ajhg.2011.11.029.
Article PubMed PubMed Central CAS Google Scholar
Manolio TA, Collins FS, Cox NJ, Goldstein DB, Hindorff LA, Hunter DJ, McCarthy MI, Ramos EM, Cardon LR, Chakravarti A, Cho JH, Guttmacher AE, Kong A, Kruglyak L, Mardis E, Rotimi CN, Slatkin M, Valle D, Whittemore AS, Boehnke M, Clark AG, Eichler EE, Gibson G, Haines JL, Mackay TFC, McCarroll SA, Visscher PM. Finding the missing heritability of complex diseases. Nature. 2009; 461(7265):747–53. https://doi.org/10.1038/nature08494.
Article PubMed PubMed Central CAS Google Scholar
Maher B. Personal genomes: The case of the missing heritability. Nature. 2008; 456(7218):18–21. https://doi.org/10.1038/456018a.
Article PubMed CAS Google Scholar
Polderman TJC, Benyamin B, de Leeuw CA, Sullivan PF, van Bochoven A, Visscher PM, Posthuma D. Meta-analysis of the heritability of human traits based on fifty years of twin studies. Nat Genet. 2015; 47(7):702–9. https://doi.org/10.1038/ng.3285.
Article PubMed CAS Google Scholar
de los Campos G, Sorensen D, Gianola D. Genomic Heritability: What Is It?PLoS Genet. 2015; 11(5):1–21. https://doi.org/10.1371/journal.pgen.1005048.
Article CAS Google Scholar
Phillips PC. Epistasis [mdash] the essential role of gene interactions in the structure and evolution of genetic systems. Nat Rev Genet. 2008; 9(11):855–67. https://doi.org/10.1038/nrg2452.
Article PubMed PubMed Central CAS Google Scholar
Zuk O, Hechter E, Sunyaev SR, Lander ES. The mystery of missing heritability: Genetic interactions create phantom heritability. Proc Natl Acad Sci U S A. 2012; 109(4):1193–8. https://doi.org/10.1073/pnas.1119675109.
Article PubMed PubMed Central Google Scholar
Wei W-H, Hemani G, Haley CS. Detecting epistasis in human complex traits. Nat Rev Genet. 2014; 15(11):722–33. https://doi.org/10.1038/nrg3747.
Article PubMed CAS Google Scholar
Niel C, Sinoquet C, Dina C, Rocheleau G. A survey about methods dedicated to epistasis detection. Front Genet. 2015;6(SEP). https://doi.org/10.3389/fgene.2015.00285.
Cordell HJ. Epistasis: what it means, what it doesn’t mean, and statistical methods to detect it in humans,. Hum Mol Genet. 2002; 11(20):2463–8. https://doi.org/10.1093/hmg/11.20.2463.
Article PubMed CAS Google Scholar
Wang Y, Liu G, Feng M, Wong L. An empirical comparison of several recent epistatic interaction detection methods. Bioinformatics. 2011; 27(21):2936–43. https://doi.org/10.1093/bioinformatics/btr512.
Article PubMed CAS Google Scholar
Frost HR, Amos CI, Moore JH. A global test for gene-gene interactions based on random matrix theory. Genet Epidemiol. 2016; 40(8):689–701. https://doi.org/10.1002/gepi.21990.
Article PubMed PubMed Central Google Scholar
Emily M. IndOR: A new statistical procedure to test for SNP-SNP epistasis in genome-wide association studies. Stat Med. 2012; 31(21):2359–73. https://doi.org/10.1002/sim.5364.
Article PubMed CAS Google Scholar
Goudey B, Rawlinson D, Wang Q, Shi F, Ferra H, Campbell RM, Stern L, Inouye MT, Ong CS, Kowalczyk A. GWIS - model-free, fast and exhaustive search for epistatic interactions in case-control GWAS. BMC Genomics. 2013; 14(Suppl 3):10. https://doi.org/10.1186/1471-2164-14-S3-S10.
Article Google Scholar
Yu Z, Demetriou M, Gillen DL. Genome-Wide Analysis of Gene-Gene and Gene-Environment Interactions Using Closed-Form Wald Tests Genetic Epidemiology. Genet Epidemiol. 2015; 0:1–10. https://doi.org/10.1002/gepi.21907.
Google Scholar
Wan X, Yang C, Yang Q, Xue H, Fan X, Tang NLS, Yu W. BOOST: A fast approach to detecting gene-gene interactions in genome-wide case-control studies. Am J Hum Genet. 2010; 87(3):325–40. https://doi.org/10.1016/j.ajhg.2010.07.021.
Article PubMed PubMed Central CAS Google Scholar
Dudek SM, Motsinger AA, Velez DR, Williams SM, Ritchie MD. Data simulation software for whole-genome association and other studies in human genetics. Pac Symp Biocomput. 2006; 510:499–510.
Google Scholar
Emily M. AGGrEGATOr: A Gene-based GEne-Gene interActTiOn test for case-control association studies. Stat Appl Genet Mol Biol. 2016; 15(2):151–71. https://doi.org/10.1515/sagmb-2015-0074.
Article PubMed CAS Google Scholar
Stanislas V, Dalmasso C, Ambroise C. Eigen-Epistasis for detecting Gene-Gene interactions. BMC Bioinformatics. 2017; 18:54. https://doi.org/10.1186/s12859-017-1488-0.
Article PubMed PubMed Central CAS Google Scholar
Ma L, Clark AG, Keinan A. Gene-Based Testing of Interactions in Association Studies of Quantitative Traits. PLoS Genetics. 2013; 9(2):1–12. https://doi.org/10.1371/journal.pgen.1003321.
Article CAS Google Scholar
Su Z, Marchini J, Donnelly P. HAPGEN2: Simulation of multiple disease SNPs. Bioinformatics. 2011; 27(16):2304–5. https://doi.org/10.1093/bioinformatics/btr341.
Article PubMed PubMed Central CAS Google Scholar
Li C, Li M. GWAsimulator: A rapid whole-genome simulation program. Bioinformatics. 2008; 24(1):140–2. https://doi.org/10.1093/bioinformatics/btm549.
Article PubMed CAS Google Scholar
Perduca V, Sinoquet C, Mourad R, Nuel G. Alternative methods for H1 simulations in genome-wide association studies. Hum Hered. 2012; 73(2):95–104. https://doi.org/10.1159/000336194.
Article PubMed CAS Google Scholar
Spencer CCA, Su Z, Donnelly P, Marchini J. Designing genome-wide association studies: Sample size, power, imputation, and the choice of genotyping chip. PLoS Genetics. 2009;5(5). https://doi.org/10.1371/journal.pgen.1000477.
Schüpbach T, Xenarios I, Bergmann S, Kapur K. FastEpistasis: A high performance computing solution for quantitative trait epistasis. Bioinformatics. 2010; 26(11):1468–9. https://doi.org/10.1093/bioinformatics/btq147.
Article PubMed PubMed Central CAS Google Scholar
Hu X, Liu Q, Zhang Z, Li Z, Wang S, He L, Shi Y. SHEsisEpi, a GPU-enhanced genome-wide SNP-SNP interaction scanning algorithm, efficiently reveals the risk genetic epistasis in bipolar disorder. Cell Res. 2010; 20(7):854–7. https://doi.org/10.1038/cr.2010.68.
Article PubMed Google Scholar
Yung LS, Yang C, Wan X, Yu W. GBOOST: A GPU-based tool for detecting gene-gene interactions in genome-wide case control studies. Bioinformatics. 2011; 27(9):1309–10. https://doi.org/10.1093/bioinformatics/btr114.
Article PubMed PubMed Central CAS Google Scholar
Wellcome T, Case T, Consortium C. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls,. Nature. 2007; 447(7145):661–78. https://doi.org/10.1038/nature05911.
Article CAS Google Scholar
Auton A, Abecasis GR, Altshuler DM, Durbin RM, Abecasis GR, Bentley DR, Chakravarti A, Clark AG, Donnelly P, Eichler EE, Flicek P, Gabriel SB, Gibbs RA, Green ED, Hurles ME, Knoppers BM, Korbel JO, Lander ES, Lee C, Lehrach H, Mardis ER, Marth GT, McVean GA, Nickerson DA, Schmidt JP, Sherry ST, Wang J, Wilson RK, Gibbs RA, Boerwinkle E, Doddapaneni H, Han Y, Korchina V, Kovar C, Lee S, Muzny D, Reid JG, Zhu Y, Wang J, Chang Y, Feng Q, Fang X, Guo X, Jian M, Jiang H, Jin X, Lan T, Li G, Li J, Li Y, Liu S, Liu X, Lu Y, Ma X, Tang M, Wang B, Wang G, Wu H, Wu R, Xu X, Yin Y, Zhang D, Zhang W, Zhao J, Zhao M, Zheng X, Lander ES, Altshuler DM, Gabriel SB, Gupta N, Gharani N, Toji LH, Gerry NP, Resch AM, Flicek P, Barker J, Clarke L, Gil L, Hunt SE, Kelman G, Kulesha E, Leinonen R, McLaren WM, Radhakrishnan R, Roa A, Smirnov D, Smith RE, Streeter I, Thormann A, Toneva I, Vaughan B, Zheng-Bradley X, Bentley DR, Grocock R, Humphray S, James T, Kingsbury Z, Lehrach H, Sudbrak R, Albrecht MW, Amstislavskiy VS, Borodina TA, Lienhard M, Mertes F, Sultan M, Timmermann B, Yaspo M-L, Mardis ER, Wilson RK, Fulton L, Fulton R, Sherry ST, Ananiev V, Belaia Z, Beloslyudtsev D, Bouk N, Chen C, Church D, Cohen R, Cook C, Garner J, Hefferon T, Kimelman M, Liu C, Lopez J, Meric P, O’Sullivan C, Ostapchuk Y, Phan L, Ponomarov S, Schneider V, Shekhtman E, Sirotkin K, Slotta D, Zhang H, McVean GA, Durbin RM, Balasubramaniam S, Burton J, Danecek P, Keane TM, Kolb-Kokocinski A, McCarthy S, Stalker J, Quail M, Schmidt JP, Davies CJ, Gollub J, Webster T, Wong B, Zhan Y, Auton A, Campbell CL, Kong Y, Marcketta A, Gibbs RA, Yu F, Antunes L, Bainbridge M, Muzny D, Sabo A, Huang Z, Wang J, Coin LJM, Fang L, Guo X, Jin X, Li G, Li Q, Li Y, Li Z, Lin H, Liu B, Luo R, Shao H, Xie Y, Ye C, Yu C, Zhang F, Zheng H, Zhu H, Alkan C, Dal E, Kahveci F, Marth GT, Garrison EP, Kural D, Lee W-P, Fung Leong W, Stromberg M, Ward AN, Wu J, Zhang M, Daly MJ, DePristo MA, Handsaker RE, Altshuler DM, Banks E, Bhatia G, del Angel G, Gabriel SB, Genovese G, Gupta N, Li H, Kashin S, Lander ES, McCarroll SA, Nemesh JC, Poplin RE, Yoon SC, Lihm J, Makarov V, Clark AG, Gottipati S, Keinan A, Rodriguez-Flores JL, Korbel JO, Rausch T, Fritz MH, Stütz AM, Flicek P, Beal K, Clarke L, Datta A, Herrero J, McLaren WM, Ritchie GRS, Smith RE, Zerbino D, Zheng-Bradley X, Sabeti PC, Shlyakhter I, Schaffner SF, Vitti J, Cooper DN, Ball EV, Stenson PD, Bentley DR, Barnes B, Bauer M, Keira Cheetham R, Cox A, Eberle M, Humphray S, Kahn S, Murray L, Peden J, Shaw R, Kenny EE, Batzer MA, Konkel M. A global reference for human genetic variation. Nature. 2015; 526(7571):68–74. https://doi.org/10.1038/nature15393.
Article PubMed CAS Google Scholar
Younossi ZM, Koenig AB, Abdelatif D, Fazel Y, Henry L, Wymer M. Global epidemiology of nonalcoholic fatty liver disease—Meta-analytic assessment of prevalence, incidence, and outcomes. Hepatology. 2016; 64(1):73–84. https://doi.org/10.1002/hep.28431.
Article PubMed Google Scholar
Wang Q, Shi F, Kowalczyk A, Campbell RM, Goudey B, Rawlinson D, Harwood A, Ferra H, Kowalczyk A. GWISFI: A universal GPU interface for exhaustive search of pairwise interactions in case-control GWAS in minutes. In: Proceedings - 2014 IEEE International Conference on Bioinformatics and Biomedicine, IEEE BIBM 2014: 2014. p. 403–409. https://doi.org/10.1109/BIBM.2014.6999192.
Canela-Xandri O, Julià A, Gelpí JL, Marsal S. Unveiling Case-Control Relationships in Designing a Simple and Powerful Method for Detecting Gene-Gene Interactions. Genet Epidemiol. 2012; 36(7):710–6. https://doi.org/10.1002/gepi.21665.
Article PubMed Google Scholar
Hu T, Chen Y, Kiralis JW, Collins RL, Wejse C, Sirugo G, Williams SM, Moore JH, Hardy J, Singleton A, Hirschhorn J, Daly M, Wang W, Barratt B, Clayton D, Shendure J, Ji H, Clark A, Boerwinkle E, Hixson J, Moore J, Asselbergs F, Williams S, Carlborg O, Haley C, Cordell H, Cordell H, Moore J, Moore J, Williams S, Moore J, Williams S, Upstill-Goddard R, Eccles D, Fliege J, Anastassiou D, Chanda P, Sucheston L, Zhang A, Chanda P, Sucheston L, Zhang A, Fan R, Zhong M, Wang S, Hu T, Sinnott-Armstrong N, Kiralis J, McKinney B, Crowe J, Guo J, Moore J, Barney N, Tsai C-T, Moore J, Gilbert J, Tsai C-T, Cover T, Thomas J, Chechik G, Globerson A, Tishby N, Varadan V, Miller DI, Anastassiou D, Olesen R, Wejse C, Velez D, Urbanowicz R, Kiralis J, Fisher J, Urbanowicz R, Kiralis J, Sinnott-Armstrong N, Bjerregaard-Andersen M, Silva Zd, Ravn P, Shannon P, Markiel A, Ozier O, West D, Greene C, Himmelstein D, Nelson H, Varadan V, Anastassiou D, Barreiro L, Neyrolles O, Babb C, Tailleux L, Pham-Thi N, Bergeron-Lafaurie A, Mantovani A, Garlanda C, Doni A, Azzurri A, Sow O, Amedei A, Doherty T, Arditi M, Schroder N, Schumann R, Liu P, Stenger S, Li H, Bornman L, Campbell S, Fielding K, Gringhuis S, Dunnen Jd, Litjens M. An information-gain approach to detecting three-way epistatic interactions in genetic association studies. J Am Med Inform Assoc. 2013; 20(4):630–6. https://doi.org/10.1136/amiajnl-2012-001525.
Article PubMed PubMed Central Google Scholar
Kam-Thong T, Czamara D, Tsuda K, Borgwardt K, Lewis CM, Erhardt-Lehmann A, Hemmer B, Rieckmann P, Daake M, Weber F, Wolf C, Ziegler A, Pütz B, Holsboer F, Schölkopf B, Müller-Myhsok B. EPIBLASTER-fast exhaustive two-locus epistasis detection strategy using graphical processing units. Eur J Hum Genet. 2011; 19(4):465–71. https://doi.org/10.1038/ejhg.2010.196.
Article PubMed CAS Google Scholar
Hiersche M, Rühle F, Stoll M. Postgwas: advanced GWAS interpretation in R. PLoS ONE. 2013; 8(8):71775. https://doi.org/10.1371/journal.pone.0071775.
Article CAS Google Scholar
Barrett JC, Fry B, Maller J, Daly MJ. Haploview: Analysis and visualization of LD and haplotype maps. Bioinformatics. 2005; 21(2):263–5. https://doi.org/10.1093/bioinformatics/bth457.
Article PubMed CAS Google Scholar
Gabriel SB, Schaffner SF, Nguyen H, Moore JM, Roy J, Blumenstiel B, Higgins J, DeFelice M, Lochner A, Faggart M, Liu-Cordero SN, Rotimi C, Adeyemo A, Cooper R, Ward R, Lander ES, Daly MJ, Altshuler D. The structure of haplotype blocks in the human genome. Sci (New York). 2002; 296(5576):2225–9. https://doi.org/10.1126/science.1069424.
Article CAS Google Scholar
Press WH, Teukolsky SA, Vetterling WT, Flannery BP. Numerical Recipes: The Art of Scientific Computing. In: Numerical Recipes: The Art of Scientific, New york: edn. New York: Cambridge University Press: 2007. p. 454. Chap. Section 9.
Google Scholar
William B. Detection of Epistasis in Genome-Wide Association Studies. PhD thesis. 2016.
Ha N-T, Freytag S, Bickeboeller H. Coverage and efficiency in current SNP chips. Eur J Hum Genet. 2014; 22(9):1124–30. https://doi.org/10.1038/ejhg.2013.304.
Article PubMed PubMed Central CAS Google Scholar
Locke AE, Kahali B, Berndt SI, Justice AE, Pers TH, Day FR, Powell C, Vedantam S, Buchkovich ML, Yang J, Croteau-Chonka DC, Esko T, Fall T, Ferreira T, Gustafsson S, Kutalik Z, Luan J, Magi R, Randall JC, Winkler TW, Wood AR, Workalemahu T, Faul JD, Smith JA, Hua Zhao J, Zhao W, Chen J, Fehrmann R, Hedman AK, Karjalainen J, Schmidt EM, Absher D, Amin N, Anderson D, Beekman M, Bolton JL, Bragg-Gresham JL, Buyske S, Demirkan A, Deng G, Ehret GB, Feenstra B, Feitosa MF, Fischer K, Goel A, Gong J, Jackson AU, Kanoni S, Kleber ME, Kristiansson K, Lim U, Lotay V, Mangino M, Mateo Leach I, Medina-Gomez C, Medland SE, Nalls MA, Palmer CD, Pasko D, Pechlivanis S, Peters MJ, Prokopenko I, Shungin D, Stancakova A, Strawbridge RJ, Ju Sung Y, Tanaka T, Teumer A, Trompet S, van der Laan SW, van Setten J, Van Vliet-Ostaptchouk JV, Wang Z, Yengo L, Zhang W, Isaacs A, Albrecht E, Arnlov J, Arscott GM, Attwood AP, Bandinelli S, Barrett A, Bas IN, Bellis C, Bennett AJ, Berne C, Blagieva R, Bluher M, Bohringer S, Bonnycastle LL, Bottcher Y, Boyd HA, Bruinenberg M, Caspersen IH, Ida Chen YD, Clarke R, Daw EW, de Craen AJ, Delgado G, Dimitriou M, Doney AS, Eklund N, Estrada K, Eury E, Folkersen L, Fraser RM, Garcia ME, Geller F, Giedraitis V, Gigante B, Go AS, Golay A, Goodall AH, Gordon SD, Gorski M, Grabe HJ, Grallert H, Grammer TB, Grassler J, Gronberg H, Groves CJ, Gusto G, Haessler J, Hall P, Haller T, Hallmans G, Hartman CA, Hassinen M, Hayward C, Heard-Costa NL, Helmer Q, Hengstenberg C, Holmen O, Hottenga JJ, James AL, Jeff JM, Johansson A, Jolley J, Juliusdottir T, Kinnunen L, Koenig W, Koskenvuo M, Kratzer W, Laitinen J, Lamina C, Leander K, Lee NR, Lichtner P, Lind L, Lindstrom J, Sin Lo K, Lobbens S, Lorbeer R, Lu Y, Mach F, Magnusson PK, Mahajan A, McArdle WL, McLachlan S, Menni C, Merger S, Mihailov E, Milani L, Moayyeri A, Monda KL, Morken MA, Mulas A, Muller G, Muller-Nurasyid M, Musk AW, Nagaraja R, Nothen MM, Nolte IM, Pilz S, Rayner NW, Renstrom F, Rettig R, Ried JS, Ripke S, Robertson NR, Rose LM, Sanna S, Scharnagl H, Scholtens S, Schumacher FR, Scott WR, Seufferlein T, Shi J, Vernon Smith A, Smolonska J, Stanton AV, Steinthorsdottir V, Stirrups K, Stringham HM, Sundstrom J, Swertz MA, Swift AJ, Syvanen AC, Tan ST, Tayo BO, Thorand B, Thorleifsson G, Tyrer JP, Uh HW, Vandenput L, Verhulst FC, Vermeulen SH, Verweij N, Vonk JM, Waite LL, Warren HR, Waterworth D, Weedon MN, Wilkens LR, Willenborg C, Wilsgaard T, Wojczynski MK, Wong A, Wright AF, Zhang Q, LifeLines Cohort S, Brennan EP, Choi M, Dastani Z, Drong AW, Eriksson P, Franco-Cereceda A, Gadin JR, Gharavi AG, Goddard ME, Handsaker RE, Huang J, Karpe F, Kathiresan S, Keildson S, Kiryluk K, Kubo M, Lee JY, Liang L, Lifton RP, Ma B, McCarroll SA, McKnight AJ, Min JL, Moffatt MF, Montgomery GW, Murabito JM, Nicholson G, Nyholt DR, Okada Y, Perry JR, Dorajoo R, Reinmaa E, Salem RM, et al.Genetic studies of body mass index yield new insights for obesity biology. Nature. 2015; 518(7538):197–206. https://doi.org/10.1038/nature14177.
Article PubMed PubMed Central CAS Google Scholar
MacArthur J, Bowler E, Cerezo M, Gil L, Hall P, Hastings E, Junkins H, McMahon A, Milano A, Morales J, MayPendlington Z, Welter D, Burdett T, Hindorff L, Flicek P, Cunningham F, Parkinson H. The new NHGRI-EBI Catalog of published genome-wide association studies (GWAS Catalog). Nucleic Acids Res. 2017; 45(D1):896–901. https://doi.org/10.1093/nar/gkw1133.
Article CAS Google Scholar
Evangelou E, Ioannidis JPA. Meta-analysis methods for genome-wide association studies and beyond. Nat Rev Genet. 2013; 14(6):379–89. https://doi.org/10.1038/nrg3472.
Article PubMed CAS Google Scholar
Mahajan A, Go MJ, Zhang W, Below JE, Gaulton KJ, Ferreira T, Horikoshi M, Johnson AD, Ng MCY, Prokopenko I, Saleheen D, Wang X, Zeggini E, Abecasis GR, Adair LS, Almgren P, Atalay M, Aung T, Baldassarre D, Balkau B, Bao Y, Barnett AH, Barroso I, Basit A, Been LF, Beilby J, Bell GI, Benediktsson R, Bergman RN, Boehm BO, Boerwinkle E, Bonnycastle LL, Burtt N, Cai Q, Campbell H, Carey J, Cauchi S, Caulfield M, Chan JCN, Chang L-C, Chang T-J, Chang Y-C, Charpentier G, Chen C-H, Chen H, Chen Y-T, Chia K-S, Chidambaram M, Chines PS, Cho NH, Cho YM, Chuang L-M, Collins FS, Cornelis MC, Couper DJ, Crenshaw AT, van Dam RM, Danesh J, Das D, de Faire U, Dedoussis G, Deloukas P, Dimas AS, Dina C, Doney ASF, Donnelly PJ, Dorkhan M, van Duijn C, Dupuis J, Edkins S, Elliott P, Emilsson V, Erbel R, Eriksson JG, Escobedo J, Esko T, Eury E, Florez JC, Fontanillas P, Forouhi NG, Forsen T, Fox C, Fraser RM, Frayling TM, Froguel P, Frossard P, Gao Y, Gertow K, Gieger C, Gigante B, Grallert H, Grant GB, Groop LC, Groves CJ, Grundberg E, Guiducci C, Hamsten A, Han B-G, Hara K, Hassanali N, Hattersley AT, Hayward C, Hedman AK, Herder C, Hofman A, Holmen OL, Hovingh K, Hreidarsson AB, Hu C, Hu FB, Hui J, Humphries SE, Hunt SE, Hunter DJ, Hveem K, Hydrie ZI, Ikegami H, Illig T, Ingelsson E, Islam M, Isomaa B, Jackson AU, Jafar T, James A, Jia W, Jöckel K-H, Jonsson A, Jowett JBM, Kadowaki T, Kang HM, Kanoni S, Kao WHL, Kathiresan S, Kato N, Katulanda P, Keinanen-Kiukaanniemi SM, Kelly AM, Khan H, Khaw K-T, Khor C-C, Kim H-L, Kim S, Kim YJ, Kinnunen L, Klopp N, Kong A, Korpi-Hyövälti E, Kowlessur S, Kraft P, Kravic J, Kristensen MM, Krithika S, Kumar A, Kumate J, Kuusisto J, Kwak SH, Laakso M, Lagou V, Lakka TA, Langenberg C, Langford C, Lawrence R, Leander K, Lee J-M, Lee NR, Li M, Li X, Li Y, Liang J, Liju S, Lim W-Y, Lind L, Lindgren CM, Lindholm E, Liu C-T, Liu JJ, Lobbens S, Long J, Loos RJF, Lu W, Luan J, Lyssenko V, Ma RCW, Maeda S, Mägi R, Männistö S, Matthews DR, Meigs JB, Melander O, Metspalu A, Meyer J, Mirza G, Mihailov E, Moebus S, Mohan V, Mohlke KL, Morris AD, Mühleisen TW, Müller-Nurasyid M, Musk B, Nakamura J, Nakashima E, Navarro P, Ng P-K, Nica AC, Nilsson PM, Njølstad I, Nöthen MM, Ohnaka K, Ong TH, Owen KR, Palmer CNA, Pankow JS, Park KS, Parkin M, Pechlivanis S, Pedersen NL, Peltonen L, Perry JRB, Peters A, Pinidiyapathirage JM, Platou CGP, Potter S, Price JF, Qi L, Radha V, Rallidis L, Rasheed A, Rathmann W, Rauramaa R, Raychaudhuri S, Rayner NW, Rees SD, Rehnberg E, Ripatti S, Robertson N, Roden M, Rossin EJ, Rudan I, Rybin D, Saaristo TE, Salomaa V, Saltevo J, Samuel M. Genome-wide trans-ancestry meta-analysis provides insight into the genetic architecture of type 2 diabetes susceptibility. Nat Genet. 2014; 46(3):234–44. https://doi.org/10.1038/ng.2897.
Article PubMed CAS Google Scholar
Lambert J-C, Ibrahim-Verbaas CA, Harold D, Naj AC, Sims R, Bellenguez C, Jun G, DeStefano AL, Bis JC, Beecham GW, Grenier-Boley B, Russo G, Thornton-Wells TA, Jones N, Smith AV, Chouraki V, Thomas C, Ikram MA, Zelenika D, Vardarajan BN, Kamatani Y, Lin C-F, Gerrish A, Schmidt H, Kunkle B, Dunstan ML, Ruiz A, Bihoreau M-T, Choi S-H, Reitz C, Pasquier F, Hollingworth P, Ramirez A, Hanon O, Fitzpatrick AL, Buxbaum JD, Campion D, Crane PK, Baldwin C, Becker T, Gudnason V, Cruchaga C, Craig D, Amin N, Berr C, Lopez OL, De Jager PL, Deramecourt V, Johnston JA, Evans D, Lovestone S, Letenneur L, Morón FJ, Rubinsztein DC, Eiriksdottir G, Sleegers K, Goate AM, Fiévet N, Huentelman MJ, Gill M, Brown K, Kamboh MI, Keller L, Barberger-Gateau P, McGuinness B, Larson EB, Green R, Myers AJ, Dufouil C, Todd S, Wallon D, Love S, Rogaeva E, Gallacher J, St George-Hyslop P, Clarimon J, Lleo A, Bayer A, Tsuang DW, Yu L, Tsolaki M, Bossù P, Spalletta G, Proitsi P, Collinge J, Sorbi S, Sanchez-Garcia F, Fox NC, Hardy J, Naranjo MCD, Bosco P, Clarke R, Brayne C, Galimberti D, Mancuso M, Matthews F, Moebus S, Mecocci P, Del Zompo M, Maier W, Hampel H, Pilotto A, Bullido M, Panza F, Caffarra P, Nacmias B, Gilbert JR, Mayhaus M, Lannfelt L, Hakonarson H, Pichler S, Carrasquillo MM, Ingelsson M, Beekly D, Alvarez V, Zou F, Valladares O, Younkin SG, Coto E, Hamilton-Nelson KL, Gu W, Razquin C, Pastor P, Mateo I, Owen MJ, Faber KM, Jonsson PV, Combarros O, O’Donovan MC, Cantwell LB, Soininen H, Blacker D, Mead S, Mosley TH, Bennett DA, Harris TB, Fratiglioni L, Holmes C, de Bruijn RFAG, Passmore P, Montine TJ, Bettens K, Rotter JI, Brice A, Morgan K, Foroud TM, Kukull WA, Hannequin D, Powell JF, Nalls MA, Ritchie K, Lunetta KL, Kauwe JSK, Boerwinkle E, Riemenschneider M, Boada M, Hiltunen M, Martin ER, Schmidt R, Rujescu D, Wang L-S, Dartigues J-F, Mayeux R, Tzourio C, Hofman A, Nöthen MM, Graff C, Psaty BM, Jones L, Haines JL, Holmans PA, Lathrop M, Pericak-Vance MA, Launer LJ, Farrer LA, van Duijn CM, Van Broeckhoven C, Moskvina V, Seshadri S, Williams J, Schellenberg GD, Amouyel P. Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer’s disease. Nat Genet. 2013; 45(12):1452–8. https://doi.org/10.1038/ng.2802.
Article PubMed PubMed Central CAS Google Scholar
Atanasovska B, Kumar V, Fu J, Wijmenga C, Hofker MH. GWAS as a Driver of Gene Discovery in Cardiometabolic Diseases. Trends Endocrinol Metab. 2015; 26(12):722–32. https://doi.org/10.1016/j.tem.2015.10.004.
Article PubMed CAS Google Scholar
Pharoah PDP, Tsai Y-Y, Ramus SJ, Phelan CM, Goode EL, Lawrenson K, Buckley M, Fridley BL, Tyrer JP, Shen H, Weber R, Karevan R, Larson MC, Song H, Tessier DC, Bacot F, Vincent D, Cunningham JM, Dennis J, Dicks E, Aben KK, Anton-Culver H, Antonenkova N, Armasu SM, Baglietto L, Bandera EV, Beckmann MW, Birrer MJ, Bloom G, Bogdanova N, Brenton JD, Brinton LA, Brooks-Wilson A, Brown R, Butzow R, Campbell I, Carney ME, Carvalho RS, Chang-Claude J, Chen YA, Chen Z, Chow W-H, Cicek MS, Coetzee G, Cook LS, Cramer DW, Cybulski C, Dansonka-Mieszkowska A, Despierre E, Doherty JA, Dörk T, du Bois A, Dürst M, Eccles D, Edwards R, Ekici AB, Fasching PA, Fenstermacher D, Flanagan J, Gao Y-T, Garcia-Closas M, Gentry-Maharaj A, Giles G, Gjyshi A, Gore M, Gronwald J, Guo Q, Halle MK, Harter P, Hein A, Heitz F, Hillemanns P, Hoatlin M, Høgdall E, Høgdall CK, Hosono S, Jakubowska A, Jensen A, Kalli KR, Karlan BY, Kelemen LE, Kiemeney LA, Kjaer SK, Konecny GE, Krakstad C, Kupryjanczyk J, Lambrechts D, Lambrechts S, Le ND, Lee N, Lee J, Leminen A, Lim BK, Lissowska J, Lubiński J, Lundvall L, Lurie G, Massuger LFAG, Matsuo K, McGuire V, McLaughlin JR, Menon U, Modugno F, Moysich KB, Nakanishi T, Narod SA, Ness RB, Nevanlinna H, Nickels S, Noushmehr H, Odunsi K, Olson S, Orlow I, Paul J, Pejovic T, Pelttari LM, Permuth-Wey J, Pike MC, Poole EM, Qu X, Risch HA, Rodriguez-Rodriguez L, Rossing MA, Rudolph A, Runnebaum I, Rzepecka IK, Salvesen HB, Schwaab I, Severi G, Shen H, Shridhar V, Shu X-O, Sieh W, Southey MC, Spellman P, Tajima K, Teo S-H, Terry KL, Thompson PJ, Timorek A, Tworoger SS, van Altena AM, van den Berg D, Vergote I, Vierkant RA, Vitonis AF, Wang-Gohrke S, Wentzensen N, Whittemore AS, Wik E, Winterhoff B, Woo YL, Wu AH, Yang HP, Zheng W, Ziogas A, Zulkifli F, Goodman MT, Hall P, Easton DF, Pearce CL, Berchuck A, Chenevix-Trench G, Iversen E, Monteiro ANA, Gayther SA, Schildkraut JM, Sellers TA. GWAS meta-analysis and replication identifies three new susceptibility loci for ovarian cancer. Nat Genet. 2013; 45(4):362–70. https://doi.org/10.1038/ng.2564.
Article PubMed PubMed Central CAS Google Scholar
Pers TH, Karjalainen JM, Chan Y, Westra H-J, Wood AR, Yang J, Lui JC, Vedantam S, Gustafsson S, Esko T, Frayling T, Speliotes EK, Boehnke M, Raychaudhuri S, Fehrmann RSN, Hirschhorn JN, Franke L. Biological interpretation of genome-wide association studies using predicted gene functions. Nat Commun. 2015; 6:5890. https://doi.org/10.1038/ncomms6890.
Article PubMed PubMed Central CAS Google Scholar
Bedo J, Rawlinson D, Goudey B, Ong CS. Stability of bivariate GWAS biomarker detection. PLoS ONE. 2014; 9(4). https://doi.org/10.1371/journal.pone.0093319.
Zimmermann GR, Lehár J, Keith CT. Multi-target therapeutics: when the whole is greater than the sum of the parts. Drug Discov Today. 2007; 12(1-2):34–42. https://doi.org/10.1016/j.drudis.2006.11.008.
Article PubMed CAS Google Scholar
Xu L, Pegu A, Rao E, Doria-Rose N, Beninga J, McKee K, Lord DM, Wei RR, Deng G, Louder M, Schmidt SD, Mankoff Z, Wu L, Asokan M, Beil C, Lange C, Leuschner WD, Kruip J, Sendak R, Kwon YD, Zhou T, Chen X, Bailer RT, Wang K, Choe M, Tartaglia LJ, Barouch DH, O’Dell S, Todd JP, Burton DR, Roederer M, Connors M, Koup RA, Kwong PD, Yang ZY, Mascola JR, Nabel GJ. Trispecific broadly neutralizing HIV antibodies mediate potent SHIV protection in macaques. Science. 2017; 358(6359):85–90. https://doi.org/10.1126/science.aan8630.
Article PubMed PubMed Central CAS Google Scholar
Bramblett T, Teleb M, Albaghdadi A, Agrawal H, Mukherjee D. Heart Failure with Preserved Ejection Fraction: Entresto a Possible Option. Cardiovasc Hematol Disorders-Drug Targets. 2017; 17(2):80–5. https://doi.org/10.2174/1871529X17666170703120237.
Article CAS Google Scholar
Lin C, Chu CM, Su SL. Epistasis Test in meta-analysis: A multi-parameter Markov chain Monte Carlo model for consistency of evidence. PLoS ONE. 2016; 11(4):1–17. https://doi.org/10.1371/journal.pone.0152891.
Google Scholar

Download references

Acknowledgements

The authors thanks Adam Kowalcyk for granted access to the DSS software and fruitful discussions and the WTCCC for granted access the WTCCC1 GWAS on T2D.

Funding

The work of all authors was supported by funding from Sanofi-Aventis Recherche et Developpement.

Availability of data and materials

The datasets generated and analyzed during the current study are available from the corresponding author on reasonable request.

Author information

Authors and Affiliations

SANOFI R&D, Translational Sciences, Chilly Mazarin, 91385, France
Clément Chatelain & Franck Augé
Laboratoire de Probabilités et Modèles Aléatoires, Université Pierre et Marie Curie, 4, place Jussieu, Paris Cedex 05, 75252, France
Guillermo Durand
SANOFI R&D, Biostatistics & Programming, Chilly Mazarin, 91385, France
Vincent Thuillier

Authors

Clément Chatelain
View author publications
You can also search for this author in PubMed Google Scholar
Guillermo Durand
View author publications
You can also search for this author in PubMed Google Scholar
Vincent Thuillier
View author publications
You can also search for this author in PubMed Google Scholar
Franck Augé
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors contributed to the design of the study. GD selected the simulation and epistasis detection methods. CC and GD implemented and conducted the simulations. CC drafted the manuscript and all authors critically read and edit the manuscript. All authors approved the final manuscript.

Corresponding author

Correspondence to Clément Chatelain.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional file

Additional file 1

Simulation results on all scenarios and epistasis networks detected in the T2D GWAS. (PDF 331 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Chatelain, C., Durand, G., Thuillier, V. et al. Performance of epistasis detection methods in semi-simulated GWAS. BMC Bioinformatics 19, 231 (2018). https://doi.org/10.1186/s12859-018-2229-8

Download citation

Received: 26 August 2017
Accepted: 04 June 2018
Published: 18 June 2018
DOI: https://doi.org/10.1186/s12859-018-2229-8

Performance of epistasis detection methods in semi-simulated GWAS