Skip to main content
Figure 6 | BMC Bioinformatics

Figure 6

From: The choice of null distributions for detecting gene-gene interactions in genome-wide association studies

Figure 6

The procedure of using independent data sets in hypothesis testing. The whole data set D is partitioned into three subsets: D(1), D(2) and D(3). A screening method is applied to D(1). After screening, only a subset of features survives, denoted as A1. Then modeling methods are applied to D(2), but only involving the features in A 1 . This modeling process may further select a subset of features from A1, denoted as A2. Thus, A2 ⊂ A1. For feature assessment, hypothesis testing is applied to the features in A2 using the data set D(3). The correction factor for multiple testing is calculated based on the size of A2. After feature assessment, the significant features are collected in A3 and A3 ⊂ A2. They are finally used for genetic mapping.

Back to article page