Skip to main content

Table 1 Effect of the presence of DEG when applying the classic permutation strategy to the PLGEM-STN statistic.

From: A power law global error model for the identification of differentially expressed genes in microarray data

  FPR vs. significance level estimated vs. observed FDR
  including DEG excluding DEG including DEG excluding DEG
# of genes in data set slope intercept adj. R^2 slope intercept adj. R^2 slope intercept adj. R^2 slope intercept adj. R^2
22300 1.690 3.070 0.856 0.871 -0.114 0.938 0.187 -0.383 0.314 1.090 -0.692 0.826
10000 1.710 2.470 0.881 0.888 -0.083 0.931 0.224 -0.247 0.433 1.040 -0.555 0.824
5000 1.460 1.270 0.880 0.877 -0.147 0.934 0.093 -0.228 0.067 1.080 -0.425 0.836
2500 1.590 1.180 0.888 0.864 -0.155 0.939 0.092 -0.176 0.135 1.110 -0.348 0.853
1500 1.670 0.935 0.908 0.876 -0.166 0.948 0.078 -0.122 0.170 1.130 -0.182 0.880
1000 1.720 0.689 0.874 0.944 0.002 0.956 0.038 -0.122 0.082 0.991 -0.226 0.897
500 1.900 0.263 0.864 0.857 -0.255 0.956 0.062 -0.030 0.307 1.160 0.038 0.909
200 2.490 -0.059 0.875 0.905 -0.378 0.946 0.064 -0.012 0.426 1.050 0.233 0.914
  1. Eight data sets with different percentages of DEG were constructed from the Latin Square data set by keeping the 62 known spiked-in probe sets, but randomly removing increasing amounts of the remaining probe sets reaching the total indicated in the first column. The null distributions of the PLGEM-STN statistics were evaluated through the classic permutation strategy either including or excluding DEG. A wide range of significance levels was used to select DEG and correlation between the FPR and the significance level or between estimated and observed FDR was evaluated through linear regressions in log-log plots. Table reports slopes, intercepts and adjusted r2 of linear models. See text for details on estimation of FDR.