Figure 2

Cumulative distribution of p-values for the two-sided test with sample size n = 9. P-values calculated from random samples based on (dashed blue line) and (dashed green line) give reliable corrections, while the naïve p-value (dashed red line) overstates the significance of the test.