Characteristics of simulated data. The left plot shows the genewise population differences contrasted with the mean differences in simulated data. Population differences - were set for each gene by randomly drawing from N(0,1). Simulated differences stem from drawing data from a multivariate distribution with these given population means. The right plot shows boxplots of all 3000 genes for all 50 samples of the simulated data for the training set (the test set is very similar and not shown).