Table 2 Fraction of null hypotheses rejected by the Anderson-Darling tests for best fit to 7 distribution functions.

From: Validation and characterization of DNA microarray gene expression data distribution and associated moments

| Dataset | Probe set no. | Normal | Weibull | Extreme Value | Logistic | Lognormal | Log-logistic | At least one of the distributions not rejected |
|---|---|---|---|---|---|---|---|---|
| Craniofacial | 6215 | 0.72 | 0.79 | 0.82 | 0.69 | 0.75 | 0.71 | 0.46 |
| Liver | 6228 | 0.82 | 0.93 | 0.95 | 0.79 | 0.83 | 0.8 | 0.25 |
| Brain | 25146 | 0.77 | 0.92 | 0.93 | 0.69 | 0.77 | 0.68 | 0.35 |
| Housekeeping | 23 | 0.7 | 1 | 1 | 1 | 0.48 | 1 | 0.52 |
| Male | 19532 | 0.46 | NA | 0.96 | 0.24 | NA | NA | 0.82 |
| Female | 18915 | 0.43 | NA | 0.96 | 0.21 | NA | NA | 0.85 |

  1. Each fraction is the number of probe sets for which the given null hypothesis was rejected, divided by the number of probe sets analyzed (second column). The probe sets counted in the second column are those assumed to be unaffected by the conditions under which the samples in each of the six data sets ("Craniofacial", "Liver", "Brain", "Housekeeping", "Male" and "Female") were generated. For the first three data sets, unaffected probe sets were identified using the Kruskal-Wallis test, as described in the Methods section. The "Housekeeping" data set had 6219 samples, and the 23 probe sets analyzed correspond to the so-called housekeeping genes [47], which are thought to be essential for cell survival under most conditions. The probe sets for the "Male" and "Female" data sets were identified using the procedure detailed in the Methods section. Some of the log-ratio data for the "Male" and "Female" data sets are negative and therefore cannot be tested for goodness of fit to distributions that are defined only for positive values; the results for these distributions are listed as "NA".
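The way a table entry of this kind is formed can be illustrated with a minimal Python sketch. It is not the authors' code. It assumes a 5% significance level (the note does not state the level used), uses the standard modified Anderson-Darling statistic for normality with estimated mean and variance, and treats the lognormal test as a normality test on the log-transformed values. The function names `ad_normal_statistic`, `rejects_normal`, `rejects_lognormal` and `rejection_fraction` are hypothetical.

```python
import math
from statistics import NormalDist, mean, stdev

# 5% critical value for the modified A^2 statistic when the mean and variance
# are estimated from the data (D'Agostino & Stephens, 1986).
# ASSUMPTION: the significance level is illustrative, not taken from the paper.
CRITICAL_5PCT = 0.752


def ad_normal_statistic(values):
    """Modified Anderson-Darling statistic for normality (parameters estimated)."""
    x = sorted(values)
    n = len(x)
    dist = NormalDist(mean(x), stdev(x))
    eps = 1e-12  # keep CDF values away from 0 and 1 so the logs stay finite
    cdf = [min(max(dist.cdf(v), eps), 1 - eps) for v in x]
    s = sum(
        (2 * i + 1) * (math.log(cdf[i]) + math.log(1 - cdf[n - 1 - i]))
        for i in range(n)
    )
    a2 = -n - s / n
    # Small-sample correction for the case of estimated mean and variance.
    return a2 * (1 + 0.75 / n + 2.25 / n ** 2)


def rejects_normal(values):
    """True if normality is rejected at the assumed 5% level."""
    return ad_normal_statistic(values) > CRITICAL_5PCT


def rejects_lognormal(values):
    """Lognormal fit tested as normality of the logs (needs positive data)."""
    return rejects_normal([math.log(v) for v in values])


def rejection_fraction(probe_sets, rejects, requires_positive=False):
    """Fraction of probe sets rejected by `rejects`, or None (reported as "NA")
    when the distribution needs positive data and some value is not positive."""
    if requires_positive and any(v <= 0 for row in probe_sets for v in row):
        return None
    return sum(1 for row in probe_sets if rejects(row)) / len(probe_sets)
```

Here `probe_sets` is a list of rows, one row of expression values per probe set. Calling `rejection_fraction(rows, rejects_lognormal, requires_positive=True)` on log-ratio data that contain negative values returns `None`, which corresponds to an "NA" entry. A library routine such as SciPy's `scipy.stats.anderson` could replace the hand-written statistic for the distributions it supports.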