Skip to main content

Table 3 Table showing stability results produced on a real dataset of sample size 16. Table 3 shows stability scores produced on a given dataset of a sample size of n = 16. We split the dataset into two halves each containing 8 subjects. The left dataset is resampled 6 times producing 6 samples of sample sizes 3 to 8, respectively. Similarly the right dataset is resampled to produce 6 samples. We measured the strength of the association between the clusters produced on every pair of samples (one sample from left and other from right dataset both of same sample size) using Cramer's v2. Columns in the table represent number of clusters (k) and rows represent sample sizes. Stability score quantified for k = 10 and sample size 8 is 0.3699. This table shows there is 37% agreement between the clusters produced (k = 10) on pair of samples (a sample from left dataset and other from right dataset both of sample size 8).

From: Reproducible Clusters from Microarray Research: Whither?

K (CLUSTERS)
   2 3 4 5 6 7 8 9 10
S A M P L E S I Z E 3 0.5883 0.47091 0.4503 0.4028 0.3809 0.3600 0.3313 0.3107 0.2992
  4 0.5799 0.48045 0.4244 0.3894 0.365 0.3469 0.3132 0.297 0.2858
  5 0.5738 0.48296 0.4297 0.3982 0.3644 0.3430 0.3195 0.3013 0.2790
  6 0.6433 0.54638 0.5142 0.4727 0.4405 0.4066 0.3817 0.3616 0.3396
  7 0.6534 0.54821 0.5250 0.4826 0.4462 0.4211 0.3915 0.3679 0.348
  8 0.6759 0.58447 0.5520 0.5045 0.4700 0.4592 0.4160 0.3975 0.3699