Table 5 Confusion matrix of MAS and RMA clustering results. Samples were partitioned into k = 12 clusters by k-means clustering with correlation distance on 3000 probe sets. The cell at position (i, j) gives the number of samples assigned to cluster i on data pre-processed using MAS and to cluster j on data pre-processed using RMA. The Jaccard index between these two clusterings is 0.55.

From: The effect of oligonucleotide microarray data pre-processing on the analysis of patient-cohort studies

Rows: number of samples in MAS clusters. Columns: number of samples in RMA clusters.

MAS \ RMA    1    2    3    4    5    6    7    8    9   10   11   12
1           33    1    1    0    1    1    1    0    0    1    1    0
2            0   33    0    0    1    0    0    0    0    0    2    0
3            0    3   28    2    0    0    0    0    0    0    0    0
4            0    0    7   24    0    1    0    1    0    0    0    0
5            0    1    0    0   21    2    0    0    0    0    0    0
6            0    0    1    3    0   21    0    0    0    0    0    0
7            0    0    0    0    0    0   21    0    0    0    0    0
8            0    0    0    1    0    0    0   18    0    0    0    0
9            0    0    0    0    4    0    0    0   10    0    0    8
10           0    0    0    0    0    0    0    0    1   10    0    0
11           0    3    0    0    0    0    0    0    0    0   10    0
12           0    0    1    0    0    0    0    0    7    0    0    0
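
The Jaccard index quoted in the caption can be recomputed from the cell counts alone. The sketch below assumes the pair-counting definition (pairs of samples placed together by both clusterings, divided by pairs placed together by at least one of them); the caption does not say which definition was used, so this is an assumption, and the names TABLE and pair_jaccard are illustrative only. On the counts as transcribed here it evaluates to roughly 0.55.

```python
from math import comb

# Counts transcribed from Table 5.
# Rows are MAS clusters 1-12, columns are RMA clusters 1-12.
TABLE = [
    [33,  1,  1,  0,  1,  1,  1,  0,  0,  1,  1,  0],
    [ 0, 33,  0,  0,  1,  0,  0,  0,  0,  0,  2,  0],
    [ 0,  3, 28,  2,  0,  0,  0,  0,  0,  0,  0,  0],
    [ 0,  0,  7, 24,  0,  1,  0,  1,  0,  0,  0,  0],
    [ 0,  1,  0,  0, 21,  2,  0,  0,  0,  0,  0,  0],
    [ 0,  0,  1,  3,  0, 21,  0,  0,  0,  0,  0,  0],
    [ 0,  0,  0,  0,  0,  0, 21,  0,  0,  0,  0,  0],
    [ 0,  0,  0,  1,  0,  0,  0, 18,  0,  0,  0,  0],
    [ 0,  0,  0,  0,  4,  0,  0,  0, 10,  0,  0,  8],
    [ 0,  0,  0,  0,  0,  0,  0,  0,  1, 10,  0,  0],
    [ 0,  3,  0,  0,  0,  0,  0,  0,  0,  0, 10,  0],
    [ 0,  0,  1,  0,  0,  0,  0,  0,  7,  0,  0,  0],
]


def pair_jaccard(table):
    """Pair-counting Jaccard index of two partitions, given their contingency table."""
    # Pairs of samples that share a cluster in both partitions.
    both = sum(comb(n, 2) for row in table for n in row)
    # Pairs that share a cluster in the row partition (MAS here).
    same_row = sum(comb(sum(row), 2) for row in table)
    # Pairs that share a cluster in the column partition (RMA here).
    same_col = sum(comb(sum(col), 2) for col in zip(*table))
    # Inclusion-exclusion gives pairs co-clustered by at least one partition.
    return both / (same_row + same_col - both)


print(sum(map(sum, TABLE)))          # total number of samples: 285
print(f"{pair_jaccard(TABLE):.2f}")  # 0.55
```

A value of 1 would mean the two pre-processing methods lead to identical partitions; the off-diagonal counts, such as the samples split between MAS cluster 9 and RMA clusters 9 and 12, are what pull the index down.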