Skip to main content

Table 5 Confusion matrix of MAS and RMA clustering results. Clustering into k = 12 clusters was performed using k-means clustering, using correlation distance on 3000 probe sets. A cell at position (i, j) shows the number of samples assigned to cluster i on data pre-processed using MAS and to cluster j on data pre-processed using RMA. The Jaccard index between these two clusterings is 0.55.

From: The effect of oligonucleotide microarray data pre-processing on the analysis of patient-cohort studies

  

Number of samples in RMA clusters

  

1

2

3

4

5

6

7

8

9

10

11

12

Number of samples in MAS clusters

1

33

1

1

0

1

1

1

0

0

1

1

0

 

2

0

33

0

0

1

0

0

0

0

0

2

0

 

3

0

3

28

2

0

0

0

0

0

0

0

0

 

4

0

0

7

24

0

1

0

1

0

0

0

0

 

5

0

1

0

0

21

2

0

0

0

0

0

0

 

6

0

0

1

3

0

21

0

0

0

0

0

0

 

7

0

0

0

0

0

0

21

0

0

0

0

0

 

8

0

0

0

1

0

0

0

18

0

0

0

0

 

9

0

0

0

0

4

0

0

0

10

0

0

8

 

10

0

0

0

0

0

0

0

0

1

10

0

0

 

11

0

3

0

0

0

0

0

0

0

0

10

0

 

12

0

0

1

0

0

0

0

0

7

0

0

0