Skip to main content

Table 1 Data set description

From: Clustering cancer gene expression data: a comparative study

Dataset Chip Tissue n #C Dist. Classes m d
Armstrong-V1 [52] Affy Blood 72 2 24,48 12582 1081
Armstrong-V2 [52] Affy Blood 72 3 24,20,28 12582 2194
Bhattacharjee [9] Affy Lung 203 5 139,17,6,21,20 12600 1543
Chowdary [13] Affy Breast, Colon 104 2 62,42 22283 182
Dyrskjot [14] Affy Bladder 40 3 9,20,11 7129 1203
Golub-V1 [3] Affy Bone marrow 72 2 47,25 7129 1877
Golub-V2 [3] Affy Bone marrow 72 3 38,9,25 7129 1877
Gordon [53] Affy Lung 181 2 31,150 12533 1626
Laiho [15] Affy Colon 37 2 8,29 22883 2202
Nutt-V1 [54] Affy Brain 50 4 14,7,14,15 12625 1377
Nutt-V2 [54] Affy Brain 28 2 14,14 12625 1070
Nutt-V3 [54] Affy Brain 22 2 7,15 12625 1152
Pomeroy-V1 [55] Affy Brain 34 2 25,9 7129 857
Pomeroy-V2 [55] Affy Brain 42 5 10,10,10,4,8 7129 1379
Ramaswamy [50] Affy Multi-tissue 190 14 11,10,11,11,22,10,11,10,30,11,11,11,11,20 16063 1363
Shipp [56] Affy Blood 77 2 58,19 7129 798
Singh [19] Affy Prostate 102 2 58,19 12600 339
Su [57] Affy Multi-tissue 174 10 26,8,26,23,12,11,7,27,6,28 12533 1571
West [58] Affy Breast 49 2 25,24 7129 1198
Yeoh-V1 [20] Affy Bone marrow 248 2 43,205 12625 2526
Yeoh-V2 [20] Affy Bone marrow 248 6 15,27,64,20,79,43 12625 2526
Alizadeh-V1 [4] cDNA Blood 42 2 21,21 4022 1095
Alizadeh-V2 [4] cDNA Blood 62 3 42,9,11 4022 2093
Alizadeh-V3 [4] cDNA Blood 62 4 21,21,9,11 4022 2093
Bittner [10] cDNA Skin 38 2 19, 19 8067 2201
Bredel [11] cDNA Brain 50 3 31,14,5 41472 1739
Chen [12] cDNA Liver 180 2 104,76 22699 85
Garber [59] cDNA Lung 66 4 17,40,4,5 24192 4553
Khan [60] cDNA Multi-tissue 83 4 29,11,18,25 6567 1069
Lapointe-V1 [16] cDNA Prostate 69 3 11,39,19 42640 1625
Lapoint-V2 [16] cDNA Prostate 110 4 11,39,19,41 42640 2496
Liang [17] cDNA Brain 37 3 28,6,3 24192 1411
Risinger [18] cDNA Endometrium 42 4 13,3,19,7 8872 1771
Tomlins-V1 [61] cDNA Prostate 104 5 27,20,32,13,12 20000 2315
Tomlins-V2 [61] cDNA Prostate 92 4 27,20,32,13 20000 1288
  1. The data sets present different values for features such as type of microarray chip (second column), tissue type (third column), number of samples (fourth column), number of classes (fifth column), distribution of samples within the classes (sixth column), dimensionality (seventh column) and dimensionality after feature selection (last column).