Skip to main content

Table 1 Data set description

From: Clustering cancer gene expression data: a comparative study

Dataset

Chip

Tissue

n

#C

Dist. Classes

m

d

Armstrong-V1 [52]

Affy

Blood

72

2

24,48

12582

1081

Armstrong-V2 [52]

Affy

Blood

72

3

24,20,28

12582

2194

Bhattacharjee [9]

Affy

Lung

203

5

139,17,6,21,20

12600

1543

Chowdary [13]

Affy

Breast, Colon

104

2

62,42

22283

182

Dyrskjot [14]

Affy

Bladder

40

3

9,20,11

7129

1203

Golub-V1 [3]

Affy

Bone marrow

72

2

47,25

7129

1877

Golub-V2 [3]

Affy

Bone marrow

72

3

38,9,25

7129

1877

Gordon [53]

Affy

Lung

181

2

31,150

12533

1626

Laiho [15]

Affy

Colon

37

2

8,29

22883

2202

Nutt-V1 [54]

Affy

Brain

50

4

14,7,14,15

12625

1377

Nutt-V2 [54]

Affy

Brain

28

2

14,14

12625

1070

Nutt-V3 [54]

Affy

Brain

22

2

7,15

12625

1152

Pomeroy-V1 [55]

Affy

Brain

34

2

25,9

7129

857

Pomeroy-V2 [55]

Affy

Brain

42

5

10,10,10,4,8

7129

1379

Ramaswamy [50]

Affy

Multi-tissue

190

14

11,10,11,11,22,10,11,10,30,11,11,11,11,20

16063

1363

Shipp [56]

Affy

Blood

77

2

58,19

7129

798

Singh [19]

Affy

Prostate

102

2

58,19

12600

339

Su [57]

Affy

Multi-tissue

174

10

26,8,26,23,12,11,7,27,6,28

12533

1571

West [58]

Affy

Breast

49

2

25,24

7129

1198

Yeoh-V1 [20]

Affy

Bone marrow

248

2

43,205

12625

2526

Yeoh-V2 [20]

Affy

Bone marrow

248

6

15,27,64,20,79,43

12625

2526

Alizadeh-V1 [4]

cDNA

Blood

42

2

21,21

4022

1095

Alizadeh-V2 [4]

cDNA

Blood

62

3

42,9,11

4022

2093

Alizadeh-V3 [4]

cDNA

Blood

62

4

21,21,9,11

4022

2093

Bittner [10]

cDNA

Skin

38

2

19, 19

8067

2201

Bredel [11]

cDNA

Brain

50

3

31,14,5

41472

1739

Chen [12]

cDNA

Liver

180

2

104,76

22699

85

Garber [59]

cDNA

Lung

66

4

17,40,4,5

24192

4553

Khan [60]

cDNA

Multi-tissue

83

4

29,11,18,25

6567

1069

Lapointe-V1 [16]

cDNA

Prostate

69

3

11,39,19

42640

1625

Lapoint-V2 [16]

cDNA

Prostate

110

4

11,39,19,41

42640

2496

Liang [17]

cDNA

Brain

37

3

28,6,3

24192

1411

Risinger [18]

cDNA

Endometrium

42

4

13,3,19,7

8872

1771

Tomlins-V1 [61]

cDNA

Prostate

104

5

27,20,32,13,12

20000

2315

Tomlins-V2 [61]

cDNA

Prostate

92

4

27,20,32,13

20000

1288

  1. The data sets present different values for features such as type of microarray chip (second column), tissue type (third column), number of samples (fourth column), number of classes (fifth column), distribution of samples within the classes (sixth column), dimensionality (seventh column) and dimensionality after feature selection (last column).