Table 8 Datasets

From: Comparative evaluation of set-level techniques in predictive classification of gene expression samples

| Dataset | Genes | Class 1 | Class 2 | Source | Reference |
|---|---|---|---|---|---|
| Adenocarcinoma | 14023 | 8 | 29 | GDS2201 | [31] |
| ALL/AML | 10056 | 24 | 24 | Broad institute | [32] |
| Brain/muscle | 13380 | 41 | 20 | - | [4] |
| Breast tumors | 14023 | 16 | 27 | GDS1329 | [33] |
| Clear cell sarcoma | 14023 | 18 | 14 | GDS1282 | [34] |
| Colitis and Crohn 1 | 14902 | 42 | 26 | GDS1615 | [35] |
| Colitis and Crohn 2 | 14902 | 42 | 59 | GDS1615 | [35] |
| Colitis and Crohn 3 | 14902 | 26 | 59 | GDS1615 | [35] |
| Diabetes | 13380 | 17 | 17 | Broad institute | [5] |
| Heme/stroma | 13380 | 18 | 33 | - | [4] |
| Gastric cancer | 5664 | 8 | 22 | GDS1210 | [36] |
| Gender | 15056 | 15 | 17 | Broad institute | [1] |
| Gliomas | 14902 | 26 | 59 | GDS1975 | [37] |
| Gliomas 2 | 31835 | 23 | 81 | GDS1962 | [38] |
| Lung cancer Boston | 5217 | 31 | 31 | Broad institute | [39] |
| Lung cancer Michigan | 5217 | 24 | 62 | Broad institute | [40] |
| Lung cancer - smokers | 14023 | 90 | 97 | GDS2771 | [41] |
| Melanoma | 14902 | 18 | 45 | GDS1375 | [42] |
| p53 | 10101 | 33 | 17 | Broad institute | [1] |
| Parkinson 1 | 14902 | 22 | 33 | GDS2519 | [43] |
| Parkinson 2 | 14902 | 22 | 50 | GDS2519 | [43] |
| Parkinson 3 | 14902 | 33 | 50 | GDS2519 | [43] |
| Pheochromocytoma | 14023 | 38 | 37 | GDS2113 | [44] |
| Pleural mesothelioma | 14902 | 10 | 44 | GDS1220 | [45] |
| Pollution | 37804 | 88 | 41 | - | [46] |
| Prostate cancer | 14023 | 18 | 45 | GDS1390 | [47] |
| Sarcoma and hypoxia | 14902 | 15 | 39 | GDS1209 | [48] |
| Smoking | 5664 | 18 | 26 | GDS2489 | [49] |
| Squamous-cell carcinoma | 9460 | 22 | 22 | GDS2520 | [50] |
| Testicular seminoma | 9460 | 22 | 14 | GDS2842 | [51] |

  1. Number of genes interrogated and number of samples in each of the two classes of each dataset.