
Table 2 The overall number of genes selected by OAA and ECOC classifiers

From: Multiclass classification of microarray data samples with a reduced number of genes

        

| Dataset | M | N | B-ECOC | B-OAA | G-ECOC (F) | G-OAA (G) | p-value (a): KS, F ≠ G | p-value (a): KS, F < G | p-value (a): MW |
|---|---|---|---|---|---|---|---|---|---|
| 200 Montecarlo 4:1 train-test partitions at η = 5 | | | | | | | | | |
| Lymphoma | 3 | NA | NA | 4 | NA | 22 | NA | NA | NA |
| SRCBT | 4 | 9 | 14.22 | 6 | 37 | 23 | < 2.2e-16 | < 2.2e-16 | 1 |
| Brain | 5 | 9 | 28.1 | 19 | 177 | 109.5 | 5.08e-05 | 2.54e-05 | 0.99975 |
| NCI60 | 8 | 9 | 45.11 | 34 | 310 | 326 | 9.31e-07 | 0.27804 | 0.07651 |
| Staunton | 9 | 12 | 46 | 34.11 | 387 | 296 | 9.91e-08 | 4.95e-08 | 0.99993 |
| GCM RM | 11 | 11 | 142 | 36 | 800 | 365.5 | < 2.2e-16 | 2.76e-08 | 1 |
| Su | 11 | 13 | 126 | 62 | 1056 | 916 | 5.36e-12 | 1.15e-24 | 0.99978 |
| GCM | 14 | 12 | 322 | 128 | 2096 | 1406 | < 2.2e-16 | < 2.2e-16 | 1 |
| 200 Montecarlo 4:1 train-test partitions at η = 10 | | | | | | | | | |
| Lymphoma | 3 | 11 | 4.27 | 4 | 12 | 22 | 5.52e-08 | 1 | 9.85e-09 |
| SRCBT | 4 | 9 | 12.22 | 6 | 33 | 23 | < 2.2e-16 | < 2.2e-16 | 1 |
| Brain | 5 | 15 | 16.16 | 19 | 109.5 | 109.5 | 0.03970 | 0.01984 | 0.54495 |
| NCI60 | 8 | 14 | 42.12 | 39 | 286.5 | 326 | 9.31e-07 | 0.95599 | 0.00105 |
| Staunton | 9 | 19 | 40.03 | 34.11 | 381.5 | 296 | 6.95e-10 | 3.48e-10 | 0.99997 |
| GCM RM | 11 | 12 | 72 | 36 | 570 | 365.5 | < 2.2e-16 | 1.66e-19 | 1 |
| Su | 11 | 17 | 112 | 62 | 940 | 916 | 1.82e-10 | 9.11e-11 | 0.98387 |
| GCM | 14 | 12 | 322 | 128 | 2078 | 1406 | < 2.2e-16 | < 2.2e-16 | 1 |
| 200 Montecarlo 4:1 train-test partitions at η = 15 | | | | | | | | | |
| Lymphoma | 3 | 11 | 4.26 | 4 | 12 | 22 | 3.05e-08 | 1 | 3.85e-09 |
| SRCBT | 4 | 9 | 12.22 | 6 | 33 | 23 | < 2.2e-16 | < 2.2e-16 | 1 |
| Brain | 5 | 18 | 16.06 | 19 | 105 | 109.5 | 0.03970 | 0.01984 | 0.15586 |
| NCI60 | 8 | 16 | 36.15 | 39 | 251 | 326 | 9.31e-07 | 1 | 3.23e-05 |
| Staunton | 9 | 19 | 34.09 | 34.11 | 373.5 | 296 | 4.81e-09 | 2.41e-09 | 0.99989 |
| GCM RM | 11 | 12 | 72 | 36 | 561 | 365.5 | < 2.2e-16 | 1.66e-19 | 1 |
| Su | 11 | 17 | 112 | 62 | 924.5 | 916 | 1.34e-09 | 6.69e-10 | 0.97006 |
| GCM | 14 | 12 | 322 | 128 | 2066 | 1406 | < 2.2e-16 | < 2.2e-16 | 1 |

  1. The number of genes selected by OAA and ECOC classifiers of size at most ⌈η·log₂ M⌉ under bounded optimum S2N gene selection over 200 Montecarlo 4:1 train-test partitions. M and N respectively denote the median number of binary classifiers in OAA and ECOC classifiers. B-ECOC and B-OAA respectively denote the median number of genes per binary SVM in ECOC and OAA classifiers. G-ECOC and G-OAA respectively denote the median overall number of genes selected by ECOC and OAA classifiers. For the purposes of the KS tests, G-ECOC and G-OAA are denoted F and G, respectively.
  2. (a) p-values of two-sided KS tests, one-sided KS tests and one-sided MW tests. The alternative hypothesis of the two-sided KS tests is "the number of genes selected by ECOC classifiers (F) is different from that of OAA classifiers (G)", i.e., the relationship between the corresponding CDFs is F ≠ G. The alternative hypothesis of the one-sided KS tests is "the number of genes selected by ECOC classifiers (F) is greater than that of OAA classifiers (G)", i.e., the relationship between the corresponding CDFs is F < G. The alternative hypothesis of the one-sided MW tests is "the median number of genes selected by ECOC classifiers is less than that of OAA classifiers".
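
The two quantities defined in the notes above can be illustrated with a minimal sketch. It assumes nothing beyond the definitions given there: the size bound ⌈η·log₂ M⌉, and the empirical CDFs F and G compared by the KS tests. The function names and the two sample lists are hypothetical and are not taken from the article. The sketch computes only KS statistics, not p-values. It shows why "F < G" for CDFs corresponds to ECOC selecting more genes: a sample with larger values has a CDF that lies lower.

```python
import math
from bisect import bisect_right


def ecoc_size_bound(eta, num_classes):
    """Size bound ceil(eta * log2(M)) from note 1 (hypothetical helper name)."""
    return math.ceil(eta * math.log2(num_classes))


def ecdf(sample):
    """Return the empirical CDF of `sample` as a function x -> P(X <= x)."""
    ordered = sorted(sample)
    n = len(ordered)
    return lambda x: bisect_right(ordered, x) / n


def ks_statistics(f_sample, g_sample):
    """Return (d_two_sided, d_less) for the empirical CDFs F and G.

    d_two_sided = max_x |F(x) - G(x)| is the statistic for the alternative F != G.
    d_less = max_x (G(x) - F(x)) is the statistic for the alternative F < G;
    it is large when the F sample tends to take larger values than the G sample.
    """
    F, G = ecdf(f_sample), ecdf(g_sample)
    # The empirical CDFs only change at observed values, so checking those suffices.
    points = sorted(set(f_sample) | set(g_sample))
    diffs = [F(x) - G(x) for x in points]
    d_two_sided = max(abs(d) for d in diffs)
    d_less = max(0.0, -min(diffs))
    return d_two_sided, d_less


# Size bounds for a few class counts M from the table, at eta = 5.
for m in (3, 5, 9, 11, 14):
    print(f"M = {m:2d}: bound = {ecoc_size_bound(5, m)}")

# Made-up gene counts per partition (NOT data from the article).
ecoc_genes = [30, 33, 35, 37, 40]
oaa_genes = [20, 22, 23, 23, 25]
print(ks_statistics(ecoc_genes, oaa_genes))
```

In this made-up example every ECOC count exceeds every OAA count, so F stays at 0 until G has already reached 1 and both statistics take their maximum value. Swapping the two arguments leaves the two-sided statistic unchanged and drives the one-sided statistic to zero.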