Skip to main content

Table 1 Description of datasets

From: How to balance the bioinformatics data: pseudo-negative sampling

Dataset

Positive

Negative

Attributes

Ratio

CMC

333

1140

9

3.4

Haberman

81

225

3

2.7

Solar Flare

69

1320

10

19.1

Oil

41

896

49

21.9

PDNA-543

9549

134995

180

14.1

PDNA-316

5609

67109

180

11.9

SNP

183

2891

25

15.7