Table 1 The properties of the datasets obtained from the UCI repository

From: A voting-based machine learning approach for classifying biological and clinical datasets

Dataset name   #instances   #features   #classes   Data type              Missing values
LIV            345          6           2          Numerical and binary   NO
PID            768          8           2          Numerical              NO
SHD            270          13          2          Numerical and binary   NO
CHD2           303          13          2          Numerical and binary   NO
CHD5           303          13          5          Numerical and binary   NO
HEP            150          19          2          Numerical and binary   YES
PAR            197          22          2          Real                   YES
WDBC           569          31          2          Real                   NO
LUNG           32           56          3          Numerical and binary   YES
ARRYTM         452          279         16         Double                 YES
PARKINSON      756          754         2          Numerical and binary   NO
ARCENE         900          10,000      2          Numerical              NO
GENEEXPR       801          20,531      5          Double                 NO
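
The four countable properties in Table 1 (instances, features, classes, missing values) can be derived from any dataset once it is loaded as rows of text cells. The following is a minimal sketch rather than the authors' procedure. The function name `summarize`, the set of missing-value markers, the rule that every non-label column counts as a feature, and the toy rows are all assumptions made for illustration.

```python
from typing import Sequence

# Assumed markers for a missing cell; real files may use different ones.
MISSING_TOKENS = {"", "?", "NA", "NaN"}


def summarize(rows: Sequence[Sequence[str]], label_index: int = -1) -> dict:
    """Compute Table 1 style properties for one dataset (hypothetical helper)."""
    if not rows:
        raise ValueError("dataset is empty")
    n_columns = len(rows[0])
    # Distinct values in the label column give the number of classes.
    labels = {row[label_index] for row in rows}
    # A dataset has missing values if any cell matches an assumed marker.
    has_missing = any(
        cell.strip() in MISSING_TOKENS for row in rows for cell in row
    )
    return {
        "instances": len(rows),
        "features": n_columns - 1,  # assumption: all non-label columns count
        "classes": len(labels),
        "missing_values": "YES" if has_missing else "NO",
    }


# Toy rows invented for this example; they are not taken from any UCI dataset.
toy = [
    ["5.1", "0", "?", "sick"],
    ["4.7", "1", "2.0", "healthy"],
    ["6.0", "1", "1.5", "sick"],
]
summary = summarize(toy)
print(summary)
```

Feature counts produced this way depend on which columns are treated as features (for example, whether an identifier column is included), so they may not match a published count exactly.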