Skip to main content

Table 1 Benchmark datasets

From: CFMDS: CUDA-based fast multidimensional scaling for genome-scale data

Dataset

Source

Number of Attributes

Number of Instances

Pearson's Median Skewness Coefficient

Coefficient of Variation

IRIS

UCI ML Repository

4

150

0.34

0.64

Dermatology

UCI ML Repository

33

366

-0.61

0.42

M. musculus Microarray

GEO

4,000

2,000

0.94

1.08

S. cerevisiae Microarray

GEO

1,000

9,300

0.73

0.56

MNIST

MNIST

784

10,000

-0.13

0.14

  1. UCI ML Repository is UCI Machine Learning Repository http://archive.ics.uci.edu/ml/datasets.html. GEO is Gene Expression Omnibus http://www.ncbi.nlm.nih.gov/geo/. MNIST is the MNIST Database of handwritten digits http://yann.lecun.com/exdb/mnist/. M. musculus Microarray is a modified dataset from Mus musculus microarrays in GEO and S. cerevisiae Microarray is a modified dataset from Saccharomyces cerevisiae microarrays in GEO. MNIST dataset is from scanned handwritten digit images of 28 × 28 pixels.