Skip to main content

Table 3 Datasets used in the experiments

From: Surveying alignment-free features for Ortholog detection in related yeast proteomes by using supervised big data classifiers

Dataset id

Proteome pair

Number of protein features

Protein pair per class (non-orthologs; orthologs)

Imbalance ratio (IR)

ScerKlac

S. cerevisiae - K. lactis

29

(31,218,485; 3062)

10,195.456

ScerCgla

S. cerevisiae - C. glabrata

29

(30,562,272; 2843)

10,750.008

CglaKlac

C. glabrata - K. lactis

29

(27,778,732; 1573)

17,659.715

KlacKwal

K. lactis - K. waltii

29

(27,772,372; 2606)

10,657.088