Skip to main content

Table 4 Summary of different PPI datasets for Homo sapiens, Saccharomyces cerevisiae, Escherichia coli, and Drosophila melanogaster. (a) the numbers of coincident proteins and (b) the numbers of coincident interacting and non-interacting protein pairs (Pos and Neg, respectively) in the datasets

From: Protein-protein interaction prediction using a hybrid feature representation and a stacked generalization scheme

(a)

Protein

HS2

HS3

HS4

HS5

  

HS1

4513(11959a)

971(9983)

2460 (10275)

2699 (12777)

  

HS2

–

1043 (7505)

2272 (8057)

2492 (10578)

  

HS3

–

–

620 (4191)

616 (6936)

  

HS4

–

–

–

1472 (7861)

  

Protein

SC2

SC3

SC4

SC5

SC6

 

SC1

1759 (3777)

2078 (2319)

2088 (5593)

0 (2699)

1979 (4690)

 

SC2

–

1762 (3681)

3187 (5540)

0 (3745)

2622 (5093)

 

SC3

–

–

2074 (5514)

0 (2606)

2001 (4574)

 

SC4

–

–

–

0 (5890)

3612 (6248)

 

SC5

–

–

–

–

0 (4878)

 

Protein

EC2

     

EC1

469 (1954)

     

Protein

DM2

     

DM1

295 (7422)

     

(b)

Pos

HS1

HS2

HS3

HS4

HS5

 

Neg

      

HS1

–

8388 (53357)

2282 (46989)

1626 (38891)

514 (37604)

 

HS2

87 (214057b)

 –

2742 (34220)

1505 (26703)

451 (25363)

 

HS3

5 (49266)

59 (189302)

–

463 (15271)

194 (13141)

 

HS4

4 (40513)

15 (180592)

2 (15732)

–

272 (4309)

 

HS5

0 (40454)

5 (180539)

1 (15670)

0 (6917)

–

 

Pos

SC1

SC2

SC3

SC4

SC5

SC6

Neg

      

SC1

–

1985 (17236)

3587 (4213)

3372 (5113)

0 (4456)

3526 (17687)

SC2

4 (19190)

–

2073 (17009)

2534 (17233)

0 (15738)

4479 (28016)

SC3

10 (7790)

8 (19074)

–

3532 (4841)

0 (4344)

3728 (17373)

SC4

4 (14783)

12 (26057)

3 (14672)

–

0 (5029)

3602 (18184)

SC5

0 (4456)

0 (15738)

0 (4344)

0 (11331)

 –

0 (17757)

SC6

43 (52507)

76 (63756)

28 (52410)

42 (59383)

0 (49094)

–

Pos

EC1

EC2

    

Neg

      

EC1

–

384 (7737)

    

EC2

3 (8118)

–

    

Pos

DM1

DM2

    

Neg

      

DM1

–

15 (22281)

    

DM2

0 (22296)

–

    
  1. HS Homo sapiens, SC Saccharomyces cerevisiae, EC Escherichia coli, DM Drosophila melanogaster aNumbers in parentheses are the total numbers of non-duplicated proteins in the two datasets, e.g. HS1 and HS2
  2. bNumbers in parentheses are the total numbers of non-duplicated protein pairs in the two datasets, e.g. HS1 and HS2