Table 3 Comparison of our heuristic to find high similarity pairs of proteins (HistoFasta) to the exact algorithm Needleman–Wunsch

From: GENPPI: standalone software for creating protein interaction networks from genomes

AA limit (-aadifflimit)   Check limit (-aacheckminlimit)   Number of similar proteins   Mean identity   Median identity   Min identity
0                         26                               336                          100.00          100.00            100.00
0                         25                               336                          100.00          100.00            100.00
0                         24                               360                          99.95           100.00            97.96
0                         23                               366                          99.91           100.00            96.94
0                         22                               368                          99.90           100.00            96.94
0                         21                               370                          99.87           100.00            94.68
0                         20                               372                          99.83           100.00            91.75
0                         19                               382                          99.60           100.00            85.57
1                         26                               360                          99.95           100.00            97.87
1                         25                               370                          99.84           100.00            92.55
1                         24                               390                          99.07           100.00            29.21
1                         23                               428                          96.21           100.00            29.21
1                         22                               500                          89.38           100.00            17.33
1                         21                               784                          71.68           97.70             17.33
1                         20                               2164                         52.27           39.60             17.33
1                         19                               6120                         43.26           36.36             15.00

  1. For the creation of the core pangenome, we need only the higher matches.
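
The identity columns in the table come from scoring heuristic hits against Needleman–Wunsch global alignments. As a rough illustration of how such a figure can be obtained, the following minimal sketch aligns two sequences and reports percent identity. It rests on assumptions that are not taken from the article: a simple match/mismatch/linear-gap scoring scheme (match=1, mismatch=-1, gap=-1) rather than a substitution matrix, and identity defined as matching positions divided by alignment length. The function names needleman_wunsch and percent_identity are hypothetical and do not belong to GENPPI.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-1):
    """Global alignment of a and b; returns the two gapped strings.

    Scoring values are illustrative assumptions, not the article's settings.
    """
    n, m = len(a), len(b)
    # score[i][j] = best score for aligning a[:i] with b[:j]
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + sub,  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,      # gap in b
                score[i][j - 1] + gap,      # gap in a
            )

    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            sub = match if a[i - 1] == b[j - 1] else mismatch
            if score[i][j] == score[i - 1][j - 1] + sub:
                out_a.append(a[i - 1])
                out_b.append(b[j - 1])
                i -= 1
                j -= 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a, b):
    """Percent of alignment columns holding identical residues (assumed definition)."""
    aligned_a, aligned_b = needleman_wunsch(a, b)
    if not aligned_a:
        return 0.0
    matches = sum(x == y and x != "-" for x, y in zip(aligned_a, aligned_b))
    return 100.0 * matches / len(aligned_a)


if __name__ == "__main__":
    print(percent_identity("MKVLA", "MKVLG"))
```

Computing this value for every pair returned by the heuristic, then taking the mean, median and minimum, would give summary statistics of the kind shown in the table. The full dynamic-programming matrix makes this quadratic in sequence length, which is the cost a heuristic pre-filter is meant to avoid.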