
Table 2 Clustering results for Build_Fam, OrthoMCL and TribeMCL

From: Databases of homologous gene families for comparative genomics

 

| | Build_Fam | Build_Fam | OrthoMCL | OrthoMCL | TribeMCL | TribeMCL |
|---|---|---|---|---|---|---|
| Parameters | 50/80 | 40/80 | 1.5 | 4.0 | E-value | HSP |
| Nb. clustered seq. | 119222 | 144956 | 171129 | 169507 | 157993 | 186779 |
| % clustered seq. | 54% | 66% | 78% | 77% | 72% | 85% |
| Nb. families | 20706 | 17043 | 23966 | 31343 | 19608 | 19344 |
| Avg. seq./family | 5.76 | 8.51 | 7.14 | 5.41 | 8.06 | 9.66 |
| Families ≥ 1000 | 1 | 6 | 0 | 0 | 1 | 1 |
| Largest family | 1580 | 2642 | 479 | 281 | 1121 | 1185 |
| Families sp. = 1 | 10359 (50%) | 8050 (47%) | 7828 (33%) | 10134 (32%) | 8379 (43%) | 6735 (35%) |
| Families sp. = 50 | 13 (0.6‰) | 34 (2‰) | 27 (1.1‰) | 5 (0.2‰) | 19 (1‰) | 30 (1.6‰) |
| Families sp. ≥ 25 | 504 (2.4%) | 620 (3.6%) | 734 (3.1%) | 554 (1.8%) | 630 (3.2%) | 744 (3.9%) |
  1. The parameters correspond to the similarity/length combination for Build_Fam, the inflation parameter for OrthoMCL, and the two scores used for TribeMCL. The last three rows give the number and proportion (in percent or per mille) of families containing only one species (sp. = 1), 50 different species (sp. = 50), and at least 25 different species (sp. ≥ 25).
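
The derived rows of the table can be checked from the raw counts. The minimal Python sketch below assumes that "Avg. seq./family" is the number of clustered sequences divided by the number of families, and that the percentage in "Families sp. = 1" is taken relative to the number of families. The source does not state either assumption explicitly, but both reproduce the published figures. The dictionary layout, the keys, and the function name `summarize` are illustrative only.

```python
# Counts copied from Table 2; keys and structure are hypothetical,
# chosen only for this illustration.
table = {
    "Build_Fam 50/80":  {"clustered": 119222, "families": 20706, "single_species": 10359},
    "Build_Fam 40/80":  {"clustered": 144956, "families": 17043, "single_species": 8050},
    "OrthoMCL 1.5":     {"clustered": 171129, "families": 23966, "single_species": 7828},
    "OrthoMCL 4.0":     {"clustered": 169507, "families": 31343, "single_species": 10134},
    "TribeMCL E-value": {"clustered": 157993, "families": 19608, "single_species": 8379},
    "TribeMCL HSP":     {"clustered": 186779, "families": 19344, "single_species": 6735},
}


def summarize(row):
    """Return (average sequences per family, % of single-species families)."""
    # Assumed definition: clustered sequences divided by number of families.
    avg_per_family = round(row["clustered"] / row["families"], 2)
    # Assumed definition: single-species families as a share of all families.
    pct_single = round(100 * row["single_species"] / row["families"])
    return avg_per_family, pct_single


for name, row in table.items():
    avg, pct = summarize(row)
    print(f"{name}: {avg:.2f} seq./family, {pct}% single-species families")
```

Under these assumptions, the recomputed values agree with the "Avg. seq./family" and "Families sp. = 1" rows for all six parameter settings.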