Skip to main content

Table 1 Detailed description of the three subsets of the COG database based on threshold number of members in each family

From: EnsembleFam: towards more accurate protein family prediction in the twilight zone

Name

Min no. of members

No. of families

No. of proteins

COG-500-1074

500

1074

1,129,428

COG-250-1796

250

1796

1,389,595

COG-100-2892

100

2892

1,565,976