Skip to main content

Table 1 Statistics for the most abundant clades. The information for all 131 abundant clades is provided in Additional file 1: Table S1

From: Clustering analysis of proteins from microbial genomes at multiple levels of resolution

Clade Taxonomic content No. annotated No. nonclonal No. protein No. protein No. conservative
Id   genomes annotated genomes coding regions sequences inclade clusters
19668 Escherichia, Shigella 2277 929 3303114 310023 3894
19507 Acinetobacter 749 280 774670 133653 3034
19252 Helicobacter pylori 309 216 254806 191419 1244
20139 Enterococcus genus 242 155 306721 33249 2106
20104 Streptococcus genus 347 139 163066 61589 1394
20137 Enterococcus genus 300 139 309061 45809 2314
19669 Salmonella, Citrobacter 638 134 478093 112833 3940
19672 Enterobacter, Escherichia, Klebsiella 350 132 593750 84168 4726
19537 Pseudomonas 229 118 622138 100992 5511
21194 Vibrio 271 118 433416 150390 4015
19400 Neisseria genus 204 109 162808 29688 1596
19988 Staphylococcus aureus 3827 108 235562 43260 2309
20122 Streptococcus agalactiae 285 103 165898 17943 1704
19671 Enterobacter Lelliottia 80 70 229896 102783 3476
20021 Bacillus 101 70 250224 101171 3919
20103 Streptococcus suis 92 69 97200 48055 1541
19543 Pseudomonas 108 68 219354 114229 3551
19270 Campylobacter jejuni 97 63 85618 29112 1444
20116 Streptococcus mutans 165 62 100740 28671 1672
19993 Staphylococcus genus 92 59 114655 23197 2014