Skip to main content

Table 2 Clustering for Redundancy with the Sargasso Sea Sequences The number of sequences in each database at different redundancy levels. Sargasso comprised sections ead, eae, eaf, eag, eah, eai and eak of the Sargasso Sea resource, BactArch was a combination of bacterial and archaea sequences from the SWISS-PROT and TREMBL databases.

From: An analysis of the Sargasso Sea resource and the consequences for database composition

  100% 90% 80% 70% 60% 50%
Sargasso 780756 509450 394592 310768 245027 188241
BactArch 761237 535059 485811 434773 379386 318309