Skip to main content

Table 2 Clustering for Redundancy with the Sargasso Sea Sequences The number of sequences in each database at different redundancy levels. Sargasso comprised sections ead, eae, eaf, eag, eah, eai and eak of the Sargasso Sea resource, BactArch was a combination of bacterial and archaea sequences from the SWISS-PROT and TREMBL databases.

From: An analysis of the Sargasso Sea resource and the consequences for database composition

 

100%

90%

80%

70%

60%

50%

Sargasso

780756

509450

394592

310768

245027

188241

BactArch

761237

535059

485811

434773

379386

318309