Figure 2From: Sifting through genomes with iterative-sequence clustering produces a large, phylogenetically diverse protein-family resourceSFam Size Distribution. The distribution of SFam size, measured as the number of sequences belonging to each SFam, is illustrated in this histogram. The x-axis represents the log SFam size while the y-axis represents the number of SFams of a given size.Back to article page