Table 1 Protein Sequences Data Sets

From: SEQOPTICS: a protein sequence clustering system

| Data set | 1 | 2 | 3 | 4 |
|---|---|---|---|---|
| From | Pfam (197) | Pfam (268) | NCBI (319) | Swiss-Prot (295) |
| Families | cytoB(75) | bac_globin(51) | cytoC(86) | GAPDH(122) |
| | GABAR(54) | IGA1(98) | GABAR(44) | casein kappa(62) |
| | bac_globin(51) | band3(119) | GAPDH(47) | globin(111) |
| | glucokinase(17) | | GFAT(78) | |
Note: The number in parentheses is the number of sequences in each family.