Table 1 Annotating the CAFA set with BAR+

From: How to inherit statistically validated annotation within BAR+ protein clusters

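| | Cov | MFO OR BPO | MFO | BPO | CCO | ALL-O | Pfam | ALL-O OR Pfam | PDB° |
|---|---|---|---|---|---|---|---|---|---|
| Eukaryotes | 90% | 20,532 | 17,389 | 17,131 | 16,430 | 22,733 | 24,038 | 26,378 | 8,054 |
| [32,143]^ | 70% | 1,448 | | | | | | | |
| Prokaryotes | 90% | 9,660 | 8,915 | 8,202 | 4,723 | 9,843 | 10,772 | 11,088 | 5,924 |
| [12,295]^ | 70% | 224 | | | | | | | |
| Unknown | 90% | 36 | 32 | 32 | 10 | 36 | 50 | 50 | 4 |
| [57]^ | 70% | 4 | | | | | | | |
| Total | | 30,228 | 26,336 | 25,365 | 21,163 | 32,612 | 34,860 | 37,516 | 13,982 |
| [44,495]^ | | | | | | | | | 2,047* |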
Cov: coverage, the ratio of the length of the intersection of the aligned regions on the two sequences to the overall length of the alignment (namely, the sum of the lengths of the two sequences minus the intersection length). For both Cov values, sequence identity (SI) is ≥ 40%. MFO: Molecular Function Ontology; BPO: Biological Process Ontology; CCO: Cellular Component Ontology. ALL-O: number of sequences with predicted MFO OR BPO OR CCO terms. Pfam: number of sequences with Pfam terms. ALL-O OR Pfam: the union of ALL-O and Pfam. °PDB: sequences that inherit a structural template from a cluster HMM within BAR+ [20]. ^CAFA/BAR+ set sequences from Eukaryotes, Prokaryotes, and Unknown organisms. *Sequences with a corresponding PDB structure.
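
The Cov definition in the footnote can be written as a short calculation. The following is a minimal sketch, not the BAR+ implementation: the function names `coverage` and `meets_cutoffs` and their parameters are hypothetical, and treating the 90%/70% Cov values and the 40% SI value as "greater than or equal to" cutoffs is an assumption based on the footnote wording.

```python
def coverage(len_a: int, len_b: int, intersection: int) -> float:
    """Coverage as described in the table footnote (hypothetical helper).

    Cov = intersection / (len_a + len_b - intersection), where
    `intersection` is the length of the intersection of the aligned
    regions on the two sequences.
    """
    if intersection < 0 or intersection > min(len_a, len_b):
        raise ValueError("intersection must lie between 0 and the shorter sequence length")
    overall = len_a + len_b - intersection  # overall length of the alignment
    if overall <= 0:
        raise ValueError("sequence lengths must be positive")
    return intersection / overall


def meets_cutoffs(cov: float, seq_identity: float,
                  min_cov: float = 0.90, min_si: float = 0.40) -> bool:
    """Assumed reading of the table: Cov >= cutoff and SI >= 40%."""
    return cov >= min_cov and seq_identity >= min_si


# Example: sequences of 100 and 120 residues sharing a 95-residue aligned region.
cov = coverage(100, 120, 95)          # 95 / (100 + 120 - 95) = 0.76
print(f"Cov = {cov:.2f}")
print(meets_cutoffs(cov, 0.45, min_cov=0.70))  # passes the 70% cutoff
print(meets_cutoffs(cov, 0.45, min_cov=0.90))  # fails the 90% cutoff
```

Because the denominator is the sum of both lengths minus the intersection, coverage reaches 1.0 only when the two sequences have equal length and are aligned over their full length.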