Skip to main content

Table 1 Statistics on the source datasets and calculated EC-Pfam associations

From: ECDomainMiner: discovering hidden associations between enzyme commission numbers and Pfam domains

  Dataset EC-Pfam associations Distinct 4-digit EC numbers Distinct Pfam entries
Source SIFTS 6306 2648 2611
Datasets SwissProt 18,917 4013 3101
  TrEMBL 124,699 3751 5703
  UniRule 141,990 1020 2907
  Merged 262,571 4648 6639
Reference InterPro 1515 688 1284
ECDomainMiner With CS above threshold 8 2 5 6 3 7 0 1 3 0 2 2
Results (Overlap with InterPro) (1 4 6 1) (6 8 8) (1 2 4 5)
  Including low CS 2 0 , 7 2 8 4 4 5 5 3 6 1 3
  (Overlap with InterPro) (1 4 9 8) (6 8 8) (1 2 7 3)
  1. CS is the Confidence Score
  2. All italicized entries are calculated by ECDomainMiner
\