Skip to main content

Table 1 Statistics on the source datasets and calculated EC-Pfam associations

From: ECDomainMiner: discovering hidden associations between enzyme commission numbers and Pfam domains

 

Dataset

EC-Pfam associations

Distinct 4-digit EC numbers

Distinct Pfam entries

Source

SIFTS

6306

2648

2611

Datasets

SwissProt

18,917

4013

3101

 

TrEMBL

124,699

3751

5703

 

UniRule

141,990

1020

2907

 

Merged

262,571

4648

6639

Reference

InterPro

1515

688

1284

ECDomainMiner

With CS above threshold

8 2 5 6

3 7 0 1

3 0 2 2

Results

(Overlap with InterPro)

(1 4 6 1)

(6 8 8)

(1 2 4 5)

 

Including low CS

2 0 , 7 2 8

4 4 5 5

3 6 1 3

 

(Overlap with InterPro)

(1 4 9 8)

(6 8 8)

(1 2 7 3)

  1. CS is the Confidence Score
  2. All italicized entries are calculated by ECDomainMiner