Table 1 Sensitivity, selectivity, specificity and correlation of PROSITE test cases and the corresponding extended patterns.

From: A structural study for the optimisation of functional motifs encoded in protein sequences

| AC (a) | Type (a) | Sn (b) | Sl (b) | Sp (b) | C (b) |
|---|---|---|---|---|---|
| AA_TRNA_LIGASE_II_1 | prosite | 0.648 | 0.802 | 0.9994 | 0.720 |
| | extended 1 | 0.648 | 0.908 | 0.9997 | 0.767 |
| | extended 2 | 0.719 | 0.813 | 0.9994 | 0.764 |
| AA_TRNA_LIGASE_II_2 | prosite | 0.531 | 0.464 | 0.9977 | 0.494 |
| | extended 1 | 0.528 | 0.617 | 0.9988 | 0.569 |
| | extended 2 | 0.622 | 0.528 | 0.9979 | 0.572 |
| ASP_PROTEASE | prosite | 0.974 | 0.836 | 0.9996 | 0.902 |
| | extended 1 | 0.974 | 0.921 | 0.9998 | 0.947 |
| | extended 2 | 0.984 | 0.900 | 0.9998 | 0.941 |
| EGF_1 | prosite | 0.679 | 0.792 | 0.9993 | 0.733 |
| | extended 1 | 0.679 | 0.936 | 0.9998 | 0.796 |
| | extended 2 | 0.750 | 0.864 | 0.9995 | 0.805 |
| LIPOCALIN | prosite | 0.667 | 0.461 | 0.9992 | 0.554 |
| | extended 1 | 0.667 | 0.864 | 0.9999 | 0.759 |
| | extended 2 | 0.771 | 0.871 | 0.9999 | 0.820 |
| RRM_RNP_1 | prosite | 0.582 | 0.537 | 0.9985 | 0.558 |
| | extended 1 | 0.582 | 0.719 | 0.9993 | 0.646 |
| | extended 2 | 0.629 | 0.614 | 0.9988 | 0.620 |
| THIOL_PROTEASE_HIS | prosite | 0.785 | 0.459 | 0.9986 | 0.596 |
| | extended 1 | 0.785 | 1.000 | 1.0000 | 0.886 |

(a) Pattern accession number (AC) in the PROSITE database and the type of pattern (type): PROSITE, extended 1 or extended 2.

(b) Sensitivity Sn = TP/(TP + FN), selectivity Sl = TP/(TP + FP), specificity Sp = TN/(TN + FP) and correlation C = (TP × TN - FP × FN)/[(TP + FP) × (FP + TN) × (TN + FN) × (FN + TP)]^(1/2) of the patterns on SWISS-PROT release 40.8. TP, FP, TN and FN are the numbers of true positives, false positives, true negatives and false negatives. TN is calculated as the total number of sequences in SWISS-PROT release 40.8 (101659) less the sum of true positive, false negative and partial sequences.
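
The four measures defined in the table footnote can be computed directly from the counts of true and false positives and negatives. The following is a minimal Python sketch of those formulas; the function name `pattern_metrics` and the counts in the usage line are hypothetical illustrations, not values from the study, and the sketch assumes every denominator is non-zero.

```python
import math


def pattern_metrics(tp, fp, tn, fn):
    """Return Sn, Sl, Sp and C as defined in the table footnote.

    Hypothetical helper for illustration; assumes all denominators are non-zero.
    """
    sn = tp / (tp + fn)  # sensitivity
    sl = tp / (tp + fp)  # selectivity
    sp = tn / (tn + fp)  # specificity
    # Correlation: (TP*TN - FP*FN) over the square root of the product
    # of the four marginal sums.
    denom = math.sqrt((tp + fp) * (fp + tn) * (tn + fn) * (fn + tp))
    c = (tp * tn - fp * fn) / denom
    return {"Sn": sn, "Sl": sl, "Sp": sp, "C": c}


# Made-up counts, chosen only to exercise the formulas.
m = pattern_metrics(tp=80, fp=20, tn=9880, fn=20)
print({name: round(value, 4) for name, value in m.items()})
```

Because true negatives vastly outnumber the other classes in a whole-database scan, Sp stays close to 1 for every pattern, which is why Sl and C are the more discriminating columns in the table.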