Skip to main content

Table 5 Discovery of STPs across major domains using PDB protein sequence data and PredSTP

From: PredSTP: a highly accurate SVM based model to predict sequential cystine stabilized peptides

PDB subset

Total # of proteins analyzed

Total # of chains

Positive chains predicted by PredSTP

Number of proteins containing positive chains

Percentage of positive chains

Eukaryotes

45751

102748

636

139a

0.61

Eubacteria

31664

80664

3

2

0.003

Archaea

3127

8366

0

0

0

Viruses

4629

18642

4

3

0.02

Unassigned

479

980

10

10

1.02

  1. aFor eukaryotes, 139chains were obtained after screening 636 chains and removing those with ≥ 30 % sequence identity