Skip to main content

Table 1 Data statistics of experimentally verified phosphorylation sites in each resource.

From: Incorporating substrate sequence motifs and spatial amino acid composition to identify kinase-specific phosphorylation sites on protein three-dimensional structures

Data set

Data Resource

Version

Number of phosphorylation sites

Number of phosphorylated proteins

   

S

T

Y

 

Training set

Phospho.ELM

9.0

26,136

6,316

3,118

8,690

 

UniProtKB

20120711

92,221

23,289

14,337

34,040

 

Combined (NR 1 )

-

98,376

25,269

15,188

35,047

Independent testing set

PhosphoSitePlus

20120730

73,969

19,946

14,696

18,550

 

PHOSIDA

1.0

7,391

1,300

278

2,212

 

SysPTM

1.1

30,307

6,643

2,255

10,667

 

HPRD

9.0

34,273

10,761

4,121

7,753

 

Combined (NR 1 )

-

97,753

27,421

16,531

23,813

  1. 1NR, non-redundant.