Table 1 Number of sites in the training and independent testing sets.

From: Characterization and identification of protein O-GlcNAcylation sites with substrate specificity

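| Dataset | Data resource | Residue | O-GlcNAcylated sites (positive data) | Non-O-GlcNAcylated sites (negative data) |
|---|---|---|---|---|
| Training set | dbOGAP | Serine | 240 | 16740 |
| Training set | dbOGAP | Threonine | 135 | 10079 |
| Training set | dbOGAP | Ser and Thr | 375 | 26819 |
| Independent testing set | UniProtKB | Serine | 57 | 4488 |
| Independent testing set | UniProtKB | Threonine | 51 | 2978 |
| Independent testing set | UniProtKB | Ser and Thr | 108 | 7466 |
| Independent testing set | OGlycBase | Serine | 24 | 1013 |
| Independent testing set | OGlycBase | Threonine | 24 | 694 |
| Independent testing set | OGlycBase | Ser and Thr | 48 | 1707 |
| Independent testing set | PhosphoSitePlus | Serine | 779 | 58082 |
| Independent testing set | PhosphoSitePlus | Threonine | 582 | 34217 |
| Independent testing set | PhosphoSitePlus | Ser and Thr | 1361 | 92299 |
| Independent testing set | Non-redundant dataset | Serine | 578 | 41075 |
| Independent testing set | Non-redundant dataset | Threonine | 470 | 23920 |
| Independent testing set | Non-redundant dataset | Ser and Thr | 1048 | 64995 |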