Skip to main content

Table 1 A statistical summary of glycosylated proteins and glycosylation sites collected from 2007, 2010, 2013, and 2016 data

From: Positive-unlabelled learning of glycosylation sites in the human proteome

Year

Type

Initial dataset prior to redundancy removal

Final dataset after redundancy removal

Num. of sites

Num. of substrates

Num. of sites

Num. of substrates

2007

C-linked

36

10

36

10

N-linked

1245

537

1208

520

O-linked

321

101

320

100

2010

C-linked

38

12

38

12

N-linked

2175

908

2118

872

O-linked

345

114

344

113

2013

C-linked

43

15

43

15

N-linked

2508

1004

2442

965

O-linked

474

178

455

162

2016

C-linked

46

17

46

17

N-linked

2805

1111

2728

1066

O-linked

698

221

679

212