
Table 2 Statistics of the proteins' molecular weight

From: Representative transcript sets for evaluating a translational initiation sites predictor

  

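Molecular weight ranges are in kDa. PR = Population Ratio; LR = Loss Ratio.

| Species | Phase | [0, 20) PR | [0, 20) LR | [20, 40) PR | [20, 40) LR | [40, 60) PR | [40, 60) LR | [60, 80) PR | [60, 80) LR | [80, 100) PR | [80, 100) LR | [100, 120) PR | [100, 120) LR | [120, 140) PR | [120, 140) LR | [140, +∞) PR | [140, +∞) LR |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| H. sap. | I | 13% | 0% | 28% | 0% | 23% | 0% | 13% | 0% | 8% | 0% | 5% | 0% | 3% | 0% | 7% | 0% |
| H. sap. | II | 0% | 100% | 40% | 55% | 45% | 39% | 15% | 64% | 0% | 100% | 0% | 100% | 0% | 100% | 0% | 100% |
| H. sap. | III | 0% | 100% | 47% | 83% | 39% | 83% | 14% | 89% | 0% | 100% | 0% | 100% | 0% | 100% | 0% | 100% |
| M. mus. | I | 11% | 0% | 32% | 0% | 23% | 0% | 13% | 0% | 8% | 0% | 5% | 0% | 3% | 0% | 6% | 0% |
| M. mus. | II | 0% | 100% | 41% | 59% | 44% | 38% | 15% | 63% | 0% | 100% | 0% | 100% | 0% | 100% | 0% | 100% |
| M. mus. | III | 0% | 100% | 46% | 82% | 40% | 79% | 14% | 87% | 0% | 100% | 0% | 100% | 0% | 100% | 0% | 100% |
| C. ele. | I | 19% | 0% | 33% | 0% | 24% | 0% | 11% | 0% | 5% | 0% | 3% | 0% | 2% | 0% | 3% | 0% |
| C. ele. | II | 0% | 100% | 43% | 56% | 44% | 39% | 13% | 60% | 0% | 100% | 0% | 100% | 0% | 100% | 0% | 100% |
| C. ele. | III | 0% | 100% | 44% | 92% | 44% | 89% | 13% | 93% | 0% | 100% | 0% | 100% | 0% | 100% | 0% | 100% |
| D. mel. | I | 14% | 0% | 25% | 0% | 22% | 0% | 14% | 0% | 8% | 0% | 5% | 0% | 3% | 0% | 8% | 0% |
| D. mel. | II | 0% | 100% | 38% | 55% | 45% | 39% | 17% | 64% | 0% | 100% | 0% | 100% | 0% | 100% | 0% | 100% |
| D. mel. | III | 0% | 100% | 44% | 85% | 41% | 84% | 14% | 91% | 0% | 100% | 0% | 100% | 0% | 100% | 0% | 100% |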

  1. This table reports statistics on the molecular weight (MW) of the proteins corresponding to the transcript sequences included in the Phase I, II and III data sets (unit: kDa). To facilitate the analysis, the entire MW range has been divided into smaller ranges (e.g., 0–20 kDa, 20–40 kDa, etc.). Two measures are used throughout the table. PR, short for Population Ratio, is a horizontal comparison: it compares the protein population within a group against the other groups in the same phase. LR, short for Loss Ratio, is a vertical comparison: for a given group, it considers the portion of the protein population that has been lost through phase transitions. More specifically, given a particular aspect X under investigation (e.g., MW, pI, etc.), PR is the number of proteins whose value of X falls within a particular range (e.g., [20, 40) kDa, [8, 9) pH, etc.) divided by the entire protein population in the same phase. LR is the number of eliminated proteins whose value of X falls within a particular range Y divided by the original (Phase I) protein population within that range Y. Data highlighted in italic and bold font are the outstanding numbers within the row they belong to.
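
The PR and LR definitions in the footnote can be illustrated with a minimal Python sketch. This is not the authors' code: the function names (`bin_counts`, `population_ratio`, `loss_ratio`) are hypothetical, the bin edges are taken from the table header, and the sketch assumes that each later phase is a subset of the Phase I population, so that the number of eliminated proteins in a range is the Phase I count minus the later-phase count.

```python
from bisect import bisect_right

# Lower bin edges in kDa, taken from the table header.
# Bins are half-open [edge_i, edge_{i+1}); the last bin is [140, +inf).
EDGES = [0, 20, 40, 60, 80, 100, 120, 140]


def bin_counts(weights_kda):
    """Count how many molecular weights fall into each bin."""
    counts = [0] * len(EDGES)
    for w in weights_kda:
        if w < EDGES[0]:
            raise ValueError(f"molecular weight must be non-negative, got {w}")
        # bisect_right returns the number of edges <= w, so subtracting 1
        # gives the index of the bin whose lower edge is closest below w.
        counts[bisect_right(EDGES, w) - 1] += 1
    return counts


def population_ratio(weights_kda):
    """PR: share of one phase's proteins that falls into each bin."""
    counts = bin_counts(weights_kda)
    total = sum(counts)
    return [c / total if total else 0.0 for c in counts]


def loss_ratio(phase1_weights, later_weights):
    """LR: share of each bin's Phase I proteins missing from a later phase.

    Assumes the later phase is a subset of Phase I (an assumption of this
    sketch). Bins that are empty in Phase I have no defined LR and yield None.
    """
    before = bin_counts(phase1_weights)
    after = bin_counts(later_weights)
    return [(b - a) / b if b else None for b, a in zip(before, after)]


if __name__ == "__main__":
    # Made-up molecular weights, for illustration only.
    phase1 = [10, 15, 25, 30, 35, 50, 150]
    phase2 = [25, 50]
    print(population_ratio(phase2))
    print(loss_ratio(phase1, phase2))
```

With these definitions, every Phase I row necessarily has LR equal to 0%, since no proteins have been eliminated yet, which matches the Phase I rows of the table.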