Skip to main content

Table 3 An example of protein location prediction. The host YGL115W has four guest proteins that share four statistically significant signatures. The host and all its guests with known location were found in cytoplasm. Thus the location of YGL208W was predicted as cytoplasm. The prediction was then confirmed with the ontology annotation in SGD database. The p-value of the occurrence is the probability that a single random subsequence of the length of the motif matches the motif.

From: Discover protein sequence signatures from protein-protein interaction data

Guest

Motif ID

P-value

Guest location

YER027C

YGL115W_1

3.17E-76

cytoplasm

YGL208W

YGL115W_1

7.48E-75

 

YDR422C

YGL115W_1

4.78E-48

cytoplasm

YER027C

YGL115W_2

3.87E-56

cytoplasm

YGL208W

YGL115W_2

8.48E-57

 

YDR422C

YGL115W_2

3.64E-37

cytoplasm

YER027C

YGL115W_3

6.83E-77

cytoplasm

YGL208W

YGL115W_3

6.37E-71

 

YDR028C

YGL115W_3

9.81E-38

cytoplasm

YER027C

YGL115W_4

5.62E-22

cytoplasm

YGL208W

YGL115W_4

7.23E-24

 

YDR477W

YGL115W_4

1.89E-14

cytoplasm