From: Automatic discovery of cross-family sequence features associated with protein function
A. All keywords | ||||||
---|---|---|---|---|---|---|
 | Experiment | Control I | Control II | |||
 | mean | SE | mean | SE | mean | SE |
CC on training set | 0.265 | 0.00461 | 0.162 | 0.00226 | 0.209 | 0.00288 |
CC on testing set | 0.112 | 0.00453 | 0.00451 | 0.00294 | 0.0664 | 0.00406 |
Top 10 keywords | secreted nuclear membrane cytoplasmic DNA biosynthesis RNA integral meiosis catalyzes | |||||
B. Subcellular location keywords excluded | ||||||
 | Experiment | Control I | Control II | |||
 | mean | SE | mean | SE | mean | SE |
CC on training set | 0.231 | 0.00325 | 0.179 | 0.00240 | 0.213 | 0.00303 |
CC on testing set | 0.0603 | 0.00619 | 0.00619 | 0.00276 | 0.0402 | 0.00350 |
Top 10 keywords | inhibits biosynthesis transcription catalyzes DNA atp bacteria stimulates transcriptional gram-negative |