Skip to main content

Table 3 Classifier performance in predicting GO terms in mouse, quantified by area under the ROC curve (AUC) and precision at 20% recall (P20R).

From: Combining heterogeneous data sources for accurate functional annotation of proteins

Namespace

 

AUC

  

P20R

 
 

MF

BP

CC

MF

BP

CC

Cross-species

0.90

0.67

0.81

0.52

0.16

0.42

Species-specific

0.86

0.83

0.86

0.42

0.29

0.46

Multi-view

0.91

0.81

0.88

0.57

0.30

0.58

Chain

0.89

0.82

0.87

0.51

0.28

0.52

  1. The cross-species classifier uses only sequence data; the species-specific classifier uses a collection of genomic data--PPI, gene expression, and protein-GO term co-mention mined from the biomedical literature. The multi-view and chain classifiers are two approaches for integrating cross-species and species-specific data. The presented values are averages across all GO terms considered in a particular namespace. The results were obtained using five-fold cross-validation.