Skip to main content

Table 1 Performances over a test set of 15,000 publications

From: GOTA: GO term annotation of biomedical literature

Method a

Info b

IT c

 

CAFA c

BC c

TREC c

 
  

i P 1

i R 10

h F max

h R 10

M R R 10

R 10

GOTA

PM

0.43

0.64

0.43

0.69

0.40

0.46

GOTA

T+A

0.42

0.64

0.43

0.68

0.39

0.45

GOTA

T

0.41

0.63

0.42

0.68

0.39

0.44

RandFR

N/A

0.20

0.33

0.20

0.33

0.18

0.15

RandIC

N/A

0.21

0.27

0.18

0.31

0.03

0.08

GOTA Φ P

PM

0.37

0.64

0.41

0.67

0.38

0.44

GOTA Φ P

T+A

0.35

0.62

0.40

0.66

0.36

0.41

GOTA Φ P

T

0.35

0.62

0.40

0.66

0.36

0.41

GOTA Φ T

PM

0.28

0.41

0.30

0.49

0.16

0.17

GOTA Φ T

T+A

0.24

0.37

0.27

0.46

0.11

0.12

GOTA Φ T

T

0.22

0.35

0.26

0.44

0.09

0.10

  1. aMethod used for the classification. RandFR and RandIC are baseline predictors, based on the distribution of GO terms in the training set
  2. bInformations used in prediction: PM = title, abstract, references and publication year (PubMed); T+A = title and abstract; T = title; N/A = no information
  3. cMetrics definitions are in the “Evaluation metrics” section. In top section of the table, for each metric, the best result is highlighted in italic