Table 8 Overlapping entities intra-corpus and inter-corpora

From: Investigating heterogeneous protein annotations toward cross-corpora utilization

                 The number of overlapping entities
Data             AIMed    GENETAG    GENIA-AIMed    GENIA-GENETAG
AIMed            449      113        117            -
GENETAG          113      204        -              108
GENIA-AIMed      117      -          347            -
GENIA-GENETAG    -        108        -              332
  1. "GENIA-AIMed" and "GENIA-GENETAG" represent the GENIA data used in Exp 2 and Exp 4 of Table 6, which are compatible with AIMed and GENETAG, respectively. The number of gene/protein mentions in each portion is 2039.