Table 8 Overlapping entities intra-corpus and inter-corpora

From: Investigating heterogeneous protein annotations toward cross-corpora utilization

 

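The number of overlapping entities

| Data          | AIMed | GENETAG | GENIA-AIMed | GENIA-GENETAG |
|---------------|-------|---------|-------------|---------------|
| AIMed         | 449   | 113     | 117         | -             |
| GENETAG       | 113   | 204     | -           | 108           |
| GENIA-AIMed   | 117   | -       | 347         | -             |
| GENIA-GENETAG | -     | 108     | -           | 332           |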

  1. "GENIA-AIMed" and "GENIA-GENETAG" represent the GENIA data used in Exp 2 and Exp 4 of Table 6, which are compatible with AIMed and GENETAG, respectively. The number of gene/protein mentions in each portion is 2039.