Skip to main content

Table 1 Dataset statistics

From: B-LBConA: a medical entity disambiguation model based on Bio-LinkBERT and context-aware mechanism

  

NCBI

ADR

ShARe/CLEF

Train set

5932

7038

5816

Test set

960

6343

5351

Refined test

206(21.4%)

1544(24.3%)

1487(2.8%)

NIL

Train set

0

47

1641

Test set

0

18

1750

Refined test

0

2

536

Concepts

Train set

668

1517

1034

Test set

203

1323

942

Refined test

140

857

879