Skip to main content

Table 1 Statistics of the PharmaCoNER 2019 Corpus

From: Improving deep learning method for biomedical named entity recognition by using entity definition information

Statistic

#Training

#Development

#Test

#Background

RECORDS

500

250

250

3751

SENTENCES

8776

4028

4260

\

NORMALIZABLES

2304

1121

973

\

NO_NORMALIZABLES

24

16

10

\

PROTEINAS

1405

745

859

\

UNCLEAR

89

44

0

\