Table 1 Statistics of the datasets

From: Long short-term memory RNN for biomedical named entity recognition

 

                        Training     Dev    Test
GM
  Sentences                13500    1500    5000
  One-word Entities         7051     805    2831
  Multi-word Entities       9355    1047    3494
  Total Entities           16406    1852    6325
JNLPBA
  Sentences                16691    1855    3856
  One-word Entities        19476    2170    3466
  Multi-word Entities      26765    2890    5196
  Total Entities           46241    5060    8662