Skip to main content

Table 1 Statistics of the datasets

From: Long short-term memory RNN for biomedical named entity recognition

  Training Dev Test
GM    
  Sentences 13500 1500 5000
  One-word Entities 7051 805 2831
  Multi-word Entities 9355 1047 3494
  Total Entities 16406 1852 6325
JNLPBA    
  Sentences 16691 1855 3856
  One-word Entities 19476 2170 3466
  Multi-word Entities 26765 2890 5196
  Total Entities 46241 5060 8662