Skip to main content

Table 1 Statistics of the NCBI, GM, and CDR corpora

From: Biomedical named entity recognition using deep neural networks with contextual information

CorpusEntityUnitTrainingDevelopTestTotal (Unit)
NCBIDiseaseAbstracts592100100792 (abstracts)
GMGeneSentences15000-500020000 (sentences)
CDRDisease, ChemicalsAbstracts5005005001500 (abstracts)