Table 1 BioNER corpora in experiments

From: DTranNER: biomedical named entity recognition with deep learning-based label-label transition model

| Dataset | Number of Sentences | Entity Type | Entity Count | Max Entity Length (tokens) | Average Entity Length (tokens) |
|---|---|---|---|---|---|
| BC2GM [35] | 20128 | Gene/Protein | 24583 | 26 | 2.44 |
| BC4CHEMD [36] | 87682 | Chemical/Drug | 84310 | 137 | 2.19 |
| BC5CDR-Chemical [37] | 13935 | Chemical/Drug | 15935 | 56 | 1.33 |
| BC5CDR-Disease [37] | 13935 | Disease | 12852 | 19 | 1.65 |
| NCBI-Disease [38] | 7284 | Disease | 6881 | 22 | 2.21 |