Skip to main content

Advertisement

Table 2 BIO sentence processing example. This table illustrates a BIO (beginning-inside-outside) processing of a sentence, obtained from a drug label of “Zylelig”, an anti-cancer medicine. Every drug sectioned with a unique id (S3 in the given sentence). Every token within the sections has the property Offset which is the character count before the first character of a given token

From: Machine learning-based identification and rule-based normalization of adverse drug reactions in drug labels

Raw TextBIO encodingSectionOffsetLength
FatalB-ADRS327635
andOS327693
seriousB-SEVS327737
intestinalB-ADRS3278110
perforationI-ADRS3279211
occurredOS328048
inOS328132
Zydelig-treatedOS3281615
patientsOS328328
.OS328401