Table 2 BIO sentence processing example. This table illustrates the BIO (beginning-inside-outside) encoding of a sentence taken from the drug label of “Zydelig”, an anti-cancer medicine. Each drug label is divided into sections, each with a unique id (S3 for the given sentence). Every token within a section has the property Offset, which is the number of characters preceding the first character of that token

From: Machine learning-based identification and rule-based normalization of adverse drug reactions in drug labels

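| Raw Text | BIO encoding | Section | Offset | Length |
| --- | --- | --- | --- | --- |
| Fatal | B-ADR | S3 | 2763 | 5 |
| and | O | S3 | 2769 | 3 |
| serious | B-SEV | S3 | 2773 | 7 |
| intestinal | B-ADR | S3 | 2781 | 10 |
| perforation | I-ADR | S3 | 2792 | 11 |
| occurred | O | S3 | 2804 | 8 |
| in | O | S3 | 2813 | 2 |
| Zydelig-treated | O | S3 | 2816 | 15 |
| patients | O | S3 | 2832 | 8 |
| . | O | S3 | 2840 | 1 |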
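
The Offset and Length values in Table 2, and the way B-/I- tags combine into multi-token mentions, can be illustrated with a minimal Python sketch. It is not the authors' implementation: the regex tokenizer, the function names `tokenize_with_offsets` and `bio_to_spans`, and the choice to treat a stray I- tag as the start of a new mention are all assumptions made for illustration. The sentence, the tags, and the starting offset 2763 are taken from the table.

```python
import re

# Sentence and tags as given in Table 2; 2763 is the offset of "Fatal" in section S3.
SENTENCE = "Fatal and serious intestinal perforation occurred in Zydelig-treated patients."
TAGS = ["B-ADR", "O", "B-SEV", "B-ADR", "I-ADR", "O", "O", "O", "O", "O"]
BASE_OFFSET = 2763


def tokenize_with_offsets(text, base_offset=0):
    """Split text into tokens and return (token, offset, length) triples.

    Assumed tokenizer: runs of word characters and hyphens form one token,
    and any other non-space character is a token of its own.
    """
    return [
        (m.group(), base_offset + m.start(), len(m.group()))
        for m in re.finditer(r"[\w-]+|[^\w\s]", text)
    ]


def bio_to_spans(rows):
    """Merge BIO-tagged rows (token, tag, offset, length) into mention spans."""
    spans = []
    current = None
    for token, tag, offset, length in rows:
        prefix, _, label = tag.partition("-")
        # An I- tag with the same label extends the open mention.
        if prefix == "I" and current is not None and current["label"] == label:
            current["tokens"].append(token)
            current["length"] = offset + length - current["start"]
            continue
        # Any other tag closes the open mention, if there is one.
        if current is not None:
            spans.append(current)
            current = None
        # B- opens a new mention; a stray I- is treated the same way (assumption).
        if prefix in ("B", "I"):
            current = {"label": label, "start": offset, "length": length, "tokens": [token]}
    if current is not None:
        spans.append(current)
    return spans


tokens = tokenize_with_offsets(SENTENCE, BASE_OFFSET)
rows = [(tok, tag, off, length) for (tok, off, length), tag in zip(tokens, TAGS)]
spans = bio_to_spans(rows)

for span in spans:
    print(span["label"], span["start"], span["length"], " ".join(span["tokens"]))
```

With the table's data, the tokens “intestinal” (B-ADR) and “perforation” (I-ADR) merge into a single ADR mention that starts at offset 2781 and covers 22 characters, while “Fatal” and “serious” remain single-token mentions.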