Skip to main content

Table 3 Statistics of named entities

From: Accelerating the annotation of sparse named entities by dynamic sentence selection

 

# Entities

Sentences (%)

CoNLL: LOC

7,140

5,127 (36.5%)

CoNLL: MISC

3,438

2,698 (19.2%)

CoNLL: ORG

6,321

4,587 (32.7%)

CoNLL: PER

6,600

4,373 (31.1%)

GENIA: DNA

2,017

5,251 (28.3%)

GENIA: RNA

225

810 (4.4%)

GENIA: cell_line

835

2,880 (15.5%)

GENIA: cell_type

1,104

5,212 (28.1%)

GENIA: protein

5,272

13,040 (70.3%)