Skip to main content

Table 3 Performance of logistic regression classifier trained with different feature sets and tested on the BIOADI corpus and the AB3P corpus

From: BIOADI: a machine learning approach to identifying abbreviations and definitions in biological literature

Training Corpus

AB3P corpus

BIOADI corpus

Test Corpus

BIOADI corpus

AB3P corpus

Feature Set(s)

Precision

Recall

F-score

Precision

Recall

F-Score

M

0.9155

0.7242

0.8087

0.9392

0.8082

0.8688

M + L

0.9153

0.7489

0.8238

0.9401

0.8207

0.8763

M + L + N

0.9260

0.7995

0.8581

0.9556

0.8398

0.8939

M + L + N + C

0.9352

0.7995

0.8620

0.9586

0.8464

0.8990

  1. M, String morphological features; L, LF tokens; N, Numeric features; C, Contextual features