Skip to main content

Table 1 Orthographic Feature

From: Recognition of protein/gene names from text using an ensemble of classifiers

Features 1–11

e.g.

Features 12–21

e.g.

Comma

,

OneCap

T

Dot

.

AllCaps

CSF

Parenthesis

() []

CapLowAlpha

All

RomanDigit

II

CapMixAlpha

IgM

GreekLetter

Beta

LowMixAlpha

kDa

StopWord

in, at

AlphaDigitAlpha

H2A

ATCGsequence

ACAG

AlphaDigit

T4

OneDigit

5

DigitAlphaDigit

6C2

AllDigits

60

DigitAlpha

19D

DigitCommaDigit

1,25

Others

Other

DigitDotDigit

0.5

 Â