Table 4 Term Weights in the SMART System.

From: Data-poor categorization and passage retrieval for Gene Ontology Annotation in Swiss-Prot

Term Frequency
First Letter        f(tf)
n (natural)         tf
l (logarithmic)     1 + log(tf)
a (augmented)       α + β × (tf/max(tf))

Inverse Document Frequency
Second Letter       f(1/df)
n (no)              1
t (full)            log(N/df)

Normalization
Third Letter        f(length)
n (no)              1
c (cosine)
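
The three-letter codes in the table can be read as a recipe: the first letter selects the term-frequency component, the second the inverse-document-frequency component, and the third the normalization. The following is a minimal Python sketch of that reading, not the authors' implementation. Several points are assumptions: the function names (`tf_component`, `idf_component`, `smart_weights`) are hypothetical, the defaults α = β = 0.5 are a common choice that the table does not state, the logarithm is taken as natural since the table gives no base, and the cosine entry (whose formula is missing from the table as shown here) is taken to be the usual division by the Euclidean length of the weight vector.

```python
import math


def tf_component(letter, tf, max_tf, alpha=0.5, beta=0.5):
    """First letter: term-frequency component f(tf).

    alpha and beta defaults are assumptions; the table leaves them unspecified.
    """
    if tf <= 0:
        return 0.0  # assumption: a term absent from the document gets zero weight
    if letter == "n":  # natural
        return float(tf)
    if letter == "l":  # logarithmic (natural log assumed)
        return 1.0 + math.log(tf)
    if letter == "a":  # augmented
        return alpha + beta * (tf / max_tf)
    raise ValueError(f"unknown term-frequency letter: {letter!r}")


def idf_component(letter, n_docs, df):
    """Second letter: inverse-document-frequency component f(1/df)."""
    if letter == "n":  # no idf
        return 1.0
    if letter == "t":  # full idf (natural log assumed)
        return math.log(n_docs / df)
    raise ValueError(f"unknown idf letter: {letter!r}")


def smart_weights(scheme, tfs, dfs, n_docs):
    """Weight one document's terms under a three-letter scheme such as 'ltc'.

    tfs maps term -> frequency in this document,
    dfs maps term -> number of documents containing the term,
    n_docs is the collection size N.
    """
    tf_letter, idf_letter, norm_letter = scheme
    max_tf = max(tfs.values(), default=1)
    raw = {
        term: tf_component(tf_letter, tf, max_tf)
        * idf_component(idf_letter, n_docs, dfs[term])
        for term, tf in tfs.items()
    }
    if norm_letter == "n":  # no normalization
        return raw
    if norm_letter == "c":  # cosine: divide by Euclidean length (assumed form)
        length = math.sqrt(sum(w * w for w in raw.values()))
        if length == 0.0:
            return raw
        return {term: w / length for term, w in raw.items()}
    raise ValueError(f"unknown normalization letter: {norm_letter!r}")
```

For example, `smart_weights("ltc", {"gene": 3, "protein": 1}, {"gene": 10, "protein": 100}, 1000)` applies logarithmic term frequency, full idf, and cosine normalization, so the resulting weight vector has unit length. Keeping the three components as separate functions mirrors the table's layout and makes it easy to try other letter combinations.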