Training set statistics | Method | ||||
---|---|---|---|---|---|
 | cdf | ctf | ctf-icdf | Stemming | Synonyms |
(a) Number of positive examples | -.49 | -.50 | -.55 | -.51 | -.53 |
(b) Sum of article counts | -.39 | -.39 | -.45 | -.40 | -.43 |
(c) Maximum article count | -.20 | -.19 | -.23 | -.20 | -.26 |
(d) Mean article count | .09 | .10 | .06 | .10 | .00 |
(e) Median article count | .18 | .17 | .17 | .16 | .20 |
(f) Minimum article count | .32 | .33 | .34 | .34 | .33 |
(g) Variance of articles counts | -.01 | -.00 | -.03 | -.01 | -.09 |
(h) Skewness of article counts | -.33 | -.35 | -.36 | -.35 | -.40 |