| Regular expression method |  | Term voting method |  |
---|---|---|---|---|
TOTAL terms | 98435 | 100% | 98435 | 100% |
Selected set | 13755 | 14% | 13755 | 14% |
Excluded set | 84680 | 86% | 84680 | 86% |
Sample of excluded | 3140 | 100% | Â | Â |
Wrong (false negative) | 49 | 1.6% | Â | Â |
Correct (true negative) | 3091 | 98.4% | Â | Â |
Proportionate number of bona fide terms in excluded set | 1321 | Â | Â | Â |
Sample of included | 2070 | 100% | 2287 | 100% |
Wrong (false positive) | 1538 | 74.3% | 1974 | 86.3% |
Correct (true positive) | 532 | 25.7% | 313 | 13.7% |
Probable number of bona fide terms in selected set | 3535 | Â | 1883 | Â |
Recall | 0.728 | Â | Â | Â |
Precision | 0.257 | Â | 0.137 | Â |