From: Mapping biological entities using the longest approximately common prefix method
Dataset | D 1 | D 2 | D 3 | D 4 |
---|---|---|---|---|
Jaccard | 70 | 20 | 568 | 324 |
Jaro | 105 | 25 | 3,637 | 1,102 |
Jaro-Winkler | 115 | 26 | 3,617 | 1,265 |
Levenshtein | 1,273 | 301 | 57,811 | 16,596 |
Monge-Elkan | 6,240 | 1,340 | 258,502 | 77,555 |
Needleman-Wunsch | 1,294 | 258 | 57,982 | 15,918 |
Smith-Waterman | 1,444 | 293 | 58,753 | 17,519 |
TFIDF | 132 | 37 | 928 | 558 |
Soft TFIDF | 208 | 144 | 186,937 | 11,983 |
LACP | 40 | 11 | 202 | 233 |