pyMeSHSim: an integrative python package for biomedical named entity recognition, normalization, and comparison of MeSH terms

Table 2 Performance of pyMeSHSim, DNorm, and TaggerOne compared with Nelson’s manual work, with the similarity threshold set to 1

Method	Recall^a	Precision^b	F1^c
pyMeSHSim (with SCRs)	0.94	0.56	0.70
pyMeSHSim (no SCRs)	0.94	0.54	0.68
DNorm	0.32	0.62	0.42
TaggerOne	0.49	0.64	0.55

^a \( Recall=\frac{TP}{TP+FN} \), where TP (true positive) is the number of phenotypes whose parsing results matched the manual work at the given similarity threshold. The similarity between the MeSH terms identified by the two methods was measured with the Lin score, and a result was called a TP or an FP when its similarity was, respectively, higher or lower than the given threshold. FN (false negative) is the number of unrecognized phenotypes.
^b \( Precision=\frac{TP}{TP+FP} \), where FP (false positive) is the number of phenotypes whose parsing results mismatched the manual work at the given similarity threshold.
^c \( F1=\frac{2\times Precision\times Recall}{Precision+Recall} \).
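
The three footnote definitions can be sketched in a few lines of Python. This is only a minimal illustration of the formulas, not code from pyMeSHSim: the function name `precision_recall_f1` and the counts in the usage example are hypothetical, and returning 0.0 for an empty denominator is an assumption made here for convenience.

```python
from typing import Tuple


def precision_recall_f1(tp: int, fp: int, fn: int) -> Tuple[float, float, float]:
    """Return (recall, precision, F1) from raw counts.

    tp: phenotypes whose parsed MeSH term matched the manual annotation
    fp: phenotypes whose parsed MeSH term mismatched the manual annotation
    fn: phenotypes that were not recognized at all
    Empty denominators yield 0.0 (an assumption, not taken from the paper).
    """
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return recall, precision, f1


# Hypothetical counts, chosen only to illustrate the arithmetic.
recall, precision, f1 = precision_recall_f1(tp=50, fp=40, fn=10)
print(f"recall={recall:.2f} precision={precision:.2f} F1={f1:.2f}")
```

Because F1 is the harmonic mean of precision and recall, it is pulled toward the lower of the two values, which is why a method with high recall but moderate precision still ends up with a moderate F1.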

ISSN: 1471-2105