Skip to main content

Table 4 Overview of systems which aimed at full coverage. The most frequent sense was used as the baseline method. We represent the results of Xu et al by using MeSH codes in the second row for the sake of comparability. The results of a C4.5 decision tree using the MeSH features are present in the third row. The systems of the two last rows first apply the combined co-author graph based heuristics and when they cannot decide they use the supervised prediction of the cosine similarity metric or the decision tree.

From: The strength of co-authorship in gene name disambiguation

Method

Human

Mouse

Fly

Yeast

Baseline

59.3%–99.1%

79%

66.7%

65.5%

Xu et al [14, 15] MeSH

86.3%–94.4%

90.7%–99.4%

69.4%–99.7%

78.9%–98.4%

Decision tree

84.68%–100%

90.90%–99.84%

72.53%–99.85%

74.49%–100%

Co-author heuristics + similarity

91.87%–99.19%

98.54%–99.75%

97.20%–100%

94.15%–99.70%

Co-author heuristics + decision tree

94.35%–100%

98.85%–99.91%

96.05%–99.85%

99.63%–100%