Skip to main content

Table 6 Performance of the Fusion Approach on Okapi 2007 and 2006

From: A robust approach to optimizing multi-source information for enhancing genomics retrieval performance

Components Okapi 2007 Okapi 2006
  document aspect passage2 document aspect
word 0.2108 0.1080 0.0364 0.3140 0.1237
sentence 0.1805 0.0970 0.0350 0.3030 0.1206
paragraph 0.1588 0.0616 0.0333 0.3109 0.1410
reciprocal 0.2219 (5.29%) 0.1237 (14.51%) 0.0478 (31.40%) 0.3168 (1.07%) 0.1449 (12.25%)
CombMNZ-with-normalization 0.1703 (-19.20%) 0.0643 (-40.43%) 0.0270 (-25.92%) 0.2352 (-26.55%) 0.0498 (-61.46%)
CombMNZ-with-assigned-weights 0.1777 (-15.72%) 0.0701 (-35.12%) 0.0273 (-24.88%) 0.2441 (-23.78%) 0.0524 (-59.43%)
CombMNZ-with-multiple 0.1730 (-17.93%) 0.0651 (-39.73%) 0.0277 (-24.01%) 0.2375 (-25.85%) 0.0508 (-60.62%)
CombSUM 0.1818 (-13.76%) 0.0718 (-33.56%) 0.0297 (-18.43%) 0.2559 (-20.10%) 0.0719 (-44.32%)
  1. We examine the proposed robust approach on the single model with Okapi BM25. First of all, the baselines are from three different indices under the same IR model, BM25, instead of those from three kind of IR models. Second, three indices are built on the 2007 and 2006 genomics data sets according to three passage extraction methods [11, 12]. Here “word” stands for “word-base”, “sentence” for “sentence-base” and “paragraph” for “paragraph-base”. Third, the Okapi tuning parameters of the selected runs are (k1, b) = (0.5, 1.3). The values in the parentheses are the relative rates of improvement over the best results of the baselines.
\