Skip to main content

Table 5 Inter-Annotator Agreement (IAA) over all ratings in the manual evaluation of the retrieved answers

From: A question-entailment approach to question answering

Assessors

IAA

Partial IAA

 

P (%)

F1 (%)

P (%)

F1 (%)

A vs. B

80.80

89.38

90.13

94.81

A vs. C

77.92

87.59

88.42

93.85

Average

79.36

88.48

89.27

94.33

  1. Partial IAA over two ratings “Correct” and “Incorrect”