University of Turku in the BioNLP'11 Shared Task

BMC Bioinformatics

Table 3 Devel and test results for the BioNLP'11 Shared Task

The performance of our new system on the BioNLP'09 ST GENIA dataset is shown for reference, with task 3 omitted due to a changed metric. For GE-tasks, the Approximate Span & Recursive matching criterion is used. In many tasks, the development and test set results differ considerably, which may be partially explained by noise unseen due to lack of cross-validation and by the event distribution not being stratified across the sets.

ISSN: 1471-2105