| BC3 Dev Set | Multi-word | MeSH Term | Stemmed GRs | Feature Cut | Higher Order |
---|
 |  | UNI | BI | TRI |  |  |  |  |
Run 1 | Â | X | X | Â | X | Â | Â | Â |
Run 2 | X | X | X | Â | X | Â | Â | Â |
Run 3 | Â | X | X | X | X | X | X | Â |
Run 4 | X | X | X | X | X | X | X | Â |
Run 5 | Â | X | X | X | X | X | X | X |
- The training data used in official submissions includes all examples of previous BioCreative PPI article tasks. However, the BioCreative III development set was selectively added for training in different runs. Unigrams (UNI), bigrams (BI), and trigrams (TRI) were used as multi-word features. MeSH feature is unigrams and bigrams from MeSH terms. For grammar relations (GRs), stemming was performed on Run 3 through Run 5. Feature cut was performed based on the frequency threshold four.