Figure 5
From: Rule-based knowledge aggregation for large-scale protein sequence analysis of influenza A viruses

Experimental workflow of this study. The workflow has three main stages: the retrieval and merging of the source documents from public databases; the extraction of metadata by multiple structural rules; and the semantic restructuring of the sequence metadata, which identifies isolates, and subsequently re-annotates the sequences.