Fig. 5 | BMC Bioinformatics

Fig. 5

From: Machine learning for discovering missing or wrong protein function annotations

Procedure used to update each Gene Ontology dataset. The sequence IDs are extracted from the 2007 dataset and used to query UniProt for new terms. Obsolete terms are removed, and replaced terms are merged into a single term. A hierarchy (a subset of the Gene Ontology) is built from the new annotations. Finally, the old annotations are removed, and the new dataset is created by concatenating the new annotations with the feature vector and IDs.
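
The procedure in the caption can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' code: the function names (`clean_terms`, `build_hierarchy`, `update_dataset`), the data layout, and the placeholder term names are all assumptions, and the UniProt query is replaced by a hard-coded dictionary so that the example runs offline.

```python
from typing import Dict, List, Set, Tuple


def clean_terms(annotations: Dict[str, Set[str]],
                obsolete: Set[str],
                replaced_by: Dict[str, str]) -> Dict[str, Set[str]]:
    """Drop obsolete terms and map each replaced term onto its replacement."""
    cleaned = {}
    for seq_id, terms in annotations.items():
        kept = set()
        for term in terms:
            if term in obsolete:
                continue  # obsolete terms are removed
            kept.add(replaced_by.get(term, term))  # replaced terms are merged
        cleaned[seq_id] = kept
    return cleaned


def build_hierarchy(annotations: Dict[str, Set[str]],
                    parents: Dict[str, Set[str]]) -> Dict[str, Set[str]]:
    """Return the sub-hierarchy made of the annotated terms and their ancestors."""
    hierarchy: Dict[str, Set[str]] = {}
    stack = [t for terms in annotations.values() for t in terms]
    while stack:
        term = stack.pop()
        if term in hierarchy:
            continue
        hierarchy[term] = set(parents.get(term, ()))
        stack.extend(hierarchy[term])
    return hierarchy


def update_dataset(old_rows: Dict[str, Tuple[List[float], List[str]]],
                   new_annotations: Dict[str, Set[str]]
                   ) -> List[Tuple[str, List[float], List[str]]]:
    """Discard the old annotations and attach the new ones to each ID and feature vector."""
    updated = []
    for seq_id, (features, _old_terms) in old_rows.items():
        updated.append((seq_id, features, sorted(new_annotations.get(seq_id, set()))))
    return updated


# Hypothetical toy data; IDs and term names are placeholders, not real GO terms.
old_rows = {"P1": ([0.1, 0.2], ["t_old"]), "P2": ([0.3, 0.4], ["t_old"])}
# Stand-in for the result of querying UniProt with the extracted sequence IDs.
queried = {"P1": {"t_a", "t_obsolete", "t_b_alt"}, "P2": {"t_b"}}
obsolete = {"t_obsolete"}
replaced_by = {"t_b_alt": "t_b"}
parents = {"t_a": {"t_root"}, "t_b": {"t_a"}, "t_unused": {"t_root"}}

cleaned = clean_terms(queried, obsolete, replaced_by)
hierarchy = build_hierarchy(cleaned, parents)
new_dataset = update_dataset(old_rows, cleaned)
```

In this sketch the hierarchy keeps only terms reachable from the new annotations, so `t_unused` is left out, which mirrors the caption's description of the hierarchy as a subset of the Gene Ontology.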
