Skip to main content
Fig. 4 | BMC Bioinformatics

Fig. 4

From: PhyloSophos: a high-throughput scientific name mapping algorithm augmented with explicit consideration of taxonomic science, and its application on natural product (NP) occurrence database processing

Fig. 4

Integration of species-compound pair information using standardization pipeline. A: Statistics of species-compound pairs collected from four NP occurrence databases. Dark blue: unique species-compound pairs, Gold: duplicate species-compound pairs (denote multiple support from NP occurrence databases), Grey: pairs which contain scientific names that are not mapped to NCBI taxonomy at least at species level. B: Taxonomic composition of scientific names found in species-compound pair information. Green: plant, Purple: fungi, Dark blue: metazoa (animal), Gold: bacteria, Cyan: archaea, Dark red: virus, Grey: others. C: Degree of support of species-compound pairs. Dark blue: one database (n = 737,314), Gold: two databases (n = 223,560), Grey: three databases (n = 35.046), Purple: four databases (n = 5,609)

Back to article page