Figure 2 | BMC Bioinformatics

Figure 2

From: Quantitative biomedical annotation using medical subject heading over-representation profiles (MeSHOPs)

Alternative Approaches for Generating MeSHOPs, Depicted as Word Clouds. All MeSHOPs depict annotation of the HTT gene, which is causal for Huntington Disease. (A) Raw counts. (B) Statistical enrichment scores. The top 150 terms in the profile are shown, with the font size of each term proportional to the negative log p-value for that term. Note the presence of many general terms that are implied by more specific terms: “Vertebrates”, “Primates”, “Chordata” and “Mammals” are all present but are covered by the term “Humans”. Also, when studying a set of human genes, the terms “Humans” and “Genes” occur commonly and should be down-weighted accordingly. (C) Redundancy Filtered HTT Gene Biomedical Term Word Cloud. The more general terms in (B) have been filtered out, leaving only the most specific terms in the profile. For example, the term “Repetitive Sequences, Nucleic Acid” seen in (B) has been filtered out due to the presence of the term “Trinucleotide Repeat Expansion”. (D) Redundancy Filtered HTT Gene Biomedical Term Word Cloud using human gene background. Only the subset of PubMed articles related to human genes is taken as the background, and the filtering seen in (C) is also applied.
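
The redundancy filtering in (C) and the font scaling in (B) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the `PARENTS` map is a hypothetical toy stand-in for the MeSH hierarchy, the p-values are made up, and the function names and the `max_size` parameter are assumptions introduced here. The caption does not state the base of the logarithm; base 10 is used, which does not affect proportional sizing.

```python
import math

# Hypothetical child -> parents map; a toy stand-in for the MeSH hierarchy.
PARENTS = {
    "Humans": ["Primates"],
    "Primates": ["Mammals"],
    "Mammals": ["Vertebrates"],
    "Vertebrates": ["Chordata"],
    "Trinucleotide Repeat Expansion": ["Repetitive Sequences, Nucleic Acid"],
}

def ancestors(term):
    """Return every term above `term` in the toy hierarchy."""
    seen = set()
    stack = list(PARENTS.get(term, []))
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(PARENTS.get(current, []))
    return seen

def filter_redundant(profile):
    """Drop terms that are ancestors of another term in the profile."""
    implied = set()
    for term in profile:
        implied |= ancestors(term)
    return {t: p for t, p in profile.items() if t not in implied}

def font_sizes(profile, max_size=48.0):
    """Scale font size linearly with -log10(p); the top score gets max_size.

    Assumes every p-value lies in (0, 1].
    """
    scores = {t: -math.log10(p) for t, p in profile.items()}
    top = max(scores.values())
    if top <= 0:  # all p-values equal 1: nothing is enriched
        return {t: 0.0 for t in scores}
    return {t: max_size * s / top for t, s in scores.items()}

# Made-up p-values, for illustration only.
profile = {
    "Humans": 1e-20,
    "Primates": 1e-18,
    "Mammals": 1e-15,
    "Vertebrates": 1e-12,
    "Chordata": 1e-10,
    "Repetitive Sequences, Nucleic Acid": 1e-30,
    "Trinucleotide Repeat Expansion": 1e-40,
}

filtered = filter_redundant(profile)
sizes = font_sizes(filtered)
```

With this toy input, only “Humans” and “Trinucleotide Repeat Expansion” survive the filter, mirroring the examples given in the caption. The sketch removes a general term whenever any descendant is present; whether the original method applies further conditions is not stated here.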
