Skip to main content
Fig. 4 | BMC Bioinformatics

Fig. 4

From: A novel procedure on next generation sequencing data analysis using text mining algorithm

Fig. 4

Two-way hierarchical clustering analysis of the LDA-derived strain-topic matrix. The complete-link hierarchical clustering algorithm was applied on the Euclidean distance measures of the topics in any two of the strains in the dataset. The heat map shows that the 119 strains are clustered into four groups (I to IV groups). I: Agona; II: Saintpaul, Paratyphi B, Schwarzengrund and Stanley; III: Typhimurium, Typhimurium var.5- and 4,[5],12:i:- ; IV: Heidelberg. The color histogram from blue to red shows the value of the topic weights of the strains ranged from 0 to 1

Back to article page