Fig. 5 | BMC Bioinformatics

Fig. 5

From: CSN: unsupervised approach for inferring biological networks based on the genome alone

Exploring novel genomes using the CSN algorithm. a The Common Substring Network of metagenomic sample MGYA00382686. The network includes 1685 nodes (sequences) and 136,996 edges. b Node degree distribution. Dot plot of the distribution of node degrees in the CSN on a log-log scale. Dashed lines represent the regression lines between each degree and the number of times it appears in the graph [Spearman: rho = −0.512, p = 3.89 × 10^−23; Pearson: rho = −0.204, p = 2.05 × 10^−4]. c SAFE analysis of the metagenomic sample. Eleven terms are each enriched in a particular region of the network: r2: IPR002932; r3: IPR006860; r4: IPR000209, IPR011991; r5: IPR004358; r6: IPR010930, IPR016156; r7: IPR000795, IPR010559, IPR015883, IPR035684 (see Table S15 for more details about the functions and their definitions).
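
The degree statistics in panel b can be illustrated with a minimal Python sketch. It is not the authors' code: the edge list is a hypothetical toy graph, the helper names (`degree_counts`, `pearson`, `ranks`, `spearman`) are invented for illustration, and applying Pearson to log10-transformed values is an assumption, since the caption does not state which values were correlated. Spearman's coefficient is rank-based and therefore unaffected by the log transform.

```python
import math
from collections import Counter

def degree_counts(edges):
    """Map each degree value to the number of nodes that have that degree."""
    degree = Counter()
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    return Counter(degree.values())

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

def ranks(values):
    """1-based ranks; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    out = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            out[order[k]] = mean_rank
        i = j + 1
    return out

def spearman(xs, ys):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(xs), ranks(ys))

# Hypothetical toy network (not data from the paper).
edges = [("a", "b"), ("a", "c"), ("a", "d"), ("a", "e"), ("b", "c"), ("d", "f")]

counts = degree_counts(edges)
degrees = sorted(counts)                      # distinct degree values
frequencies = [counts[d] for d in degrees]    # nodes per degree value

rho_s = spearman(degrees, frequencies)
# Assumption: Pearson is computed on the log-log values shown in the plot.
r_p = pearson([math.log10(d) for d in degrees],
              [math.log10(f) for f in frequencies])
print(f"Spearman = {rho_s:.3f}, Pearson (log10) = {r_p:.3f}")
```

A negative coefficient, as reported in the caption, means that higher degrees occur less often in the network.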
