Figure 2 | BMC Bioinformatics

Figure 2

From: On the necessity of dissecting sequence similarity scores into segment-specific contributions for inferring protein homology, function prediction and annotation

Regression analysis output (slope β̂ and coefficient of determination r²) for both SMART (version 6) and Pfam (release 27) domains. Panels A and B depict the histograms of the slopes β̂ for the original versus reconstructed scores for SMART domains, calculated for HMMER2 and HMMER3 respectively, while panels C and D depict the corresponding histograms of the slopes β̂ for the Pfam domains. The HMMER2 results exhibit high reproducibility, with an average β̂ of 1.000 ± 0.001 (SMART) and 1.000 ± 0.002 (Pfam), while the HMMER3 results also show good, though slightly worse, reproducibility, with an average β̂ of 1.015 ± 0.017 (SMART) and 1.017 ± 0.013 (Pfam). Panels E, F, G and H show the corresponding histograms for the goodness of fit in terms of r². Similarly, the HMMER2 reconstruction exhibits an excellent fit, with an average r² of 1.000 ± 0.003 (SMART) and 1.000 ± 0.007 (Pfam). The HMMER3 reconstruction follows closely, with an average r² of 0.997 (SMART) and 0.998 (Pfam) and a slightly larger variation of 0.007 (SMART/Pfam). Overall, all values of β̂ and r² are close to one with little variation, which implies that the reconstruction workflow for HMMER2/3 scores is highly reproducible.
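
For readers who want to reproduce this kind of per-domain check, the two reported quantities can be computed as in the minimal sketch below. It assumes an ordinary least-squares fit with an intercept and regresses the original scores on the reconstructed ones; the caption does not state either choice, so both are assumptions. The function name `slope_and_r2` and the example scores are hypothetical and are not taken from the article.

```python
import numpy as np

def slope_and_r2(reconstructed, original):
    """Return (slope, r^2) of a least-squares line original ~ reconstructed.

    Assumption: ordinary least squares with an intercept term.
    """
    x = np.asarray(reconstructed, dtype=float)
    y = np.asarray(original, dtype=float)
    # Degree-1 polynomial fit returns [slope, intercept].
    slope, intercept = np.polyfit(x, y, 1)
    residuals = y - (slope * x + intercept)
    ss_res = np.sum(residuals ** 2)        # unexplained variation
    ss_tot = np.sum((y - y.mean()) ** 2)   # total variation
    return slope, 1.0 - ss_res / ss_tot

# Made-up scores for a single hypothetical domain model.
original = [12.4, 35.1, 58.9, 80.2, 101.7]
reconstructed = [12.5, 35.0, 59.0, 80.1, 101.9]
slope, r2 = slope_and_r2(reconstructed, original)
print(f"slope = {slope:.3f}, r2 = {r2:.3f}")
```

Applying such a function to every domain model and plotting histograms of the resulting slopes and r² values would give distributions of the kind summarised in the panels; a slope and r² both near one indicate that the reconstructed scores track the original scores closely.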
