Skip to main content
Figure 8 | BMC Bioinformatics

Figure 8

From: A discriminative method for protein remote homology detection and fold recognition combining Top-n-grams and latent semantic analysis

Figure 8

The flowchart of generating Top- n -grams. The multiple sequence alignment is obtained by PSI-BLAST. The protein sequence frequency profile is calculated from the multiple sequence alignment. The frequencies of the 20 standard amino acids in the protein sequence profile are sorted in descending order and then the sorted protein sequence frequency profile is converted to Top-n-grams by combining the n most frequent amino acids.

Back to article page