Skip to main content
Figure 2 | BMC Bioinformatics

Figure 2

From: Gene prediction in eukaryotes with a generalized hidden Markov model that uses hints from external sources

Figure 2

Combined hints. The information retrieved from a combination of EST and protein database searches. The input DNA sequence contains one gene of which the dark boxes are the coding parts. At first, ESTs matching the DNA sequence are found and clustered. The concatenation of the segments of the input DNA sequence which are aligned to the clustered ESTs is then searched against a protein database. The protein match can be used to infer which part of the EST consensus sequence was coding. In this example the alignment of the protein started at the first position of its amino acid sequence. Thus a likely translation start site (start hint) can be inferred.

Back to article page