Skip to main content
Figure 1 | BMC Bioinformatics

Figure 1

From: ASPIC: a novel method to predict the exon-intron structure of a gene that is optimally compatible to a set of transcript sequences

Figure 1

The figure illustrates two gene-factorizations into 7 and 4 pseudo-exons of the genomic sequence G. Let S1, S2 and S3 be EST sequences in S agreeing to the genomic sequence G, where sequence S1 = ABDEF, S2 = ABCDE and S3 = BDEFG, each letter in {A, B, C, D, E, F, G} denotes a sequence (A). In (B) and (C) two alternative EST-genome alignments of sequences S1, S2 and S3 are represented: each EST factorization of S i associated with the EST-genome alignment is shadowed. Pseudo-exons in the gene-factorization are colored white, while introns are in grey. Segments labelled by letters represent regions of the genomic sequence that align to a substring of the input sequence of the corresponding letter. Note that an approach that aligns independently each sequence S1, S2 and S3 to G, one after the other, may produce the gene-factorization <A, B, C, D, F, E, G> consisting of 7 pseudo-exons (B), while the one minimizing the number of pseudo-exons provides only 4 pseudo-exons (C). Indeed, there are EST factorizations of each S i that are compatible or variant compatible with the gene-factorization G E = <AB, C, DE, FG>. More precisely, <AB, DE, F> is an EST-factorization of S1 that is compatible to G E . Then <AB, C, DE> is an EST-factorization of S2 compatible to G E . Finally, <B, DE, FG> is an EST-factorization of S3 compatible with G E (C).

Back to article page