Skip to main content
Figure 6 | BMC Bioinformatics

Figure 6

From: Investigating selection on viruses: a statistical alignment approach

Figure 6

Model Setup. A graphic representation of our method. As input, we give the 'ancestral' sequence S1, its gene structure G, our desired partition P and our region annotation R of the partition segments. We also input the 'descendent' sequence S2, as well as our seed parameters for ( f 1 r , f 2 r , f 3 r ) R MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeiikaGIaemOzay2aa0baaSqaaiabigdaXaqaaiabdkhaYbaakiabcYcaSiabdAgaMnaaDaaaleaacqaIYaGmaeaacqWGYbGCaaGccqGGSaalcqWGMbGzdaqhaaWcbaGaeG4mamdabaGaemOCaihaaOGaeiykaKYaaSbaaSqaaiabdkfasbqabaaaaa@3C61@ , a, and b. From this we may generate both our seed emission matrix E and the type-annotation-array t = [t1, t2, t3] belonging to each locus along the sequence S1. These then get input into our alignment procedure, which subsequently over the sum of all possible alignments, calculates the expected counts C of a certain substitution of a certain type in a certain region. This information gets transferred to our maximum-likelihood (ML) method, which generates our new parameter values, maximizing the expected observations C. The resulting emission matrix E gets fed back into our alignment procedure, and the loop continues until a change in parameters is below some given threshold.

Back to article page