Table 1 Examples of calculation of emission probabilities

From: Detection of viral sequence fragments of HIV-1 subfamilies yet unknown

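| Pos. | Sub./Src., Nucl. | A: G | A: T | B: G | B: T | C: G | C: T | D: G | D: T | Src. 1: G | Src. 1: T | Src. 2: G | Src. 2: T | Src. 3: G | Src. 3: T |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | freq | 89 | 0 | 360 | 0 | 393 | 0 | 3 | 0 | 846 | 0 | | | | |
| | p | 0.9989 | 0.0011 | 0.9997 | 0.0003 | 0.9997 | 0.0003 | 0.969 | 0.031 | 0.9999 | 0.0001 | | | | |
| 2 | freq | 65 | 24 | 355 | 5 | 382 | 11 | 3 | 0 | 65 | 24 | 740 | 19 | | |
| | p | 0.73 | 0.27 | 0.986 | 0.014 | 0.972 | 0.028 | 0.969 | 0.031 | 0.73 | 0.27 | 0.975 | 0.025 | | |
| 3 | freq | 30 | 59 | 325 | 35 | 364 | 29 | 0 | 3 | 30 | 59 | 689 | 64 | 0 | 3 |
| | p | 0.34 | 0.66 | 0.903 | 0.097 | 0.926 | 0.074 | 0.0031 | 0.969 | 0.34 | 0.66 | 0.915 | 0.085 | 0.0031 | 0.969 |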

  1. Simplified example of position- and subtype-wise nucleotide frequencies of HIV. For three sites, the subtype-wise nucleotide frequencies for subtypes A, B, C, and D are given on the left-hand side of the table. Below them, the emission probabilities estimated from the frequencies of the respective subtype alone (using ) are shown. The different typefaces (regular, bold, italic) indicate which subtypes should be jointly modelled (i.e. belong to the same source). On the right-hand side of the table, the nucleotide frequencies of the sources (i.e. the aggregated frequencies of the subtypes belonging to each source) and the emission probabilities estimated from these aggregated frequencies are given (using the same ). For the sake of simplicity, only the nucleotides G and T are assumed to exist. Apart from this simplification and the restriction to four subtypes, the example is taken from actual HIV-1 sequences.
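
The estimation described in the note can be illustrated with a minimal sketch. It rests on an assumption: the parameter referred to by "using" is not legible in the note, so the sketch adds a pseudocount of 0.1 to every nucleotide count, a value chosen only because it appears to reproduce the tabulated probabilities. The names `PSEUDOCOUNT`, `emission_probabilities`, and `aggregate` are hypothetical, and the grouping of subtypes into sources at position 3 is read off the aggregated frequencies in the table (A alone, B with C, D alone).

```python
from typing import Dict, List

# Assumption: inferred from the tabulated values, not stated in the note.
PSEUDOCOUNT = 0.1


def emission_probabilities(counts: Dict[str, int],
                           pseudocount: float = PSEUDOCOUNT) -> Dict[str, float]:
    """Turn nucleotide counts into probabilities, adding a pseudocount to each."""
    total = sum(counts.values()) + pseudocount * len(counts)
    return {nucl: (n + pseudocount) / total for nucl, n in counts.items()}


def aggregate(subtype_counts: Dict[str, Dict[str, int]],
              members: List[str]) -> Dict[str, int]:
    """Pool the nucleotide counts of all subtypes that belong to one source."""
    pooled: Dict[str, int] = {}
    for subtype in members:
        for nucl, n in subtype_counts[subtype].items():
            pooled[nucl] = pooled.get(nucl, 0) + n
    return pooled


# Subtype-wise frequencies at position 3 of Table 1.
position3 = {
    "A": {"G": 30, "T": 59},
    "B": {"G": 325, "T": 35},
    "C": {"G": 364, "T": 29},
    "D": {"G": 0, "T": 3},
}

# Source membership at position 3, as implied by the aggregated frequencies.
sources = {1: ["A"], 2: ["B", "C"], 3: ["D"]}

# Emission probabilities per subtype and per source.
subtype_probs = {s: emission_probabilities(c) for s, c in position3.items()}
source_probs = {
    src: emission_probabilities(aggregate(position3, members))
    for src, members in sources.items()
}
```

Under this assumption, source 2 pools B and C into counts of 689 (G) and 64 (T), giving probabilities of about 0.915 and 0.085, in line with the table. For subtype D (counts 0 and 3) the same rule gives roughly 0.031 and 0.969, which suggests that the 0.0031 entries at position 3 may be a typographical slip for 0.031, since the two probabilities should sum to 1.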