Accurate and fast methods to estimate the population mutation rate from error prone sequences

Knudsen, Bjarne; Miyamoto, Michael M

doi:10.1186/1471-2105-10-247

BMC Bioinformatics

Table 1 Definitions of the parameters and factors used in Equation (13) of the KM model


Parameter or factor	Description
m_s	The number of segregating sites within the current set of sampled and/or ancestral sequences
σ_i	Counts the number of "singletons" for sampled or ancestral sequence i. Here, "singleton" refers both to the derived mutations of the shared polymorphisms for the sampled sequences as well as to those of the observed singletons (in the strict sense) within the original dataset (Figure 3).
P_c(α_i(S))	Probability of α_i(S), which is the current set of sampled and/or ancestral sequences prior to a mutation in sequence i
P_c(β_ij(S))	Probability of β_ij(S), which is the current set of sampled and/or ancestral sequences after the coalescence of combinable sequences i and j (s_i ~ s_j; see below)
P_c(S)	Probability of S, which is the current ordered set of n sampled and/or ancestral sequences (s₁, s₂, ..., s_n) during a particular coalescent interval in the genealogy
s_i, s_j	Sampled and/or ancestral sequences i and j (where i ≠ j)
s_i ~ s_j	Signifies that the available regions of sampled and/or ancestral sequences i and j are at least compatible and that the two are therefore combinable (i.e., can coalesce)
|s_i|	Measures the relative degree to which sampled or ancestral sequence i is a complete or partial sequence. Thus, Σ_i|s_i| summarizes the total available length of all sampled and/or ancestral sequences during a particular coalescent interval.
|S|	Summarizes the current number of sampled and/or ancestral sequences during a particular coalescent interval in the genealogy
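
As a rough illustration of how the set-level quantities in Table 1 relate to one another, the following Python sketch computes s_i ~ s_j, |s_i|, Σ_i|s_i| and |S| for a toy set S. Everything about the encoding is an assumption made for this example and is not taken from the article: sequences are aligned strings, "?" marks an unavailable site, and |s_i| is read as the fraction of available sites. The function names are hypothetical, and the sketch does not implement Equation (13) itself.

```python
from itertools import combinations

MISSING = "?"  # assumed marker for an unavailable site (not from the article)


def compatible(s_i: str, s_j: str) -> bool:
    """s_i ~ s_j: the sequences agree at every site available in both."""
    if len(s_i) != len(s_j):
        raise ValueError("sequences must be aligned to the same length")
    return all(a == b for a, b in zip(s_i, s_j) if MISSING not in (a, b))


def relative_length(s_i: str) -> float:
    """|s_i|, read here as the fraction of available sites (1.0 = complete)."""
    return sum(c != MISSING for c in s_i) / len(s_i)


def combinable_pairs(S: list[str]) -> list[tuple[int, int]]:
    """Index pairs (i, j), i < j, whose sequences could coalesce."""
    return [
        (i, j)
        for i, j in combinations(range(len(S)), 2)
        if compatible(S[i], S[j])
    ]


# Toy ordered set S of sampled and/or ancestral sequences.
S = ["0100", "01??", "1?00"]

size_of_S = len(S)                                # |S|
total_length = sum(relative_length(s) for s in S)  # sum over i of |s_i|
pairs = combinable_pairs(S)                        # pairs with s_i ~ s_j
```

In this toy set only the first two sequences are combinable, because the third differs from both at the first site, which is available in all three.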


ISSN: 1471-2105
