Position dependent mismatch discrimination on DNA microarrays – experiments and model

Naiser, Thomas; Kayser, Jona; Mai, Timo; Michel, Wolfgang; Ott, Albrecht

doi:10.1186/1471-2105-9-509

Research article
Open access
Published: 01 December 2008


BMC Bioinformatics volume 9, Article number: 509 (2008)


Abstract

Background

The propensity of oligonucleotide strands to form stable duplexes with complementary sequences is fundamental to biological and biotechnological processes as diverse as microRNA signalling, microarray hybridization and PCR. Yet our understanding of oligonucleotide hybridization, in particular in the presence of surfaces, is rather limited. Here we use oligonucleotide microarrays made in-house by optically controlled DNA synthesis to produce probe sets comprising all possible single base mismatches and base bulges for each of 20 sequence motifs under study.

Results

We observe that mismatch discrimination is mostly determined by the defect position (relative to the duplex ends) as well as by the sequence context. We investigate the thermodynamics of the oligonucleotide duplexes on the basis of a double-ended molecular zipper model. Theoretical predictions of the defect positional influence and of the long-range sequence influence agree well with the experimental results.

Conclusion

Molecular zipping at thermodynamic equilibrium explains the binding affinity of mismatched DNA duplexes on microarrays well. The position dependent nearest-neighbor (PDNN) model can be inferred from it. A quantitative understanding of microarray experiments from first principles is within reach.

Background

The well-known double-helix structure of nucleic acids results from sequence-specific binding between complementary single strands. Sequential formation of A·T and C·G base pairs along the two complementary strands results in stable duplexes. This so-called hybridization process is fundamental to many biological processes and biotechnologies. Microarrays consist of surface-tethered probe sequences, which act as specific scavengers for their respective complementary target sequences. This molecular recognition enables a highly parallel detection of nucleic acid sequences in complex target mixtures. Hybridization also occurs with single mismatched (MM) base pairs; however, these duplexes are significantly less stable than the corresponding perfect match (PM) [1, 2]. The single base pair mismatch-discrimination capability of short (~20 nt) oligonucleotide probes provides an important diagnostic tool for the detection of point mutations and single nucleotide polymorphisms (SNPs) [3]. DNA duplex stability arises from hydrogen bonding and base stacking interactions (the latter comprise van der Waals, electrostatic and hydrophobic interactions between adjacent base pairs). According to the well-established nearest-neighbor model, the thermodynamic stability of a nucleic acid duplex can be considered the sum of these nearest-neighbor (NN) interactions [4–6]. The binding free energy of an oligonucleotide duplex can be predicted from the nearest-neighbor free energy parameters: the helix propagation parameters (one for each of the 10 possible base-pair doublets in the case of a DNA/DNA duplex) account for the duplex sequence. Further parameters provide corrections for duplex initiation, A·T terminal pairs or a symmetry penalty in the case of self-complementary sequences. The NN model adequately predicts oligonucleotide duplex melting temperatures T_M in bulk solution [7]. 
Datasets of Watson-Crick NN parameters [8] provide the basis for nucleic acid structure and melting temperature prediction software like the DINAMelt web server [9] (UNAFold), the HYTHER server and others. The NN model can be extended beyond the Watson-Crick pairs to include single base MM defects [7, 10].

In spite of good knowledge about nucleic acid hybridization in solution, the prediction of binding affinities on DNA microarrays remains empirical. Recent microarray studies [11–15] report that the influence of even a point defect on hybridization signal intensity cannot be predicted easily. In particular, the influence of defect position on the hybridization signal is stronger than the influence of MM type [12, 14, 16].

Experiments show that the two-state nearest-neighbor (TSNN) approach [7], which has been very successful in predicting duplex stability in solution, does not appropriately describe MM binding affinities on DNA microarrays. The NN model does not account for the position of the individual NN pairs [7], except for the outermost ones. Based on microarray data, Zhang et al. [17] proposed a position dependent nearest-neighbor (PDNN) model. The model assumes that the duplex binding free energy can be expressed as a weighted sum of stacking energies with empirically derived positional weight parameters [17–21]. The purpose of this study is to investigate the influence of point defects on (surface bound) hybridization experimentally and theoretically. Previous studies investigate mismatch discrimination with samples of very different sequence motifs [11, 12]. However, other effects such as secondary structure formation or competitive binding may reduce the visibility of the impact of the MM-defect on the binding affinity. To avoid such complications we performed experiments with fixed sequence motifs: We focus on small variations of the probe sequences. We perform hybridization studies with home-made microarrays comprising sets of very similar probe sequences. We use a single target sequence in each hybridization assay in order to avoid inter-target binding as well as target competition of different sequences for one and the same probe sequence. In order to avoid excluded volume interactions or secondary structure we limit the length of the target sequence to be of the order of the probes. These simplifications (described in detail in [14]) enable a detailed investigation of the influences of defect type, defect position, flanking base pairs and the sequence motif on the binding affinity. The extensive set of hybridization affinities obtained from our experiments enables us to perform a very complete analysis. 
We compare the experimental data to theoretical modeling based on a double-ended molecular zipper approach (the double-ended nucleic acid zipper has been described previously in [22–26]). We find that in order to reproduce the microarray hybridization signal in our model, the heterogeneity of binding affinities – mostly owing to in situ synthesis-related probe defects (e.g. probe polydispersity) – needs to be taken into account. Moreover, synthesis defects turn out to be useful for the parallel detection of many different sequences.

Methods

DNA Microarray Hybridization Experiments

Hybridization assays are performed on high-density oligonucleotide microarrays (see Fig. 1). These microarrays (DNA Chips) are fabricated in-house [14] on the basis of light-directed solid-phase combinatorial chemistry [27, 28]. A "maskless" photolithographic technique [13, 29–31] based on a digital micromirror device type spatial light modulator (DMD™, Texas Instruments Inc.) enables tailor-made design of DNA microarrays (with up to 25000 different probe sequences) on a laboratory scale. Point defects – single base substitutions, insertions and deletions – are produced in the in situ synthesis process by variation of the nucleotide coupling scheme for the particular probe sequence.

Protocols for the preparation of dendrimer-functionalized microarray substrates (adapted from [32]) and for the light-directed synthesis (based on NPPOC-phosphoramidites [33]), as well as details on the hybridization assay and on fluorescence microscopy based microarray analysis (Fig. 1) are provided in Naiser et al. [14, 15].

In each microarray hybridization assay a probe set of cognate probes with purposefully introduced point mutations – derived from a common probe sequence motif – is hybridized against a single target sequence, which perfectly matches the probe sequence motif. We systematically vary defect type and defect position to provide the complete "defect profile" of hybridization affinities with probe sets. We include not only all single base mismatches (MMs), but also, in order to investigate mismatch discrimination in a broader context of other sequence defects, we consider single base bulges (originating from insertions and deletions) as well as probes with multiple defects. Since the ca. 130 probes within each probe set differ only by single bases we are able to distinguish between defect-positional and sequence influence. In our experimental conditions hybridization equilibrium is reached after a few tens of minutes. Further details can be found in [14].

Results

Position Dependent Influence of Single Base MMs and Bulges on Probe-Target Binding Affinity

From the fluorescence micrograph (Fig. 1), we obtain the hybridization signals, which we plot as a function of defect position (Fig. 2). We note a strong influence of the defect position on probe-target binding affinity which is larger than the influence of the defect type. We find that bulge defects display a very similar position-dependent influence on hybridization signal intensity to mismatches. Furthermore we observe that the magnitude of mismatch discrimination (and bulge discrimination) at a particular defect position (i.e. the shape of the defect profile) depends on the duplex sequence.

As can be inferred directly from Fig. 2, defects in the middle of the probes are most destabilizing. In the center of a 16 mer duplex a single nucleotide MM typically reduces the hybridization signal to 0–40% of the corresponding PM duplex hybridization signal. Defect type and nearest-neighbor effects have less influence on the hybridization signal than defect position. Our experiments show a mostly monotonic decrease of hybridization signals from the duplex ends towards the center of the duplex, over a range of typically 5–8 defect positions for 16 mer probes and up to 14 positions for some 25 mer sequence motifs. This is consistent with previous work [11, 12].

In order to separate the defect positional influence (DPI) for a particular probe sequence motif from the defect type related influences we run a moving average filter on the defect profile. We observe that the DPI is not only a simple function of the distance between the defect and the duplex-ends, but it is also related to the nucleotide sequence (compare Fig. 2A and 2B and Fig. 3A and 3B).
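
The smoothing step can be sketched as a centered moving average over the defect profile. This is a minimal illustration only: the window length, the handling of the profile ends (the window simply shrinks there) and the example signal values are assumptions, since the text does not specify them.

```python
def moving_average(signal, window=3):
    """Centered moving average; the window shrinks at the profile ends.

    `window` is a hypothetical choice, the text does not state the
    filter length that was actually used.
    """
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd integer")
    half = window // 2
    smoothed = []
    for i in range(len(signal)):
        lo = max(0, i - half)
        hi = min(len(signal), i + half + 1)
        smoothed.append(sum(signal[lo:hi]) / (hi - lo))
    return smoothed


# Hypothetical normalized hybridization signals versus defect position.
profile = [0.9, 0.7, 0.8, 0.4, 0.5, 0.2, 0.3, 0.2, 0.5, 0.4, 0.8, 0.7, 0.9]
dpi = moving_average(profile, window=3)
```

The smoothed curve `dpi` retains the slowly varying positional trend, while the residual `profile[i] - dpi[i]` carries the defect-type related scatter.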

We also perform hybridization experiments on oligonucleotide duplexes with two single base deletion defects at varying positions x and y. The results show that the binding affinity also depends on the relative position of the defects (for details see Additional file 1, Fig. S5 and [26]). The hybridization signal is largest if each defect is located close to an end. The lowest binding affinities are observed for defect configurations that divide the sequence into three subsequences of roughly equal length. For closely spaced defects (separated by fewer than four nucleotides) the impact increases systematically with their separation.

Discussion

Single base mismatches and base bulges alike show a strong, trough-shaped position-dependent influence that is biased by the sequence motif under consideration. Experimental evidence for an influence of the sequence context (beyond the nearest neighbors) on the stability of single base pair MMs has been reported previously by Benight and coworkers [34] (hybridization of short 31 bp linear oligonucleotide duplexes in bulk solution); however, such effects have not yet been systematically quantified. The commonly used two-state model of nucleic acid hybridization between the microarray probe P and the target strand T, resulting in the formation of the duplex D, is described by Eq. 1.

$$P + T \underset{k_{diss}}{\overset{k_{nuc}}{\rightleftharpoons}} D \tag{1}$$

In thermodynamic equilibrium duplex nucleation (determined by the slow nucleation rate k_nuc) is balanced by duplex dissociation with the dissociation rate k_diss. The widely used two-state nearest-neighbor model (including mismatched NN-dimers as described by [10]) cannot explain this positional influence, because it does not account for the position of the individual nearest-neighbor dimers. We assume that the nucleation rates k_nuc of very similar duplexes (differing by a single base pair, e.g. a PM duplex and a corresponding mismatched duplex) are virtually identical. Thus, the positional dependence observed experimentally can be expected to result from differences in k_diss. In agreement with [25] we show that the positional influence originates from end-domain unzipping. Our experimental findings suggest a common mechanism for the DPI that is independent of the defect type. Further, the relatively long range of the DPI (Fig. 3A and 3B) suggests that molecular dynamics may well be a good candidate for an explanation. The symmetry of the DPI (with respect to the duplex ends) and sequence-specific deviations from this symmetry indicate a zipping-related mechanism. Thus, in order to account for partially denatured duplex states, we use a double-ended zipper model of the oligonucleotide duplex to determine mismatched oligonucleotide duplex stabilities as a function of defect position. We consider a situation in thermodynamic equilibrium.

Double-ended Zipper Model

We check if a double-ended zipper model [22–25] (Fig. 4), considering end-domain-denaturation only, is appropriate to describe the experimental observations. Internal denaturation, due to the large bubble initiation barrier (owing to stacking interactions towards both sides of a nucleotide) and due to the relatively short length of the duplexes, is expected to be negligible [22]. Using a partition function approach the impact of point defects is investigated at thermodynamic equilibrium. We perform this analytically, independently of a particular sequence, as well as numerically with sequence-dependence – using unified NN-parameters [8].

According to Craig et al. [35] a kinetic scheme describing helix growth and dissociation is given in Eq. 2.

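$$P + T \underset{k_{-}}{\overset{k_{nuc}}{\rightleftharpoons}} D_{1} \underset{k_{-}}{\overset{k_{+}}{\rightleftharpoons}} D_{2} \underset{k_{-}}{\overset{k_{+}}{\rightleftharpoons}} \dots \underset{k_{-}}{\overset{k_{+}}{\rightleftharpoons}} D_{n-1} \underset{k_{-}}{\overset{k_{+}}{\rightleftharpoons}} D_{n} \tag{2}$$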

k₊ and k₋ are the fast zipping and unzipping rates determined by the nearest-neighbor propagation parameters of the individual base-pair doublets. The time evolution of the oligonucleotide zipper can be considered a biased random walk with a finite probability for complete dissociation (described by the duplex dissociation rate k_diss). Since we consider thermodynamic equilibrium, we can use a partition function for fast numerics.

Partition Function Approach (PFA) to Investigate Oligonucleotide Duplex Thermodynamics

We use a partition function approach [22–25] and investigate if the double-ended zipper model can reproduce our experimental results. On the basis of unified NN-Parameters [8] we calculate statistical weights of partially denatured duplex states. The effect of partial binding with respect to microarray data was discussed earlier in [24–26].

The partition function Z_D of the duplex (Eq. 3) is the sum of the statistical weights w_{k,l} of all partially hybridized duplex states S_{k,l} (see Fig. 4).

$$Z_{D} = \sum_{k=0}^{N-1} \sum_{l=k+1}^{N} w_{k,l} = \sum_{k=0}^{N-1} \sum_{l=k+1}^{N} e^{ΔG_{k,l}^{\circ} / RT} \tag{3}$$

The statistical weight w_{k,l} of the partially denatured state S_{k,l} is calculated from the sum $ΔG_{k,l}^{\circ}$ of the NN free energies $Δg_{i}^{\circ}$ of the unzipped duplex sections (Eq. 4). $ΔG_{k,l}^{\circ}$ can be considered the free energy level of the partially denatured state.

$$ΔG_{k,l}^{\circ} = \sum_{i=1}^{k} Δg_{i}^{\circ} + \sum_{i=l+1}^{N} Δg_{i}^{\circ}, \qquad ΔG_{0,l}^{\circ} = \sum_{i=l+1}^{N} Δg_{i}^{\circ}, \qquad ΔG_{k,N}^{\circ} = \sum_{i=1}^{k} Δg_{i}^{\circ} \tag{4}$$

NN free energies of Watson-Crick NN-pairs are deduced from unified NN parameters [7].

$$Δg_{i}^{\circ} = Δh_{i}^{\circ} - T \cdot Δs_{i}^{\circ} \tag{5}$$
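
As an illustration of Eq. 5, the following sketch evaluates the NN free energies along one strand of a Watson-Crick duplex. The enthalpy and entropy values are the unified NN parameters as they are commonly tabulated; they are entered here as an assumption and should be checked against the original tables before any quantitative use. The function names and the example sequence are hypothetical.

```python
# Unified Watson-Crick NN parameters: (dH in kcal/mol, dS in cal/(mol K)).
# ASSUMPTION: values as commonly tabulated for the unified parameter set;
# verify against the original publication before quantitative use.
NN_PARAMS = {
    "AA": (-7.9, -22.2), "AT": (-7.2, -20.4), "TA": (-7.2, -21.3),
    "CA": (-8.5, -22.7), "GT": (-8.4, -22.4), "CT": (-7.8, -21.0),
    "GA": (-8.2, -22.2), "CG": (-10.6, -27.2), "GC": (-9.8, -24.4),
    "GG": (-8.0, -19.9),
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (5'->3')."""
    return seq.translate(COMPLEMENT)[::-1]


def nn_free_energies(seq, temperature_k):
    """NN free energies dg_i = dh_i - T * ds_i (kcal/mol) along `seq`."""
    energies = []
    for i in range(len(seq) - 1):
        step = seq[i:i + 2]
        if step not in NN_PARAMS:
            # The same doublet read on the opposite strand is tabulated.
            step = reverse_complement(step)
        dh, ds = NN_PARAMS[step]
        energies.append(dh - temperature_k * ds / 1000.0)
    return energies


# Hypothetical 16 mer probe motif, evaluated at 37 degrees Celsius.
dg = nn_free_energies("ACGTTGCATGCAAGCT", 310.15)
two_state_sum = sum(dg)  # propagation terms only, no initiation terms
```

Only the propagation terms are summed, in line with the simplification stated in the text that duplex initiation free energies are neglected.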

For the completely dissociated duplexes we estimate the partition functions of probes Z_P and targets Z_T as

$$Z_{P} = Z_{T} = e^{ΔG_{D}^{\circ} / (2RT)}, \qquad ΔG_{D}^{\circ} = \sum_{i=1}^{N} Δg_{i}^{\circ} \tag{6}$$

For simplicity duplex initiation free energies have been neglected here. Based on the duplex sequence we can now calculate the duplex binding constant

$$K = \frac{Z_{D}}{Z_{P} Z_{T}} = \frac{Z_{D}}{e^{ΔG_{D}^{\circ} / RT}} \tag{7}$$
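
Eqs. 3, 4 and 7 translate directly into a short numerical routine. The sketch below is one possible implementation, not the authors' code: it takes a list of NN free energies (in kcal/mol, zero-based indexing), enumerates all states with k NN pairs opened from the left end and N - l opened from the right end, and returns Z_D and K. The function names are hypothetical.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol K)


def zipper_partition_function(dg, temperature_k):
    """Z_D of Eq. 3 for a list `dg` of N nearest-neighbor free energies."""
    n = len(dg)
    rt = R_KCAL * temperature_k
    # prefix[k] holds the summed free energy of the first k NN pairs.
    prefix = [0.0]
    for g in dg:
        prefix.append(prefix[-1] + g)
    total = prefix[n]
    z = 0.0
    for k in range(n):                  # NN pairs opened at the left end
        for l in range(k + 1, n + 1):   # NN pairs l+1..N opened at the right
            dg_open = prefix[k] + (total - prefix[l])   # Eq. 4
            z += math.exp(dg_open / rt)
    return z


def binding_constant(dg, temperature_k):
    """K of Eq. 7; duplex initiation free energies are neglected."""
    rt = R_KCAL * temperature_k
    return zipper_partition_function(dg, temperature_k) / math.exp(sum(dg) / rt)


# Hypothetical homopolymer-like duplex with 15 identical NN pairs.
k_pm = binding_constant([-1.4] * 15, 310.15)
```

The fully zipped state (k = 0, l = N) contributes a weight of exactly 1, so Z_D is never smaller than 1 and K is never smaller than the two-state value $e^{-ΔG_{D}^{\circ}/RT}$.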

Consideration of Point Defects

We introduce a defect parameter $Δg_{def}^{\circ}$ (a simplified description of the mismatch NN parameters in [7, 10]) to account for the point defect at the defect position x (a similar approach is described in [25]).

An analytical derivation of the DPI for homopolymer sequences shows that the partition function (provided as a function of defect position – see Eq. 8) is increased for defects located near the duplex ends.

$$Z_{D}(x) = Z_{D_{PM}} + \left(e^{\frac{(N-x) Δg^{\circ}}{RT}} + e^{\frac{x Δg^{\circ}}{RT}}\right) \left(e^{\frac{δΔg_{def}^{\circ}}{RT}} - 1\right) \tag{8}$$

In Eq. 8 the defect impact $δΔg_{def}^{\circ} = Δg_{def}^{\circ} - Δg^{\circ}$ has been factored out, revealing a general (defect-type independent) position dependence that is largely governed by the distance of the defect at position x from the duplex ends. Defects close to the duplex ends increase end-domain opening. The partition function is increased because more (partially denatured) duplex states are thermally populated. The defect destabilization $δΔg_{def}^{\circ}$ determines how far Z_D is elevated with respect to the perfect match partition function ($Z_{D_{PM}}$ ≈ 1) and thus how far the DPI propagates into the interior of the duplex. With Eqs. 7 and 8 we obtain an expression for the DPI on the duplex binding constant K(x).

$$K(x) = \frac{\left(e^{\frac{x Δg^{\circ}}{RT}} + e^{\frac{(N-x) Δg^{\circ}}{RT}}\right) \left(e^{\frac{δΔg_{def}^{\circ}}{RT}} - 1\right) + 1}{e^{\frac{N Δg^{\circ}}{RT}} \, e^{\frac{δΔg_{def}^{\circ}}{RT}}} \tag{9}$$

Fig. 5 illustrates Eq. 9 for two different duplex stabilities.

While defects near the duplex ends result in only low mismatch discrimination (i.e. a small reduction of K with respect to the PM binding affinity), defects in the center result in higher MM discrimination, as K then approaches the value of the two-state equilibrium constant. NN-pair free energy increments $δΔg_{def,37}^{\circ}$ for single base MMs are in the range of 1 to 3 kcal/mol per NN-pair (derived from NN parameters [8, 10]). Employing these values in Eq. 9 for Δg° = -1.4 kcal/mol (Fig. 5A), DPI propagation is restricted to 3 or 6 NN-pairs, respectively. However, in subsequences with weakly bound NN-pairs (as demonstrated in Fig. 5B) the DPI can propagate further towards the middle of the duplex.
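
The trough-shaped position dependence of Eq. 9 can be reproduced numerically. The sketch below evaluates K(x) for a homopolymer with Δg° = -1.4 kcal/mol and the two defect increments discussed above; the duplex length N = 16, the temperature of 310.15 K and the function name are assumptions chosen for illustration.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol K)


def k_homopolymer(x, n, dg, ddg_def, temperature_k):
    """Binding constant K(x) of Eq. 9 for a defect at position x."""
    rt = R_KCAL * temperature_k
    end_terms = math.exp(x * dg / rt) + math.exp((n - x) * dg / rt)
    numerator = end_terms * (math.exp(ddg_def / rt) - 1.0) + 1.0
    denominator = math.exp(n * dg / rt) * math.exp(ddg_def / rt)
    return numerator / denominator


N, DG, T = 16, -1.4, 310.15          # hypothetical duplex and temperature
k_pm = math.exp(-N * DG / (R_KCAL * T))   # two-state PM binding constant

profiles = {}
for ddg_def in (1.0, 3.0):           # defect increments in kcal/mol
    profiles[ddg_def] = [
        k_homopolymer(x, N, DG, ddg_def, T) / k_pm for x in range(1, N)
    ]
```

Each entry of `profiles` is the binding constant relative to the perfect match. Near the ends the ratio approaches 1, whereas in the center it approaches the two-state discrimination factor $e^{-δΔg_{def}^{\circ}/RT}$.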

Relation Between the Hybridization Signal and the Binding Free Energy ΔG_D

In order to compare our numerical analysis to the experimentally observed hybridization signals we need to understand how the hybridization signal (fluorescence intensity from hybridized targets) is linked to duplex stability. As detailed below, the Langmuir adsorption model with its assumption of a single (homogeneous) binding affinity within a microarray feature does not describe the experimentally observed hybridization signal intensities well. In this section we account for the heterogeneity that is introduced by in situ synthesis related random mutations of the microarray probe sequences.

The importance of the adsorption model for the description of microarray hybridization has been discussed previously in [36–39]. In the simplest description the equilibrium between single stranded probes and targets and hybridized duplexes, T + P ⇄ D, can be described by a Langmuir-type adsorption isotherm (Eq. 10). Under our experimental conditions targets were in sufficient excess, so that the target concentration [T] = [T₀] can be taken as constant. Since the hybridization signal intensity is expected to be proportional to the fraction of hybridized probes θ = [D]/[P₀], we will in the following refer to θ as the hybridization signal.

$$θ = \frac{[D]}{[P_{0}]} = \frac{K \cdot [T_{0}]}{1 + K \cdot [T_{0}]} \tag{10}$$

Taking $K = e^{- Δ G_{D}^{\circ} / R T}$ we obtain a sigmoidal relation between the hybridization signal and duplex free energy ΔG_D.

$$θ = \frac{e^{-ΔG_{D}^{\circ} / RT} \cdot [T_{0}]}{1 + e^{-ΔG_{D}^{\circ} / RT} \cdot [T_{0}]} \tag{11}$$
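
The sigmoidal relation of Eq. 11 can be explored with a few lines of code. This is a minimal sketch with hypothetical function names; the defaults T = 310 K and [T₀] = 1 nM are example conditions, and the expression is rearranged into a numerically stable logistic form that is algebraically identical to Eq. 11.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol K)


def langmuir_theta(dg_duplex, temperature_k=310.0, target_conc=1e-9):
    """Fraction of hybridized probes (Eq. 11); dg_duplex in kcal/mol."""
    rt = R_KCAL * temperature_k
    # z = -ln(K [T0]); theta = 1 / (1 + exp(z)), evaluated without overflow.
    z = dg_duplex / rt - math.log(target_conc)
    if z >= 0.0:
        e = math.exp(-z)
        return e / (1.0 + e)
    return 1.0 / (1.0 + math.exp(z))


def midpoint_free_energy(temperature_k=310.0, target_conc=1e-9):
    """Duplex free energy at which theta = 0.5, i.e. K [T0] = 1."""
    return R_KCAL * temperature_k * math.log(target_conc)


dg_half = midpoint_free_energy()
curve = [(dg, langmuir_theta(dg)) for dg in range(-25, 0)]
```

Tabulating `curve` shows that θ saturates for free energies well below `dg_half` and vanishes well above it, leaving only a limited window in which changes in $ΔG_{D}^{\circ}$ are visible in the signal.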

Our experimental data suggest an approximately linear relation between the hybridization signal and the duplex binding free energies (within the free energy range covered by the defect profiles). However, Eq. 11 provides an approximately linear relation between θ and ΔG_D only within a narrow range δΔG° ≈ 6 kcal/mol (at T = 310 K and [T₀] = 1 nM). This cannot reproduce the experimentally observed DPI of the hybridization signal, since the free energy range of the defect profile exceeds the transition region. To investigate how the fluorescence intensity of hybridized targets is related to duplex stability on the microarray surface we performed a hybridization assay comprising sets of probes in which the probe length (assumed to be roughly proportional to duplex free energy) is incrementally increased (inset in Fig. 6). The experimental results in Fig. 6 show a sigmoid relation between the hybridization signal and probe length. However, the transition region, over which a monotonic increase of the hybridization signal is observed, extends over at least 13 base pairs ($δΔG_{D,37}^{\circ}$ ≈ 20 kcal/mol). In agreement with our findings, a linear relation between microarray hybridization signals (on spotted microarrays) and duplex binding free energies $ΔG_{D}^{\circ}$ (derived from calorimetric measurements) has been reported recently by Fish et al. [40]. The large deviation from the Langmuir equation agrees with previous observations [25, 41]. An effective isotherm with a broadened transition region, a Sips isotherm, has been reported [25, 42, 43] to provide a better description of surface hybridization on microarrays. This isotherm can result from a heterogeneous, Gaussian distribution of binding affinities. Reasons given for the heterogeneity include variation of the probe local environment, surface electrostatics [44] and entropic blockage [45]. 
As we show in the following, a major contribution to the heterogeneity of binding affinities is probe polydispersity [25, 38, 46, 47]. It results from sequence defects generated in the in situ synthesis process of DNA Chips, which introduces single base mismatches, base bulges and truncations.

Assuming a stepwise error rate of 10%, more than 90% of the 25 mer duplexes contain at least one synthesis error [13]. Since the number of synthesis errors per probe follows a binomial distribution, the majority of the strands contains between one and three single base defects.
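
These fractions follow from the binomial distribution and can be checked with a short calculation. The sketch assumes that each of the 25 coupling steps fails independently with the same probability; the function name is hypothetical.

```python
from math import comb


def defect_count_distribution(length, error_rate):
    """Binomial probabilities of finding m = 0..length defects in a probe."""
    return [
        comb(length, m) * error_rate**m * (1.0 - error_rate) ** (length - m)
        for m in range(length + 1)
    ]


dist = defect_count_distribution(25, 0.10)
p_at_least_one = 1.0 - dist[0]     # probes with one or more defects
p_one_to_three = sum(dist[1:4])    # probes with one, two or three defects
```

Under these assumptions `p_at_least_one` comes out at roughly 0.93 and `p_one_to_three` at roughly 0.69, in line with the statements above.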

We calculate binding constants K_i of the individual, randomly "mutated" probe sequences on the basis of the zipper model. Using the approach of Forman et al. [48] we obtain the total hybridization signal by summing up over the distribution of probes, where the contribution of each individual mutated probe θ_i is described by a Langmuir equation (Eq. 10) with the binding constant K_i. Probe polydispersity (in length as well as in sequence) reproduces a "stretched isotherm" [47] (similar to a Sips isotherm), with a significantly broadened transition region. This explains our experimental results in Figs. 6 and 2 well. A simulation of the transfer function θ($ΔG_{D}^{\circ}$) for various error rates and a comparison between the experimental data in Fig. 6 and the corresponding simulation results are provided in Additional file 1, Fig. S6.
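
The broadening of the isotherm by probe polydispersity can be illustrated with a deliberately simplified stand-in for this procedure. Instead of computing each K_i with the zipper model, the sketch below assumes that every synthesis defect raises the duplex free energy by a fixed, hypothetical penalty of 2 kcal/mol and that the number of defects per probe is binomially distributed. All names and parameter values are assumptions.

```python
import math
from math import comb

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol K)


def langmuir_theta(dg, temperature_k, target_conc):
    """Langmuir isotherm (Eq. 10 with K = exp(-dg / RT)), overflow-safe."""
    z = dg / (R_KCAL * temperature_k) - math.log(target_conc)
    if z >= 0.0:
        e = math.exp(-z)
        return e / (1.0 + e)
    return 1.0 / (1.0 + math.exp(z))


def mixture_theta(dg_pm, error_rate, length=25, penalty=2.0,
                  temperature_k=310.0, target_conc=1e-9):
    """Signal of a polydisperse feature: weighted sum of Langmuir terms.

    ASSUMPTION: each defect costs `penalty` kcal/mol regardless of its
    position; the text uses zipper-model binding constants instead.
    """
    theta = 0.0
    for m in range(length + 1):
        weight = (comb(length, m) * error_rate**m
                  * (1.0 - error_rate) ** (length - m))
        theta += weight * langmuir_theta(dg_pm + m * penalty,
                                         temperature_k, target_conc)
    return theta


def transition_width(error_rate, lo=0.1, hi=0.9):
    """Free energy span (kcal/mol) over which theta rises from lo to hi."""
    grid = [-60.0 + 0.05 * i for i in range(1201)]
    inside = [g for g in grid if lo <= mixture_theta(g, error_rate) <= hi]
    return max(inside) - min(inside)


width_ideal = transition_width(0.0)    # defect-free probes
width_real = transition_width(0.10)    # 10% stepwise error rate
```

Comparing `width_real` with `width_ideal` shows the stretched transition region qualitatively; the absolute numbers depend on the assumed penalty and should not be read as a fit to the data.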

Numerical Analysis of Mismatched Duplex Stability-Comparison with Experimental Results

To model the experimental results with the partition function approach we choose the NN free energy of the mismatched base pair $Δg_{def}^{\circ}$ as a free parameter. $Δg_{def}^{\circ}$ = 1 kcal/mol (at T = 325 K) describes our experimental observations (in particular the dominating positional influence with respect to defect-type related influences) best – see Fig. 3. This value is also in good agreement with bulk solution parameters [10]. Results of the numerical simulation (in Fig. 3A) demonstrate that the shallower slope of the hybridization signal at the right duplex end corresponds to a series of weak NN pairs (as anticipated by Eq. 9). The partition function Z(x) largely determines the positional influence. Additionally, as shown in Fig. 3B, the defect-type related influence (the difference between MM and PM free energies δΔg° affects the statistical weight of the completely dissociated duplex w_D) is reflected in the hybridization affinity K(x) and in the hybridization signal θ(x). In addition to single base pair defects, our binding affinity model also reproduces well our experimental results on the binding affinities of oligonucleotide duplexes with two single base deletion defects (for details see Additional file 1, Fig. S5).
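
A numerical defect scan of this kind can be sketched as follows. The defect is modelled, as in the simplified description above, by replacing a single NN free energy with $Δg_{def}^{\circ}$ = 1 kcal/mol at T = 325 K. The uniform perfect-match values of -1.4 kcal/mol and the function names are hypothetical; in a sequence-dependent calculation the list would hold the NN free energies of the actual probe motif.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol K)


def zipper_binding_constant(dg, temperature_k):
    """K = Z_D / exp(dG_D / RT) for the double-ended zipper (Eqs. 3, 4, 7)."""
    n = len(dg)
    rt = R_KCAL * temperature_k
    prefix = [0.0]
    for g in dg:
        prefix.append(prefix[-1] + g)
    z = 0.0
    for k in range(n):
        for l in range(k + 1, n + 1):
            # Free energy of the NN pairs opened at both duplex ends.
            z += math.exp((prefix[k] + prefix[n] - prefix[l]) / rt)
    return z / math.exp(prefix[n] / rt)


def defect_profile(dg_pm, dg_def, temperature_k):
    """Binding constants K(x) with the defect placed at each NN position x."""
    profile = []
    for x in range(len(dg_pm)):
        dg = list(dg_pm)
        dg[x] = dg_def          # simplified one-parameter defect
        profile.append(zipper_binding_constant(dg, temperature_k))
    return profile


T = 325.0
dg_pm = [-1.4] * 15                       # hypothetical uniform duplex
k_pm = zipper_binding_constant(dg_pm, T)
relative = [k / k_pm for k in defect_profile(dg_pm, 1.0, T)]
```

For the uniform duplex the relative affinities in `relative` form a symmetric trough with its minimum at the central position; with sequence-dependent NN free energies the trough becomes asymmetric.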

In Fig. 7 we investigate the influence of heterogeneous probe-target binding affinities (see previous section) on the shape of the defect profile. If the range of the mismatched duplex free energies is within the transition region we observe an approximately linear relation between the hybridization intensity and the binding free energy [40]. If the free energy range of the defect profile exceeds the narrow transition region (as for example in Fig. 7B at an error rate of 0 percent) the positional influence is hardly visible.

Approximation of the Zipper Model with a Position Dependent Nearest-Neighbor (PDNN) Model

In order to assess the generality of our finding, we examine whether PDNN models, which fit experimental data well, can be inferred from our model framework. We note that zippering has been proposed previously as the rationale behind the PDNN model in [25].

In the following we investigate the contribution of each base pair to duplex stability and ask if there is a position-dependent contribution of Watson-Crick NN pairs in the same way as for defects.

This idea is the basis of the PDNN model [17, 21, 41] in which $Δ G_{D}^{\circ}$ is obtained as a position-dependent weighted sum of nearest-neighbor free energies.

$$ΔG_{D}^{\circ} = \sum_{i=1}^{N} w_{i} Δg_{i}^{\circ} \tag{12}$$
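
Eq. 12 amounts to a weighted sum and can be sketched in a few lines. The parabolic weight function below, which equals 1 in the duplex center and drops to a hypothetical value `w_end` at both ends, is an assumption for illustration and not a fitted weight profile; the NN free energies are likewise hypothetical.

```python
def parabolic_weights(n, w_end=0.5):
    """Hypothetical parabola: 1.0 at the duplex center, w_end at both ends."""
    if n == 1:
        return [1.0]
    center = (n - 1) / 2.0
    return [1.0 - (1.0 - w_end) * ((i - center) / center) ** 2
            for i in range(n)]


def pdnn_free_energy(dg, weights):
    """Position-weighted sum of NN free energies (Eq. 12)."""
    if len(dg) != len(weights):
        raise ValueError("dg and weights must have the same length")
    return sum(w * g for w, g in zip(weights, dg))


# Two hypothetical duplexes built from the same set of NN free energies.
strong_center = [-1.0] * 4 + [-2.2] * 4 + [-1.0] * 4
strong_ends = [-2.2] * 2 + [-1.0] * 8 + [-2.2] * 2

weights = parabolic_weights(12)
dg_center = pdnn_free_energy(strong_center, weights)
dg_ends = pdnn_free_energy(strong_ends, weights)
```

With all weights equal to 1 the expression reduces to the plain nearest-neighbor sum; with center-heavy weights the duplex carrying its strong NN pairs in the middle receives the more negative free energy.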

Following our theoretical approach we create a set of 7500 oligonucleotide duplexes assembled from a given set of NN pairs. Although the TSNN (two-state nearest-neighbor) free energy of these duplexes is identical, the calculation with the zipper model indicates significant differences among the stabilities of the individual duplex sequences (see Additional file 1, Fig. S7). We investigate the positional distribution of NN pairs in the weakest and strongest 5% of the duplexes. We find that in the most stable duplexes the stronger NN-pairs are located in the center, whereas in the least stable duplexes the strong NN-pairs are located near the duplex ends. This result has been reproduced with the partition function based UNAFold software (DINAMelt web server [9]), in excellent agreement with the zipper model. A similar investigation (see Fig. 8) employing a set of random duplexes composed of nonidentical NN-pairs confirms the result. In Additional file 1, Fig. S8 we show that duplex free energy values determined with the zipper model can indeed be approximated with a PDNN model. The positional weights – described by a parabolic function w_i(x) – have their maximum in the middle of the duplex. The results in Fig. 8 and Additional file 1, Fig. S8 indicate that the contribution of the outer NN-positions to duplex stability decreases with increasing temperature. At 340 K the three outermost NN pairs at each end (in total six of 24 NN pairs) have a significantly reduced contribution to duplex free energy. At a lower temperature of about 310 K the positional weights converge to w_i(x) = 1, which is equivalent to the TSNN model.
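
The permutation experiment described above can be mimicked on a small scale. The sketch below shuffles a fixed pool of 24 NN free energies (12 strong and 12 weak pairs with hypothetical values), ranks the resulting duplexes by their zipper free energy at 340 K and compares where the strong pairs sit in the most and least stable 5%. The pool, the sample size of 2000 and the random seed are assumptions chosen to keep the example fast.

```python
import math
import random

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol K)


def zipper_free_energy(dg, temperature_k):
    """Effective duplex free energy -RT ln K from the double-ended zipper."""
    n = len(dg)
    rt = R_KCAL * temperature_k
    prefix = [0.0]
    for g in dg:
        prefix.append(prefix[-1] + g)
    z = sum(math.exp((prefix[k] + prefix[n] - prefix[l]) / rt)
            for k in range(n) for l in range(k + 1, n + 1))
    return prefix[n] - rt * math.log(z)


def mean_center_distance(dg, strong):
    """Average distance of the strong NN pairs from the duplex center."""
    center = (len(dg) - 1) / 2.0
    positions = [i for i, g in enumerate(dg) if g == strong]
    return sum(abs(i - center) for i in positions) / len(positions)


STRONG, WEAK = -2.2, -1.0             # hypothetical NN free energies
rng = random.Random(0)
pool = [STRONG] * 12 + [WEAK] * 12    # identical two-state sum for all

duplexes = []
for _ in range(2000):
    dg = pool[:]
    rng.shuffle(dg)
    duplexes.append(dg)

# Most negative zipper free energy (most stable duplex) first.
duplexes.sort(key=lambda dg: zipper_free_energy(dg, 340.0))
cut = len(duplexes) // 20             # 5% of the set
most_stable, least_stable = duplexes[:cut], duplexes[-cut:]

avg_most = sum(mean_center_distance(d, STRONG) for d in most_stable) / cut
avg_least = sum(mean_center_distance(d, STRONG) for d in least_stable) / cut
```

A smaller value of `avg_most` than of `avg_least` means that the strong pairs cluster toward the center in the most stable duplexes, which is the trend reported in the text.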

Conclusion

In this paper we studied, experimentally and theoretically, the stability of short (l < 26 bp) linear surface-bound oligonucleotide duplexes with single base defects. We demonstrated that the rationale behind position dependent models of oligonucleotide duplex stability is the partial denaturation of the duplexes. We have shown that the strong influence of the defect position on mismatch discrimination [11–14, 16, 49] and the influence of the sequence context beyond nearest neighbors [14, 34] can be quantitatively inferred from a molecular zipper model. Partial (end-domain) denaturation of the duplex, as proposed by us in [26] as well as in [16, 24, 25], results in a positional influence that is entropic in nature. The zipping process is modulated by the sequential arrangement of the base pairs. The model confirms the observed influence of the sequence context beyond the nearest neighbors. Further, the zipper model provides a theoretical foundation for the position dependent nearest-neighbor model of Zhang et al. [17].

In the commonly employed two-state nearest-neighbor model, nucleic acid duplex hybridization/denaturation is considered to be an all-or-none process. According to the literature, end-fraying effects are expected to be small beyond three bases [34]; in the case studied here, however, we conclude that end-fraying plays a non-negligible role. This is surprising since the dissociation probability of individual base pairs decreases towards the center of the duplex in an exponential fashion (see Additional file 1, Fig. S4) and remains very low for most NN-pairs.

We propose that the effect of the defect position on probe-target binding affinities becomes apparent in the hybridization signal intensities due to the unavoidable probe polydispersity of optical synthesis. It indeed appears that the positional dependence of single base MM discrimination is more commonly observed on photolithographically produced DNA oligonucleotide arrays [11–14] than (in large scale studies) on spotted microarrays [40, 50, 51] or in solution-phase experiments. We note, however, that in small studies (investigating few sequences) a positional influence has been reported in solution [52] and on spotted microarrays [49]. The probe polydispersity in our experiments smooths out the steep sigmoid relation between the hybridization intensity and the binding free energy ΔG_D that is expected for defect-free probes. It explains why, within a relatively broad range of $δΔG_{D,37}^{\circ}$ ≈ 20 kcal/mol, variations of the binding free energies, such as the influence of the defect position, are reflected in the hybridization signal intensities by means of an approximately linear relation.

References

Nelson JW, Martin FH, Tinoco I: DNA and RNA Oligomer Thermodynamics – The Effect of Mismatched Bases on Double-Helix Stability. Biopolymers 1981, 20(12):2509–2531. 10.1002/bip.1981.360201204
Patel DJ, Kozlowski SA, Marky LA, Rice JA, Broka C, Dallas J, Itakura K, Breslauer KJ: Structure, dynamics, and energetics of deoxyguanosine-thymidine wobble base pair formation in the self-complementary d(cgtgaattcgcg) duplex in solution. Biochemistry 1982, 21(3):437–444. 10.1021/bi00532a003
Conner BJ, Reyes AA, Morin C, Itakura K, Teplitz RL, Wallace RB: Detection of sickle-cell beta-s-globin allele by hybridization with synthetic oligonucleotides. Proceedings of the National Academy of Sciences of the United States of America 1983, 80: 278–282. 10.1073/pnas.80.1.278
Devoe H, Tinoco I: Stability of helical polynucleotides – base contributions. Journal of Molecular Biology 1962, 4(5):500.
Borer PN, Dengler B, Tinoco I, Uhlenbeck OC: Stability of Ribonucleic-Acid Double-Stranded Helices. Journal of Molecular Biology 1974, 86(4):843–853. 10.1016/0022-2836(74)90357-X
Breslauer KJ, Frank R, Blocker H, Marky LA: Predicting DNA Duplex Stability from the Base Sequence. Proceedings of the National Academy of Sciences of the United States of America 1986, 83(11):3746–3750. 10.1073/pnas.83.11.3746
SantaLucia J, Hicks D: The thermodynamics of DNA structural motifs. Annual Review of Biophysics and Biomolecular Structure 2004, 33: 415–440. 10.1146/annurev.biophys.32.110601.141800
SantaLucia J: A unified view of polymer, dumbbell, and oligonucleotide DNA nearest-neighbor thermodynamics. Proceedings of the National Academy of Sciences of the United States of America 1998, 95(4):1460–1465. 10.1073/pnas.95.4.1460
Markham NR, Zuker M: DINAMelt web server for nucleic acid melting prediction. Nucleic Acids Research 2005, 33: W577-W581. 10.1093/nar/gki591
Allawi HT, SantaLucia J: Thermodynamics and NMR of internal GT mismatches in DNA. Biochemistry 1997, 36(34):10581–10594. 10.1021/bi962590c
Wick LM, Rouillard JM, Whittam TS, Gulari E, Tiedje JM, Hashsham SA: On-chip non-equilibrium dissociation curves and dissociation rate constants as methods to assess specificity of oligonucleotide probes. Nucleic Acids Research 2006, 34(3):e26. 10.1093/nar/gnj024
Pozhitkov A, Noble PA, Domazet-Loso T, Nolte AW, Sonnenberg R, Staehler P, Beier M, Tautz D: Tests of rRNA hybridization to microarrays suggest that hybridization characteristics of oligonucleotide probes for species discrimination cannot be predicted. Nucleic Acids Research 2006, 34(9):e66. 10.1093/nar/gkl133
Naiser T, Mai T, Michel W, Ott A: Versatile maskless microscope projection photolithography system and its application in light-directed fabrication of DNA microarrays. Review of Scientific Instruments 2006, 77(6):063711. 10.1063/1.2213152
Naiser T, Kayser J, Mai T, Michel W, Ott A: Impact of point-mutations on the hybridization affinity of surface-bound DNA/DNA and RNA/DNA oligonucleotide duplexes: comparison of single base mismatches and base bulges. BMC Biotechnology 2008, 8: 48. 10.1186/1472-6750-8-48
Naiser T: Characterization of Oligonucleotide Microarray Hybridization: Microarray Fabrication by Light-Directed in situ Synthesis – Development of an Automated DNA Microarray Synthesizer, Characterization of Single Base Mismatch Discrimination and the Position-Dependent Influence of Point Defects on Oligonucleotide Duplex Binding Affinities. PhD thesis, University of Bayreuth 2008. [http://opus.ub.uni-bayreuth.de/volltexte/2008/461/]
Rennie C, Noyes HA, Kemp SJ, Hulme H, Brass A, Hoyle DC: Strong position-dependent effects of sequence mismatches on signal ratios measured using long oligonucleotide microarrays. BMC Genomics 2008, 9: 317. 10.1186/1471-2164-9-317
Zhang L, Miles MF, Aldape KD: A model of molecular interactions on short oligonucleotide microarrays. Nature Biotechnology 2003, 21(7):818–821. 10.1038/nbt836
Naef F, Magnasco MO: Solving the riddle of the bright mismatches: Labeling and effective binding in oligonucleotide arrays. Physical Review E 2003, 68: 011906. 10.1103/PhysRevE.68.011906
Mei R, Hubbell E, Bekiranov S, Mittmann M, Christians FC, Shen MM, Lu G, Fang J, Liu WM, Ryder T, Kaplan P, Kulp D, Webster TA: Probe selection for high-density oligonucleotide arrays. Proceedings of the National Academy of Sciences of the United States of America 2003, 100(20):11237–11242. 10.1073/pnas.1534744100
Binder H, Preibisch S, Kirsten T: Base pair interactions and hybridization isotherms of matched and mismatched oligonucleotide probes on microarrays. Langmuir 2005, 21(20):9287–9302. 10.1021/la051231s
Held GA, Grinstein G, Tu Y: Relationship between gene expression and observed intensities in DNA microarrays – a modeling study. Nucleic Acids Research 2006, 34(9):e70. 10.1093/nar/gkl122
Gibbs JH, Dimarzio EA: Statistical mechanics of helix-coil transitions in biological macromolecules. Journal of Chemical Physics 1959, 30: 271–282. 10.1063/1.1729886
Kittel C: Phase transition of a molecular zipper. American Journal of Physics 1969, 37(9):917. 10.1119/1.1975930
Deutsch JM, Liang S, Narayan O: Modelling of microarray data with zippering. Preprint q-bio.BM/0406039 v1 2004.
Binder H: Thermodynamics of competitive surface adsorption on DNA microarrays. Journal of Physics-Condensed Matter 2006, 18(18):S491-S523. 10.1088/0953-8984/18/18/S02
Naiser T, Ehler O, Mai T, Michel W, Ott A: Hybridization to surface-bound oligonucleotide probes: Influence of point defects. 2006.
Fodor SPA, Read JL, Pirrung MC, Stryer L, Lu AT, Solas D: Light-directed, spatially addressable parallel chemical synthesis. Science 1991, 251(4995):767–773. 10.1126/science.1990438
McGall GH, Barone AD, Diggelmann M, Fodor SA, Gentalen E, Ngo N: The efficiency of light-directed synthesis of DNA arrays on glass substrates. Journal of the American Chemical Society 1997, 119(22):5081–5090. 10.1021/ja964427a
Singh-Gasson S, Green RD, Yue YJ, Nelson C, Blattner F, Sussman MR, Cerrina F: Maskless fabrication of light-directed oligonucleotide microarrays using a digital micromirror array. Nature Biotechnology 1999, 17(10):974–978. 10.1038/13664
Luebke KJ, Balog RP, Mittelman D, Garner HR: Digital optical chemistry: A novel system for the rapid fabrication of custom oligonucleotide arrays. In Microfabricated Sensors, Application of Optical Technology for DNA Analysis. Volume 815. Edited by: Richard Kordal, Usmani, Wai Tak Law. American Chemical Society Publications; 2002:87–106.
Nuwaysir EF, Huang W, Albert TJ, Singh J, Nuwaysir K, Pitas A, Richmond T, Gorski T, Berg JP, Ballin J, McCormick M, Norton J, Pollock T, Sumwalt T, Butcher L, Porter D, Molla M, Hall C, Blattner F, Sussman MR, Wallace RL, Cerrina F, Green RD: Gene expression analysis using oligonucleotide arrays produced by maskless photolithography. Genome Research 2002, 12(11):1749–1755. 10.1101/gr.362402
Le Berre V, Trevisiol E, Dagkessamanskaia A, Sokol S, Caminade AM, Majoral JP, Meunier B, Francois J: Dendrimeric coating of glass slides for sensitive DNA microarrays analysis. Nucleic Acids Research 2003, 31(16):e88. 10.1093/nar/gng088
Hasan A, Stengele KP, Giegrich H, Cornwell P, Isham KR, Sachleben RA, Pfleiderer W, Foote RS: Photolabile protecting groups for nucleosides: Synthesis and photodeprotection rates. Tetrahedron 1997, 53(12):4247–4264. 10.1016/S0040-4020(97)00154-3
Hall TS, Pancoska P, Riccelli PV, Mandell K, Benight AS: Sequence context and thermodynamic stability of a single base pair mismatch in short deoxyoligonucleotide duplexes. Journal of the American Chemical Society 2001, 123(47):11811–11812. 10.1021/ja016360j
Craig ME, Crothers DM, Doty P: Relaxation kinetics of dimer formation by self complementary oligonucleotides. Journal of Molecular Biology 1971, 62(2):383. 10.1016/0022-2836(71)90434-7
Halperin A, Buhot A, Zhulina EB: Sensitivity, specificity, and the hybridization isotherms of DNA chips. Biophysical Journal 2004, 86(2):718–730. 10.1016/S0006-3495(04)74150-8
Burden CJ, Pittelkow YE, Wilson SR: Statistical Analysis of Adsorption Models for Oligonucleotide Microarrays. Statistical Applications in Genetics and Molecular Biology 2004, 3: 35. 10.2202/1544-6115.1095
Held GA, Grinstein G, Tu Y: Modeling of DNA microarray data by using physical properties of hybridization. Proceedings of the National Academy of Sciences of the United States of America 2003, 100(13):7575–7580. 10.1073/pnas.0832500100
Binder H, Kirsten T, Loeffler M, Stadler PF: Sensitivity of microarray oligonucleotide probes: Variability and effect of base composition. Journal of Physical Chemistry B 2004, 108(46):18003–18014. 10.1021/jp049593g
Fish DJ, Horne MT, Brewood GP, Goodarzi JP, Alemayehu S, Bhandiwad A, Searles RP, Benight AS: DNA multiplex hybridization on microarrays and thermodynamic stability in solution: a direct comparison. Nucleic Acids Research 2007, 35(21):7197–7208. 10.1093/nar/gkm865
Carlon E, Heim T: Thermodynamics of RNA/DNA hybridization in high-density oligonucleotide microarrays. Physica A-Statistical Mechanics and its Applications 2006, 362(2):433–449. 10.1016/j.physa.2005.09.067
Peterson AW, Wolf LK, Georgiadis RM: Hybridization of mismatched or partially matched DNA at surfaces. Journal of the American Chemical Society 2002, 124(49):14601–14607. 10.1021/ja0279996
Glazer M, Fidanza JA, McGall GH, Trulson MO, Forman JE, Suseno A, Frank CW: Kinetics of oligonucleotide hybridization to photolithographically patterned DNA arrays. Analytical Biochemistry 2006, 358(2):225–238. 10.1016/j.ab.2006.07.042
Vainrub A, Pettitt BM: Coulomb blockage of hybridization in two-dimensional DNA arrays. Physical Review E 2002, 66(4):041905. 10.1103/PhysRevE.66.041905
Halperin A, Buhot A, Zhulina EB: Brush effects on DNA chips: thermodynamics, kinetics, and design guidelines. Biophysical Journal 2005, 89(2):796–811. 10.1529/biophysj.105.063479
Jobs M, Fredriksson S, Brookes AJ, Landegren U: Effect of oligonucleotide truncation on single-nucleotide distinction by solid-phase hybridization. Analytical Chemistry 2002, 74: 199–202. 10.1021/ac010555s
Halperin A, Buhot A, Zhulina EB: On the hybridization isotherms of DNA microarrays: the Langmuir model and its extensions. Journal of Physics-Condensed Matter 2006, 18(18):S463-S490. 10.1088/0953-8984/18/18/S01
Forman JE, Walton ID, Stern D, Rava RP, Trulson MO: Thermodynamics of duplex formation and mismatch discrimination on photolithographically synthesized oligonucleotide arrays. Molecular Modeling of Nucleic Acids 1998, 682: 206–228.
Dorris DR, Nguyen A, Gieser L, Lockner R, Lublinsky A, Patterson M, Touma E, Sendera TJ, Elghanian R, Mazumder A: Oligodeoxyribonucleotide probe accessibility on a three-dimensional DNA microarray surface and the effect of hybridization time on the accuracy of expression ratios. BMC Biotechnology 2003, 3: 6. 10.1186/1472-6750-3-6
Urakawa H, Noble PA, El Fantroussi S, Kelly JJ, Stahl DA: Single-base-pair discrimination of terminal mismatches by using oligonucleotide microarrays and neural network analyses. Applied and Environmental Microbiology 2002, 68: 235–244. 10.1128/AEM.68.1.235-244.2002
Weckx S, Carlon E, De Vuyst L, Van Hummelen P: Thermodynamic behavior of short oligonucleotides in microarray hybridizations can be described using Gibbs free energy in a nearest-neighbor model. Journal of Physical Chemistry B 2007, 111(48):13583–13590. 10.1021/jp075197x
Kierzek R, Burkard ME, Turner DH: Thermodynamics of single mismatches in RNA duplexes. Biochemistry 1999, 38(43):14214–14223. 10.1021/bi991186l
Zhu J, Wartell RM: The effect of base sequence on the stability of RNA and DNA single base bulges. Biochemistry 1999, 38(48):15986–15993. 10.1021/bi9916372

Acknowledgements

The authors thank Dr. Pramod Pullarkat for many helpful discussions and suggestions on this work. Our research was supported by the University of Bayreuth.

Author information

Authors and Affiliations

Experimentalphysik I, Universität Bayreuth, D-95440, Bayreuth, Germany
Thomas Naiser, Jona Kayser, Timo Mai, Wolfgang Michel & Albrecht Ott
Experimentalphysik, Universität des Saarlandes, D-66041, Saarbrücken, Germany
Albrecht Ott

Corresponding author

Correspondence to Thomas Naiser.

Additional information

Authors' contributions

TN developed the experimental setup and methods, carried out the experiments, the statistical data analysis and the computational modeling, and drafted the manuscript. TM participated in the development of the experimental setup and in data interpretation and helped to draft the manuscript. JK and WM participated in the modeling and helped to draft the manuscript. AO conceived of the study, participated in its design and coordination, and aided in drafting the manuscript.

Electronic supplementary material

12859_2008_2494_MOESM1_ESM.pdf

Additional file 1: Supplementary material. We provide additional figures illustrating our experimental data and theoretical analysis. We further provide the detailed analytical derivation of the defect positional influence on homopolymer sequences. (PDF 493 KB)

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

Naiser, T., Kayser, J., Mai, T. et al. Position dependent mismatch discrimination on DNA microarrays – experiments and model. BMC Bioinformatics 9, 509 (2008). https://doi.org/10.1186/1471-2105-9-509

Received: 12 August 2008
Accepted: 01 December 2008
Published: 01 December 2008
DOI: https://doi.org/10.1186/1471-2105-9-509
