LAVA: An Open-Source Approach To Designing LAMP (Loop-Mediated Isothermal Amplification) DNA Signatures
© Torres et al; licensee BioMed Central Ltd. 2011
Received: 25 January 2010
Accepted: 16 June 2011
Published: 16 June 2011
We developed an extendable open-source Loop-mediated isothermal AMPlification (LAMP) signature design program called LAVA (LAMP Assay Versatile Analysis). LAVA was created in response to limitations of existing LAMP signature programs.
LAVA identifies combinations of six primer regions for basic LAMP signatures, or combinations of eight primer regions for LAMP signatures with loop primers. The identified primers are conserved among target organism sequences. Primer combinations are optimized based on lengths, melting temperatures, and spacing among primer sites. We compare LAMP signature candidates for Staphylococcus aureus created both by LAVA and by PrimerExplorer. We also include signatures from a sample run targeting all strains of Mycobacterium tuberculosis.
We have designed and demonstrated new software for identifying signature candidates appropriate for LAMP assays. The software is available for download at http://lava-dna.googlecode.com/.
Loop-mediated isothermal amplification (LAMP) is a DNA amplification technique with high specificity, efficiency, and speed, performed under isothermal conditions [1]. We are using LAMP to perform highly sensitive and specific detection of blood-borne pathogens with a new point-of-care instrument that is in development, targeting pathogens such as Staphylococcus aureus, Pseudomonas aeruginosa, and Streptococcus pneumoniae. This approach has the potential to operate at significantly lower cost than TaqMan PCR detection because it can be performed with less expensive materials and equipment [2, 3].
The most common method for designing LAMP primers is to use PrimerExplorer V4 from Eiken Chemical Co. Ltd. (http://primerexplorer.jp/e/index.html). While PrimerExplorer is very useful for LAMP signature design, as demonstrated by its widespread use, it has several limitations that reduced its usability for our high-throughput whole-genome analysis. First, PrimerExplorer does not support IUPAC characters other than "ATCG" in the input sequence; such characters are often used in multiple sequence alignment (MSA) representations, which PrimerExplorer was not built to handle. Second, PrimerExplorer runs only on Windows operating systems, in a specific web browser. Third, PrimerExplorer cannot design signatures with loop primers, as discussed in Nagamine et al. [4], in a single execution; it instead requires two serial executions, which can prevent more optimal primer combinations from being identified. Fourth, PrimerExplorer is less suited to high-throughput analysis because it is limited to a single execution process on a computer, accepts sequences only up to 2,000 bp, and outputs only HTML.
Find Potential LAMP Primers
MSAs are read as input with BioPerl's Bio::AlignIO module, which understands many different alignment formats. Long MSAs can sometimes be used as input, but have heavy computer resource requirements, and using long MSAs can result in fewer identified signature candidates.
Identifying LAMP signatures for an individual non-MSA sequence 20 kbp in length can take up to 90 minutes on a desktop computer if few design constraints are specified. Identification of signatures for MSAs of this size usually finishes in minutes, because only regions of conservation are considered in this version of LAVA. MSAs can have longer or shorter runtimes, depending on their content. MSAs approaching 100% conservation run similarly to single sequences. As the level of conservation decreases, so does LAVA's runtime, up to a point.
We suggest using the LAVA wrapper SLAVA (Serial-LAVA) for individual sequences and highly conserved MSAs over 10 kbp. SLAVA splits the MSA into sections, executes LAVA for each individual section, and combines the results into a single set of non-identical signatures. Running a series of smaller chunks through SLAVA is likely to result in more optimal signature combinations, because each sub-analysis can identify more primer candidate regions.
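The split, run, and merge pattern described for SLAVA can be sketched in Python as follows. This is a minimal illustration, not SLAVA's actual code: the function names, the `overlap` parameter, and the representation of a signature as a tuple of primer strings are all assumptions made for the example, and `design` stands in for a single LAVA execution.

```python
from __future__ import annotations
from typing import Callable

def split_into_sections(length: int, section_len: int, overlap: int = 0) -> list[tuple[int, int]]:
    """Return half-open (start, end) column ranges that cover [0, length)."""
    if section_len <= 0 or not 0 <= overlap < section_len:
        raise ValueError("need section_len > 0 and 0 <= overlap < section_len")
    sections = []
    start = 0
    while start < length:
        end = min(start + section_len, length)
        sections.append((start, end))
        if end == length:
            break
        start = end - overlap  # hypothetical: let neighbouring sections share columns
    return sections

def run_serially(
    msa: list[str],
    section_len: int,
    overlap: int,
    design: Callable[[list[str]], list[tuple[str, ...]]],
) -> list[tuple[str, ...]]:
    """Run `design` on each section in turn and merge the non-identical signatures."""
    seen: set[tuple[str, ...]] = set()
    merged: list[tuple[str, ...]] = []
    for start, end in split_into_sections(len(msa[0]), section_len, overlap):
        section = [row[start:end] for row in msa]
        for signature in design(section):
            if signature not in seen:  # drop signatures already reported by another section
                seen.add(signature)
                merged.append(signature)
    return merged
```

Because each section is processed independently, the loop body is also the natural unit of work for a parallel version.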
Default values of the most commonly adjusted LAVA parameters
Outer primer length
Middle primer length
Loop primer length
Inner primer length
Outer primer Tm
Middle primer Tm
Loop primer Tm
Inner primer Tm
Maximum signature length
Minimum spacing from middle to inner primers
Maximum consecutive repeated bases
The first sequence in the MSA is the basis for generating primers. The remaining MSA sequences are used to filter out primers that are not identically present in every target sequence. Any sub-sequence containing "N" or "-" characters is not considered a valid primer target. Primers that are shared among all MSA sequences are returned as potential signature components by the OligoEnumerator. During primer generation, a maximum poly-base restriction is enforced, limiting the number of consecutive identical bases in a potential primer region.
Primer analysis and scoring are performed by PrimerAnalyzer modules. Penalties are applied in two separate places: first to each individual primer, and second to each combination of primers that forms a LAMP signature. The PrimerAnalyzer penalty for individual primers in this version of LAVA is simply the Primer3 penalty score, which reflects how closely the primer matches the design parameters. For combinations of primers, the penalty also includes factors for inter-primer spacing. The PrimerAnalyzer returns its assessment of each primer as a PrimerInfo object, which is used to provide primer sequence information for signature output. This may appear redundant because the primers already contain the original sequence information. However, different scores can exist for the same primer in different roles, and some analysis methods may impose context-sensitive sequence restrictions as part of score calculation, so relying on PrimerInfo guarantees that the correct sequence is associated with the analysis result for each context.
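The enumeration rules above can be sketched as a short Python function. It is an illustration under stated assumptions rather than the OligoEnumerator itself: the function name and parameters are hypothetical, sequences are assumed to be upper case, and "identically present" is interpreted as an exact match at the same alignment columns.

```python
from __future__ import annotations
import re

def enumerate_conserved_primers(
    msa: list[str], min_len: int, max_len: int, max_poly_base: int
) -> list[tuple[int, str]]:
    """Return (start, sequence) windows of the first MSA row that are conserved in every row."""
    reference = msa[0]
    # Matches a run of more than `max_poly_base` identical characters.
    poly_run = re.compile(r"(.)\1{%d,}" % max_poly_base)
    found = []
    for start in range(len(reference)):
        for length in range(min_len, max_len + 1):
            end = start + length
            if end > len(reference):
                break
            candidate = reference[start:end]
            # Each failure below also holds for every longer window at this
            # start position, so the inner loop can stop early.
            if "N" in candidate or "-" in candidate:
                break
            if poly_run.search(candidate):
                break
            if any(row[start:end] != candidate for row in msa[1:]):
                break
            found.append((start, candidate))
    return found
```

For example, with `max_poly_base=3` the window "AAA" is accepted while "AAAA" is rejected.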
Filter Down By Overlap And Score
To help control the number of primer combinations that need to be scored, the set of candidate primer regions is down-selected based on overlap. Many potential primers often target the same general sequence region. Of the available primers, the best scoring primers are given priority during down-selection. The lower scoring overlapping primers are removed from consideration if they overlap the higher priority primers by a given percentage.
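A greedy version of this down-selection might look like the following Python sketch. The `Candidate` class, the use of a penalty where lower is better, and the definition of overlap as a fraction of the shorter primer are assumptions made for illustration; the text does not specify how LAVA measures overlap.

```python
from __future__ import annotations
from dataclasses import dataclass

@dataclass(frozen=True)
class Candidate:
    start: int      # first base of the primer region
    end: int        # one past the last base (half-open interval)
    penalty: float  # lower is better

def overlap_fraction(a: Candidate, b: Candidate) -> float:
    """Shared bases as a fraction of the shorter region (an assumed definition)."""
    shared = min(a.end, b.end) - max(a.start, b.start)
    if shared <= 0:
        return 0.0
    return shared / min(a.end - a.start, b.end - b.start)

def down_select(candidates: list[Candidate], max_overlap: float) -> list[Candidate]:
    """Keep the best-scoring candidates; drop any that overlap a kept one too much."""
    kept: list[Candidate] = []
    for cand in sorted(candidates, key=lambda c: c.penalty):  # best score first
        if all(overlap_fraction(cand, k) <= max_overlap for k in kept):
            kept.append(cand)
    return sorted(kept, key=lambda c: c.start)
```

Visiting candidates from best to worst score means a lower-scoring primer can never displace a higher-priority one it overlaps.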
Evaluate Primer Combinations
Finally, primers are combined into nested sets that can serve as LAMP signatures. The overall LAMP signature penalty is the weighted combination of inner, middle, and outer pair penalties, plus context-dependent spacing penalties. Inter-primer spacing penalty increases as distance increases. The default objective function includes slightly decreasing weights for the penalties of inner primers, middle primers, outer primers, and loop primers respectively.
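One way to picture the objective function is the Python sketch below. The weight values, the linear spacing penalty, and the `per_base` constant are hypothetical placeholders chosen only to respect the ordering described above (inner, then middle, then outer, then loop); they are not LAVA's defaults.

```python
from __future__ import annotations

# Hypothetical weights: slightly decreasing from inner to loop primers.
DEFAULT_WEIGHTS = {"inner": 1.2, "middle": 1.1, "outer": 1.0, "loop": 0.9}

def spacing_penalty(gap: int, per_base: float = 0.01) -> float:
    """Penalty that grows with the distance between adjacent primers (assumed linear)."""
    if gap < 0:
        raise ValueError("primers in a signature must not overlap")
    return per_base * gap

def signature_penalty(
    pair_penalties: dict[str, float],
    gaps: list[int],
    weights: dict[str, float] = DEFAULT_WEIGHTS,
    per_base: float = 0.01,
) -> float:
    """Weighted sum of primer-pair penalties plus spacing penalties; lower is better."""
    weighted = sum(weights[role] * penalty for role, penalty in pair_penalties.items())
    spacing = sum(spacing_penalty(gap, per_base) for gap in gaps)
    return weighted + spacing
```

A signature without loop primers simply omits the "loop" entry from `pair_penalties`.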
If the minimum number of signatures is not identified for the target, the entire primer combination process is repeated with different primer overlap cutoff percentages. The amount of overlap permitted is set for each iteration, based on a "schedule" of primer overlap percentages. Since these repeat attempts are effectively multiple runs of LAVA, often with more individually considered primers, regions with difficult-to-identify signatures will take longer to process. Processing time can increase exponentially as the primer overlap restrictions are relaxed.
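The retry loop over the overlap schedule can be sketched in Python as below. The function and parameter names are hypothetical, and `select` and `combine` are stand-ins for the down-selection and combination steps; the sketch only shows the control flow, in which each schedule entry triggers a full repeat of both steps.

```python
from __future__ import annotations
from typing import Callable, Optional, Sequence

def design_with_schedule(
    candidates: list,
    schedule: Sequence[int],
    min_signatures: int,
    select: Callable[[list, int], list],
    combine: Callable[[list], list],
) -> tuple[list, Optional[int]]:
    """Repeat selection and combination, relaxing the overlap cutoff until enough signatures exist.

    Returns the signatures from the last attempt and the overlap percentage used for it.
    """
    signatures: list = []
    used: Optional[int] = None
    for max_overlap_pct in schedule:
        used = max_overlap_pct
        survivors = select(candidates, max_overlap_pct)  # more overlap allowed -> more primers kept
        signatures = combine(survivors)
        if len(signatures) >= min_signatures:
            break  # stop as soon as the minimum is met
    return signatures, used
```

Ordering the schedule from strict to permissive keeps the cheap attempts first, so the expensive, lightly filtered runs happen only for difficult targets.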
Strain sequences used for LAMP signature comparison between LAVA and PrimerExplorer
GenBank GI Number
GenBank Accession Number
Staphylococcus aureus RF122, complete genome
Staphylococcus aureus subsp. aureus JH9, complete genome
Staphylococcus aureus subsp. aureus JH1, complete genome
Staphylococcus aureus subsp. aureus Mu50, complete genome
Staphylococcus aureus subsp. aureus N315, complete genome
Staphylococcus aureus subsp. aureus MW2, complete genome
Staphylococcus aureus subsp. aureus NCTC 8325, complete genome
Staphylococcus aureus subsp. aureus COL, complete genome
Staphylococcus aureus subsp. aureus MRSA252, complete genome
Staphylococcus aureus subsp. aureus MSSA476, complete genome
Staphylococcus aureus subsp. aureus USA300, complete genome
Staphylococcus aureus subsp. aureus USA300_TCH1516, complete genome
Staphylococcus aureus subsp. aureus str. Newman, complete genome
Staphylococcus aureus subsp. aureus Mu3, complete genome
LAMP signature candidate regions for S. aureus, as generated by both LAVA and PrimerExplorer. Tm calculated with BioPerl using calculations from SantaLucia [8] with 50 mM salt concentration and 50 nM oligo concentration
LAMP signature candidates for Mycobacterium tuberculosis, with gene targets based on the reference H37Rv genome [GenBank: NC_000962.2]. The hyphen in FIP and BIP sequences represents where the two segments should be linked together
Rv2735c and recX
When developing LAVA, getting usable LAMP signatures was our priority, so we have not spent time on optimizations, or on many of the features we desire. Operationally, LAVA has fulfilled our needs, but there are several components that are worthy of further attention. This discussion explores aspects of LAVA's design and operation that we believe will improve or augment LAVA's performance the most.
LAVA currently excludes dimerization checks during primer selection, which would contribute to the relative scores of signature candidates. This would have been accomplished using the UNAFold [10] libraries to analyze inter-oligo interactions and primer self-annealing. This is currently omitted because, in some instances, the increase in processing time required to analyze primer combinations was unacceptable. Predicting inter-oligo hybridization is still a serious concern with regard to LAMP signature design, so we separately perform dimerization analysis on the finished signature candidates, and avoid using candidates with high dimerization potential. We will include dimerization checks as part of the native signature design when possible.
There are two primary options for controlling the number of primer combinations that need to be analyzed. The first is limiting the number of primer candidates that Primer3 can identify in each execution. This can be accomplished by narrowing the acceptable range of primer design parameters such as length and melting temperature. Primer3 output can also be limited by explicitly setting a maximum number of primers for Primer3 to generate. The second option is to set stricter primer overlap limits. Stringent overlap limits may make it possible to perform the comprehensive dimerization checks discussed above within reasonable processing times.
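As an illustration of the first option, the Python sketch below writes a Primer3 input record in Boulder-IO format with a narrow size and melting temperature range and an explicit cap on the number of primers returned. The tag names are those used by current Primer3 releases; which tags and values LAVA passes to Primer3 is not stated in the text, so the helper function and its arguments are assumptions.

```python
from __future__ import annotations

def primer3_record(
    template: str,
    size_range: tuple[int, int],
    tm_range: tuple[float, float],
    num_return: int,
) -> str:
    """Build one Boulder-IO record that narrows Primer3's search space."""
    min_size, max_size = size_range
    min_tm, max_tm = tm_range
    tags = {
        "SEQUENCE_TEMPLATE": template,
        "PRIMER_MIN_SIZE": min_size,
        "PRIMER_OPT_SIZE": (min_size + max_size) // 2,  # optimum must lie inside the range
        "PRIMER_MAX_SIZE": max_size,
        "PRIMER_MIN_TM": min_tm,
        "PRIMER_OPT_TM": (min_tm + max_tm) / 2,
        "PRIMER_MAX_TM": max_tm,
        "PRIMER_NUM_RETURN": num_return,  # explicit cap on primers generated
    }
    lines = [f"{tag}={value}" for tag, value in tags.items()]
    return "\n".join(lines) + "\n=\n"  # a lone "=" terminates the record
```

Tightening `size_range` or `tm_range`, or lowering `num_return`, reduces the number of candidates that later have to be combined and scored.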
We have observed through computer predictions that an optional linker sequence, used to connect the two components of the FIP and BIP (F1c and F2, and B1c and B2, respectively), has the potential to disrupt the LAMP reaction. In general, calculations based on a "TTTT" linker sequence predict a slight increase in sensitivity in many of the likely hybridization configurations, because of slightly longer and more stable base pairing at the 5' end of the hybridization. However, in one instance, this linker greatly increased predicted primer self-hybridization because of an unfortunate coincidence of self-similar sequence. The chance of this being a problem increases if the number of consecutive identical bases in the designed primers cannot be limited. Improper choice of linker sequence can also increase predicted primer dimerization. A more context-aware linker design should yield better results than always using a single linker sequence. In the future, we hope LAVA will suggest the most appropriate linker sequence, or omission of the linker, for each designed signature, to help support desired assay behavior. One potential approach is to design linkers that are as little complementary as possible to the loop regions adjacent to the F1 region in the LAMP dumbbell structures. Another potential approach is to design linker-free primers like those of Poon et al. [11], which is currently the default behavior of LAVA.
When designing LAMP signatures for difficult-to-target sequences, we found that we fell into a general pattern of relaxing design parameters. On average, the order in which we adjusted parameters was to increase the melting temperature range, then decrease the minimum primer length, and finally increase the maximum signature length.
The LAVA wrapper SLAVA was implemented as a serial execution of individual MSA segments. This is provided to enable signature design for long sequences such as whole bacterial genomes, and also to demonstrate how a parallel implementation would be structured. During the further development of LAVA, we hope to transition into a high-performance grid computing environment with a parallel LAVA implementation.
We are separately developing an OligoEnumerator for creating primers with degeneracy, which are primers with base variations designed to accommodate strain variation. So far, we have used a "masked" MSA representation of the target to design signatures. However, since Primer3 does not accept sequence containing the standardized IUPAC character codes, the MSA representation has so far been limited to perfect consensus sequence. The fundamental difference in approach for designing primers with degeneracy is that primers are enumerated based on all the sequences of the MSA, instead of based on only the first sequence. Our proof of concept requires different internal representations of sub-sequence MSAs, but is built using the existing OligoEnumerator and PrimerAnalyzer interfaces.
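To make the idea of enumerating from all MSA sequences concrete, the Python sketch below collapses each alignment column of a window into a standard IUPAC code. It is not the proof of concept described above: the function name and the `max_degenerate` limit are assumptions, and windows containing gaps or non-ACGT input characters are simply rejected.

```python
from __future__ import annotations
from typing import Optional

# Standard IUPAC nucleotide codes, keyed by the set of bases each one represents.
IUPAC_CODES = {
    frozenset("A"): "A", frozenset("C"): "C", frozenset("G"): "G", frozenset("T"): "T",
    frozenset("AG"): "R", frozenset("CT"): "Y", frozenset("CG"): "S", frozenset("AT"): "W",
    frozenset("GT"): "K", frozenset("AC"): "M", frozenset("CGT"): "B", frozenset("AGT"): "D",
    frozenset("ACT"): "H", frozenset("ACG"): "V", frozenset("ACGT"): "N",
}

def degenerate_primer(msa: list[str], start: int, end: int, max_degenerate: int) -> Optional[str]:
    """Return an IUPAC-coded primer for columns [start, end), or None if the window is unusable."""
    codes = []
    degenerate = 0
    for column in zip(*(row[start:end] for row in msa)):
        bases = frozenset(column)
        code = IUPAC_CODES.get(bases)
        if code is None:  # gap or ambiguous character in at least one sequence
            return None
        if len(bases) > 1:
            degenerate += 1
            if degenerate > max_degenerate:  # hypothetical cap on variable positions
                return None
        codes.append(code)
    return "".join(codes)
```

With `max_degenerate=0` the function reduces to the perfect-consensus behavior described for the current version.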
We have designed and demonstrated new software for identifying signature candidates appropriate for LAMP assays. LAVA is available as open source, downloadable from the project home page. The focus of LAVA is to improve on other currently available software by accommodating high-throughput signature design, while providing a framework to develop more sophisticated algorithmic and analytical tools. We have used LAVA to design LAMP signatures for several organisms, which are currently undergoing bench screening and optimization for use in a point-of-care detection instrument.
Availability and Requirements
The project described was supported by Award Number U54EB007959 (Gerald J. Kost, PI of the UC Davis/LLNL POCT Center) from the National Institute of Biomedical Imaging and Bioengineering. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institute of Biomedical Imaging and Bioengineering or the National Institutes of Health. This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344.
1. Notomi T, Okayama H, Masubuchi H, Yonekawa T, Watanabe K, Amino N, Hase T: Loop-mediated isothermal amplification of DNA. Nucleic Acids Research 2000, 28(12):e63. 10.1093/nar/28.12.e63
2. Hara-Kudo Y, Yoshino M, Kojima T, Ikedo M: Loop-mediated isothermal amplification for the rapid detection of Salmonella. FEMS Microbiology Letters 2005, 253:155–161. 10.1016/j.femsle.2005.09.032
3. Mori Y, Kitao M, Tomita N, Notomi T: Real-time turbidimetry of LAMP reaction for quantifying template DNA. Journal of Biochemical and Biophysical Methods 2004, 59(2):145–157. 10.1016/j.jbbm.2003.12.005
4. Nagamine K, Hase T, Notomi T: Accelerated reaction by loop-mediated isothermal amplification using loop primers. Molecular and Cellular Probes 2002, 16(3):223–229. 10.1006/mcpr.2002.0415
5. Stajich JE, Block D, Boulez K, Brenner SE, Chervitz SA, Dagdigian C, Fuellen G, Gilbert JG, Korf I, Lapp H, Lehväslaiho H, Matsalla C, Mungall CJ, Osborne BI, Pocock MR, Schattner P, Senger M, Stein LD, Stupka E, Wilkinson MD, Birney E: The Bioperl toolkit: Perl modules for the life sciences. Genome Res 2002, 12(10):1611–1618.
6. Rozen S, Skaletsky H: Primer3 on the WWW for general users and for biologist programmers. Bioinformatics Methods and Protocols: Methods in Molecular Biology 2000, 365–386.
7. Slezak T, Kuczmarski T, Ott L, Torres C, Medeiros D, Smith J, Truitt B, Mulakken N, Lam M, Vitalis E, Zemla A, Zhou CE, Gardner S: Comparative genomics tools applied to bioterrorism defense. Briefings in Bioinformatics 2003, 4(2):133–149. 10.1093/bib/4.2.133
8. SantaLucia J: A unified view of polymer, dumbbell, and oligonucleotide DNA nearest-neighbor thermodynamics. Proceedings of the National Academy of Sciences 1998, 95:1460–1465. 10.1073/pnas.95.4.1460
9. Iwamoto T, Sonobe T, Hayashi K: Loop-Mediated Isothermal Amplification for Direct Detection of Mycobacterium Tuberculosis Complex, M. avium, and M. intracellulare in Sputum Samples. Journal of Clinical Microbiology 2003, 41(6):2616–2622. 10.1128/JCM.41.6.2616-2622.2003
10. Markham NR, Zuker M: DINAMelt web server for nucleic acid melting prediction. Nucleic Acids Res 2005, 33:W577–W581. 10.1093/nar/gki591
11. Poon L, Leung C, Tashiro M, Chan K, Wong B, Yuen K, Guan Y, Peiris J: Rapid Detection of the Severe Acute Respiratory Syndrome (SARS) Coronavirus by a Loop-Mediated Isothermal Amplification Assay. Clinical Chemistry 2004, 50(6):1050–1052. 10.1373/clinchem.2004.032011
12. PrimerExplorer V4 from Eiken Chemical Co. Ltd. [http://primerexplorer.jp/e/index.html]
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.