- Open Access
QuantPrime – a flexible tool for reliable high-throughput primer design for quantitative PCR
BMC Bioinformatics volume 9, Article number: 465 (2008)
Medium- to large-scale expression profiling using quantitative polymerase chain reaction (qPCR) assays are becoming increasingly important in genomics research. A major bottleneck in experiment preparation is the design of specific primer pairs, where researchers have to make several informed choices, often outside their area of expertise. Using currently available primer design tools, several interactive decisions have to be made, resulting in lengthy design processes with varying qualities of the assays.
Here we present QuantPrime, an intuitive and user-friendly, fully automated tool for primer pair design in small- to large-scale qPCR analyses. QuantPrime can be used online through the internet http://www.quantprime.de/ or on a local computer after download; it offers design and specificity checking with highly customizable parameters and is ready to use with many publicly available transcriptomes of important higher eukaryotic model organisms and plant crops (currently 295 species in total), while benefiting from exon-intron border and alternative splice variant information in available genome annotations. Experimental results with the model plant Arabidopsis thaliana, the crop Hordeum vulgare and the model green alga Chlamydomonas reinhardtii show success rates of designed primer pairs exceeding 96%.
QuantPrime constitutes a flexible, fully automated web application for reliable primer design for use in larger qPCR experiments, as proven by experimental data. The flexible framework is also open for simple use in other quantification applications, such as hydrolyzation probe design for qPCR and oligonucleotide probe design for quantitative in situ hybridization. Future suggestions made by users can be easily implemented, thus allowing QuantPrime to be developed into a broad-range platform for the design of RNA expression assays.
The use of real-time quantitative PCR (qPCR)  in medium – (hundreds of transcripts) to large-scale (thousands of transcripts) profiling experiments is growing. While in a large number of experiments qPCR is still mainly used to confirm results obtained by microarray-based hybridization experiments, the number of high-throughput discovery experiments is growing steadily [2, 3], especially for the quantification of transcripts of low abundance (e.g. those coding for transcription factors), due to the low detection limit of the method .
There are surprisingly few free software packages available to the academic research community suitable for the design of primer pairs for such high-throughput projects, for online use or download, including Osprey , Primique  and a few interfaces to Primer3  such as Primer3Plus , AutoPrime , BatchPrimer3 . Additionally, some databases of pre-computed primers, RTPrimerDB , PrimerBank , qPrimerDepot , AtRTPrimer  and DATFAP , have been established. There are numerous commercial and free software packages available for low-throughput design of primers, some of which are highly configurable and well suited for the design of primer pairs for qPCR.
However, none of the available packages combines all the important features (strict parameters for primer design, strict specificity checking and targeted design to avoid problems with contaminating genomic DNA) into a simple pipeline. Instead, with currently available computational tools, users have to either manually move information (such as identifiers, transcript sequences, primer sequences and others) between software packages or perform some steps completely on their own, such as specificity checking using an alignment package like BLAST . Such manual steps make researchers loose valuable time, increase the risk of mistakes (e.g. labeling and sequence errors), and force them to take important decisions based on their personal interpretation of complex problems regarding large amounts of data (such as BLAST alignment sets), which either require expert knowledge or introduce bias into the results. With respect to the available primer pair databases, they are usually of limited scope. Often, only few species are covered (human and mouse being clearly over-represented), few transcripts of the species are represented (especially in databases based on submitted or published primer pairs), or inappropriate primer design parameters for combined analysis were used, requiring time-consuming optimization of PCR amplification conditions.
Here we developed QuantPrime, a program for design and specificity testing of primer pairs for qPCR, designed to meet the needs of the average or advanced user in low- to high-throughput transcript profiling experiments, while keeping the user interface very simple and yet providing important features missing in other available software packages and web services.
QuantPrime includes a relational database for information storage, scripts containing the procedures to perform primer pair design and specificity testing, scripts for sequence installation and maintenance, scripts for command line user interface used in high-throughput design, and a web interface as the main user interface for low- to medium-throughput primer design. For academic users we currently offer web access to the public QuantPrime server (available at http://www.quantprime.de/) or, on demand, compiled scripts for local installation. Commercial users are requested to get in contact with the authors to develop a license agreement.
The public QuantPrime server is currently set up with publicly available transcriptome and genome annotations from 295 different eukaryotic species. Table 1 gives examples of supported species with included features and references. The list can be easily extended according to user requests.
The web interface is designed for maximum simplicity and convenience for the user. Users have to register at the first time they visit the website. The registration step allows users to return at a later time to check the results of longer runs. Their gene lists and jobs are kept confidential, i.e. no information is relayed to other users. Furthermore, registration eases the even distribution of computing resources among users and it is the main mechanism to verify academic affiliation. An account with access to limited computing resources is available for testing purposes.
The work flow starts with the generation of a 'Project' that is associated with the annotation of a species and a certain quantification protocol. The quantification protocol implies certain parameters for primer design and specificity testing; four standard protocols for typical situations are provided:
SYBR Green-based real-time qPCR (accept splice variant hits): typical parameters for real-time qPCR are used, such as 50–150 bp amplicon length, 60°C annealing temperature and strict primer criteria for G/C content and melting temperature (Tm). The specificity testing will allow amplicons present in splice variants of the transcript (more details in the 'Work flow' section).
SYBR Green real-time qPCR (no splice variant hits): as 1, but no amplicons in splice variants of the transcript are allowed.
End-point semi-quantitative PCR (accept splice variant hits): similar to 1, except that longer amplicons are preferred (350–1500 bp) for easier in-gel quantification.
End-point semi-quantitative PCR (no splice variant hits): as 3, but no amplicons in splice variants of the transcript are allowed.
Users are allowed to change any parameter and create custom protocols; see Additional file 1 for a list of all possible parameters.
Next, users should create a list of transcript identifiers in the project for which primer pair design is planned. This list can either be entered manually (using the identifiers of the chosen annotation), or can be created from a similarity-based search using BLAST and a starting query sequence. Additionally, for certain annotations, keywords describing the gene(s) can be used in a text search for identifiers.
Once the list of identifiers is ready, users may proceed to 'Primer finding' (Figure 1), which when started will continue completely in the background; in the meantime users can continue to look at resulting primer pairs or add new transcripts to the list. Larger primer finding projects may take longer time to process, therefore users may close the web browser and return at a later time to check the status of their jobs.
Successful primer pairs are displayed in the 'Results' page (Figure 2), where users can inspect primer pairs in detail (Tm, G/C content, positions within transcript sequence etc., see example in Figure 3) and do bulk export of the primer data (in delimited plain text format) for ordering or local storage.
Users may return at a later time to access their data, as lists of transcripts and primer pairs are automatically saved into their corresponding projects. On the public server, projects are kept for at least a month after the latest update, and may then be deleted by the administrator for space limitation reasons. Thus, users are recommended to export primer data and store locally for reference purposes.
QuantPrime employs a fully automated work flow for design and specificity testing of primer pairs, a process that does not require any intermediate intervention by the user. Once users have added the transcript identifiers to the project, selecting the 'Start' button will initiate the whole primer selection process, and the identified primer pairs will automatically be displayed in the 'Results' page when the process is completed.
The overall work flow of QuantPrime is sketched in Figure 4. It has two main algorithms, one for primer pair design and one for specificity testing, which are accessed by worker threads which check the output of each algorithm and decide upon the measures to be taken. The worker threads operate independent of the web server, processing submitted jobs according to defined load balancing principles (distributing computing power equally between users and projects). Due to the loosely bound system architecture it is straightforward to attach additional computing nodes to the central database allowing for high user loads. For testing purposes, a developer machine was set up to work as a computing node for the public server. With rising demand on the public server, local computing resources can be quickly mobilized to avoid long waiting times for the end users.
The primer pair design algorithm uses the Primer3 software to design primer pair candidates; a graphical representation can be found in Figure 5.
The Primer3 design parameters can be specified by the user when setting up the project; default settings are as follows:
● Primer length: 20–24 bases
● Amplicon size: 60–150 bp
● Primer melting temperatures (Tm): 64 +/- 3°C (for optimal annealing around 60°C) (using nearest neighbor thermodynamics ), maximum 2°C Tm difference between forward and reverse primers
● Amplicon melting temperature: 75–95°C
● G/C content: 45–55%
● Max. repetition of a nucleotide: 3
● G/C-clamp: last 3' base of each primer must be a G or a C
In addition to the Primer3 selection criteria, the primer pair candidates are filtered through the following steps:
● Extended G/C clamp options: to avoid mispriming, it is often appropriate to avoid too many G/C bases within the 3' region of the primer. This cannot be controlled by Primer3; therefore we introduced a parameter that allows the user to define a maximum number of G/C bases to be present in the last 3' bases. The default setting is maximum three G/C bases in the last five bases of a primer.
● Amplicon bias at 3' end of transcript: primers for amplicons at the 3' end of the transcript (the last 1000 bp) are favored. For the common user this is often wanted as cDNA preparations primed with oligo-d(T)x generally exhibit 3' region bias. For those using random hexamers for cDNA synthesis, this parameter can be switched off.
● Skip 3' UTR: in cases where multiple polyadenylation signals exist in the 3' UTR it might be desirable to avoid priming in this region, as it could lead to biased quantification. This option can be switched on for custom design protocols.
● Exon-exon junction in primers: as RNA preparations may contain some genomic DNA even after digestion with DNAse I, such primers can successfully distinguish between cDNA and genomic DNA. When possible (i.e., when a genomic sequence with one or more intron(s) is available), primers that span an exon-exon junction are favored, especially when the junction occurs at the 3' end of the primer, to further decrease the probability of extendable annealing to genomic DNA.
● Specificity pre-filtering: in order to save workload for the specificity testing algorithm, obvious unspecific primer pairs are removed at this step. This is achieved by finding transcripts that are similar to the target transcript using BLAST (blastn of transcript against the whole transcriptome with an e-value = 1) and filtering out the primer pair candidates annealing perfectly to any of those sequences. Three configurations of the filter are possible; one that forces the algorithm to find primer pairs amplifying all splice variants of the transcript (for annotations containing such information), one that forces it to find only those specific to a certain splice variant, and one that allows (but does not force) them to amplify other splice variants (default setting).
The successful primer pairs are saved to the database, and the algorithm reports the number of designed primer pairs back to the calling worker thread. If it was possible to find primer pairs, the next step is specificity testing, described below (an overview is shown in Figure 6):
The primer pair specificity determination algorithm is based on the interpretation of BLAST results (with default settings: e-value = 200, word size of 7), using each primer as a query towards the transcriptome and, when available, against the genome. To identify unspecific amplicons in a transcriptome or a genome, the following (configurable) criteria are applied to the BLAST hits:
● Last two bases of the 3' region of each primer must be identical to the BLAST hit.
● Amplicons of up to 1500 bp are considered for SYBR Green protocols, and 3500 bp for end-point protocols.
Even though the primer pairs cannot give rise to an unspecific amplicon, it is generally preferred that they should be as specific as possible to the target sequence. This is approximated by checking whether a single primer in the pair has a significant (the default setting is 75%) identity to another cDNA sequence, and where the last 3' base is identical (which can be configured).
The information from the above procedures is assembled and saved into the primer pair database. Based on this specificity information, QuantPrime labels the tested primer pairs with one out of four specificity ranks: bad, acceptable, good or very good. They are defined as follows:
Bad (shown in red in the web interface): might amplify a non-specific cDNA fragment.
Acceptable (yellow): amplifies only the specific sequence, but one primer has a high similarity with a non-target sequence and the primer pair might amplify genomic DNA.
Good (light green): amplifies only the target sequence, but one primer has a high similarity with a non-target sequence or the pair might amplify genomic DNA. This is the highest possible rank for primer pairs designed for species without a genome annotation.
Very good (dark green): amplifies only the target sequence, both primers are highly specific to this sequence and will not amplify genomic DNA.
The list of designed primers is worked through until enough (the default setting is 10) of at least acceptable (rank 2) primer pairs are found. The worker thread then decides whether it is possible to find higher-ranking primer pairs (e.g., when more primer pairs spanning exon-exon junctions can be designed); if so it continues until it is successful or until a certain primer pair threshold is reached (default setting is 500 primer pairs).
The work flow implemented on the web server only performs automated relaxation in amplicon 3' bias and exon-exon junction criteria; the Primer3 parameters are not relaxed. Thus, for certain transcripts, QuantPrime will fail to find specific primer pairs; with the default settings, we arrived at a failure rate of 2–9% (see Table 2). If the user wishes to relax the Primer3 parameters to be able to find specific primers for such problematic transcripts, a new project has to be created with different primer design parameters. Some users might find this procedure cumbersome, but we chose this design to prevent primer pairs with heterogeneous design parameters to be mixed within an assay. We are open for user suggestions to introduce certain configurable relaxations in future versions of QuantPrime.
Experimental testing of primers designed through QuantPrime
To verify the experimental usefulness of the primer pairs designed with QuantPrime, we tested it to design primers for a medium-sized expression profiling experiment for Arabidopsis thaliana (for 128 transcripts of various genes), carried through by fellow researchers in our group. The default settings for design and specificity testing (SYBR Green protocol, splice-variant hits allowed) were used and the highest ranking primer pairs were chosen. As can be seen in Table 3, we experienced a success rate of 96%, meaning unique amplicons of predicted size and amplification efficiencies (E) = 1.8 (see Methods for details). Over 88% of the primers were predicted not to amplify genomic DNA. For five out of 128 transcripts we obtained non-satisfying results. For those, good primer pairs could be obtained by testing one or two alternative primer pairs designed by QuantPrime, without having to perform any PCR optimization (results not shown).
We also designed primer pairs for 33 transcripts (cell cycle genes) from Chlamydomonas reinhardtii and tested them in the same way as above. In this case transcripts of four genes could not be detected, and as the primer pairs for these transcripts spanned exon-exon junctions, we could not test them on genomic DNA. However, only one of the primer pairs of the detectable transcripts did not pass the quality control (having multiple products seen on gel), giving a success rate of 97%. Seventy-three percent of the designed primer pairs were predicted not to amplify genomic DNA.
Additionally, primer pairs for 30 different barley (Hordeum vulgare) transcripts were tested. For two primer pairs, no product could be detected, but only one of the detectable transcripts did not pass the quality control (low amplification efficiency), yielding a success rate of 96%. As no whole-genome sequence is available for barley, no predictions for genomic amplicons could be made.
In these three experiments, we thus observed a success rate > 96%. Examples of primer pairs and PCR amplification products separated on agarose gels can be found in Additional file 2.
To assess QuantPrime's accuracy of prediction of genomic DNA amplification, 173 primer pairs from an existing qPCR platform for tonoplast-related transcripts of A. thaliana (to be published elsewhere)were tested in silico with QuantPrime and experimentally with genomic DNA in real-time PCR. QuantPrime predicted 95 of these as 'gDNA-unsafe', while in real-time PCR measurable amplification was recorded for 88 of the primer pairs (data not shown). Twelve primer pairs (6.9%) were falsely predicted as 'gDNA-unsafe', and 19 (11%) falsely as 'gDNA-safe'.
In silico benchmarking of QuantPrime
In order to assess the success rate and speed of QuantPrime for larger expression profiling projects, hypothetical high-throughput assays were designed for six different species. Five assays consisted of respectively 5000 randomly selected transcripts from current genome annotations of five species (Arabidopsis thaliana, Vitis vinifera, Drosophila melanogaster, Chlamydomonas reinhardtii and Oryza sativa ssp.japonica), while the sixth assay consisted of the whole UniGene collection of barley (Hordeum vulgare) transcripts. As seen in Table 2, the success rates (primer pairs ranked as 'acceptable' or better by specificity testing) varied between 91 and 98%, which correlates relatively well with the status and complexity of the annotations. For the higher specificity ranks rather high variation between species was observed, ranging from 76–93% for the rank 'good', and 39–61% for the rank 'very good'. Since the barley annotation lacks genomic information, 'good' is the highest possible rank. Primer pair identification speed varied between 3.6 (barley) and 60 (rice) seconds per transcript, correlating roughly with the size of the sequence sets to be searched by BLAST.
We also did preliminary tests with data sets from larger transcriptomes/genomes (human, mouse), for which the design speed dropped (data not shown). This is due to a higher memory demand of the BLAST runs that can be offered in the future, when requests for the service rise.
Our experimental results show that the primer pairs designed by QuantPrime can be directly used with a high success rate (> 96%) in qPCR applications, without a need for experimental optimization of individual reaction conditions. When running tests in parallel on a standard desktop computer, the speed is enough to design primers for high-throughput projects for small- to medium sized transcriptomes as shown by the in silico tests.
To our knowledge, there are no other web-based tools directly comparable to QuantPrime, although programs like Osprey  and Primique  offer possibilities for batch primer pair design. In those two other applications, however, the user has to supply the database against which primer pair specificity is tested, but the upload capacity is limited to 10 MB which does not suffice for most transcriptomes. QuantPrime always tests the primer pairs against the whole transcriptome of the annotation used, and additionally offers a richer user interface, exon-exon junction design of primers to avoid genomic DNA amplification, and a high degree of customization of parameters, features not available in the other software packages. Most annotations are already included in QuantPrime; in the case that users have special annotations not available on the public server, they can contact us for adding it there, or they can run QuantPrime locally. A more exhaustive comparison of QuantPrime with other available primer design software can be found in the Additional file 3.
For some species pre-computed databases of primers exist. An example is AtRTPrimer  containing primer pairs for most genes of A. thaliana. When looking at the available primers in this resource one will find that the parameters for design, especially amplicon size, make the primer pairs unsuitable for real-time PCR, and due to the differences in Tm between different primer pairs exhaustive PCR optimization would be necessary for using them in high-throughput. The authors report a success rate of 93%, however only 21 primer pairs offered by the database were experimentally validated. In comparison, QuantPrime offers complete customization of parameters for different quantification methods, and we see higher success rates (> 96% for the three species tested here, n = 191). Another example is the PrimerBank , which covers primer pairs for human and mouse transcripts, which could be useful for high-throughput purposes (due to strict design criteria), even though amplicon sizes vary. Those two databases are limited to specific species; there are a couple of databases covering more species, notably RTPrimerDB , which however cover very few non-human genes. Another database containing primer pairs for plant transcription factors is DATFAP , which however is based on EST sets, which is questionable for A. thaliana and O. sativa for which good genome annotations are available. It therefore lacks information about possible genomic sequences amplified by the primer pairs; additionally Tm values vary widely between primer pairs, which might require exhaustive PCR optimization.
The parameter flexibility for design and specificity testing offered in QuantPrime makes it straightforward to employ it for the design of oligonucleotides for a number of other quantification applications, such as qPCR with hydrolyzation probes (e.g. TaqMan probes, Scorpion primers), quantitative in situ hybridization of mRNA and others. Such protocols will be added to QuantPrime as we gather experimental data and feedback from users.
The QuantPrime website offers a unique service to the scientific community, with ease-of-use, flexibility of parameters and a broad scope of transcript databases and genomic annotations, which should make it a very useful tool for primer design. No other publicly available tool offers the same services. Overall, the speed of computation and the quality of the designed primer pairs as shown experimentally make QuantPrime (on the public web server or as standalone software) a suitable system for primer design in low- to high-throughput transcription profiling projects.
We are open for suggestions from the scientific community to further develop QuantPrime in the future. Upon request we may for example include further transcript databases and genome annotations, sets of parameters for other quantification protocols and applications, or improve the applied specificity testing algorithms. Institutions wanting to host mirrors of the QuantPrime public web server or supply additional computing power are encouraged to contact the authors.
Standard molecular techniques were performed as described . Oligonucleotides were obtained from MWG (Ebersberg, Germany). Unless otherwise indicated, other chemicals were purchased from Roche (Mannheim, Germany), Merck (Darmstadt, Germany), or Sigma (Deisenhofen, Germany).
Arabidopsis thaliana (L.) Heynh accession Col-0 plants were grown in growth chambers with an 8-h day length provided by fluorescent light at 120 μmol m-2 s-1 (50% intensity during the first and last 30 minutes of the light period) and a day/night temperature of 20/16°C and relative humidity of 60/75%. Whole, young plants (four weeks after germination) including washed roots were harvested 2 hours after lights-on, snap-frozen in liquid nitrogen and stored at -70°C until RNA extraction. Chlamydomonas reinhardtii CC503 cw92 mt+ was grown under continuous light (100 μmol m-2 s-1) at 21°C in HEPES-based medium as described . Hordeum vulgare (Karat variety) plants were grown as previously described , and parts of roots from seven days-old seedlings were used for total RNA extraction.
RNA extraction and cDNA synthesis
After grinding of plant/algal material in liquid nitrogen, total RNA was isolated with Trizol reagent (Invitrogen, Karlsruhe, Germany) or RNeasy Plant Mini Kit (Qiagen, Hilden, Germany) following the manufacturers' specifications. RNA quality was determined spectrometrically (A260/A280 > 1.8) using a NanoDrop ND-1000 spectrometer (NanoDrop, Detroit, USA) and by visual inspection of separated bands on agarose gels.
After isolation, genomic DNA was digested using Turbo DNA-free recombinant DNAse I (Applied Biosystems Applera, Darmstadt, Germany) following the manufacturer's specifications. The level of remaining genomic DNA contamination was measured by diluting the samples to the same concentration as the final cDNA samples (10 ng μl-1) and performing real-time PCR using primers for a genomic sequence (UBQ10: Fw 5'-GGCCTTGTATAATCCCTGATGAATAAG-3', Rev 5'-AAAGAGATAACAGGAACGGAAACATAGT-3'). Samples with consistent cycle threshold (Ct) values below 35 were re-treated with DNAse I or new RNA extractions were performed.
Two μg of total RNA was used in 20-μl reactions for cDNA synthesis, using RevertAid R-minus cDNA synthesis kit (Fermentas, St. Leon-Rot, Germany), following the manufacturer's specifications. The cDNA was then diluted 1:10 in order to reduce the effect of RNA isolation and cDNA synthesis buffer on the subsequent PCRs.
Real-time quantitative PCR
qPCR was carried out in technical triplicates or quadruplicates using 0.5 or 1 μl of diluted cDNA in 5- or 10-μl reactions, 2 or 4 μl of 500 nM primer pairs and 2.5 or 5 μl of 2× Power SYBR Green PCR Master Mix (Applied Biosystems). The following PCR protocol was used on Applied Biosystems 7300 (96-well plates) and 7900HT (384-well plates) real-time PCR systems: 10 min at 95°C, 15 sec at 95°C, and 1 min at 60°C repeated in 50 cycles, followed by melting curve analysis. When testing primer pairs, the PCR products were then separated on a 2% agarose gel and visualized with ethidium bromide, using 50 bp DNA ladder (Invitrogen) for size determination.
Cycle threshold (Ct) values for each reaction were calculated using Applied Biosystems SDS software, with baseline set to cycle 3–15 and threshold to 0.2 Rn, recorded from the SYBR Green I dye signal normalized against the ROX dye signal.
Real-time PCR amplification efficiencies were calculated using the LinRegPCR tool , using the best-fit method for 4 to 6 points. This tool uses linear regression on log-values of normalized fluorescence data from individual reactions to calculate E in the equation for PCR kinetics, NC = N0 * EC, which states that the amount of product after C cycles (NC) is equal to the starting concentration (N0) times the efficiency (E) to the power C; 100% efficiency would give an efficiency value of 2.
Efficiency values from fitted curves with R-squared values below 0.999 were considered as unreliable; Ct values and efficiencies from such reactions were removed from further calculations. Medians of Ct values and efficiencies were calculated and used in further calculations.
Public server setup
The web-based QuantPrime program runs on a Linux-based server, with two Intel 1.6 GHz QuadCore 64-bit processors and 4 GB of RAM, configured to run up to six design/testing threads in parallel, always leaving two virtual processors available for database and web handling. This was found to be the most efficient configuration for this single server; setting up the program and database in a clustered environment with specialized data and computation nodes should lead to synergistic speed improvements, as the amount of data transferred between database and executing threads are kept very low.
In silico benchmarking
For the random selection of transcripts from annotations, the built-in random function in MySQL was used to order all transcripts from the respective annotation having a transcript length of more than 300 bp, of which the top 5000 were selected.
The run times given are real time (not CPU time), meaning the difference of the time point when the experiment started and when it finished. The average time per transcript is the total time divided by the number of transcripts. Due to the parallel nature of the program, the typical time to design one specific primer pair for a transcript is longer.
Availability and requirements
Project name: QuantPrime
Project home page: http://www.quantprime.de/
Operating systems: Platform independent
Programming languages: Python and PHP (web interface)
Any restrictions to use by non-academics: License needed
Higuchi R, Fockler C, Dollinger G, Watson R: Kinetic PCR analysis: real-time monitoring of DNA amplification reactions. Biotechnology (NY) 1993, 11: 1026–1030. 10.1038/nbt0993-1026
Czechowski T, Bari RP, Stitt M, Scheible W, Udvardi MK: Real-time RT-PCR profiling of over 1400 Arabidopsis transcription factors: unprecedented sensitivity reveals novel root- and shoot-specific genes. Plant J 2004, 38: 366–379. 10.1111/j.1365-313X.2004.02051.x
Caldana C, Scheible W, Mueller-Roeber B, Ruzicic S: A quantitative RT-PCR platform for high-throughput expression profiling of 2500 rice transcription factors. Plant Methods 2007, 3: 7. 10.1186/1746-4811-3-7
Horak CE, Snyder M: Global analysis of gene expression in yeast. Funct Integr Genomics 2002, 2: 171–180. 10.1007/s10142-002-0065-3
Gordon PMK, Sensen CW: Osprey: a comprehensive tool employing novel methods for the design of oligonucleotides for DNA sequencing and microarrays. Nucl Acids Res 2004, 32: e133. 10.1093/nar/gnh127
Fredslund J, Lange M: Primique: automatic design of specific PCR primers for each sequence in a family. BMC Bioinformatics 2007, 8: 369. 10.1186/1471-2105-8-369
Rozen S, Skaletsky H: Primer3 on the WWW for general users and for biologist programmers. Methods Mol Biol 2000, 132: 365–386.
Untergasser A, Nijveen H, Rao X, Bisseling T, Geurts R, Leunissen JAM: Primer3Plus, an enhanced web interface to Primer3. Nucl Acids Res 2007, 35: W71–4. 10.1093/nar/gkm306
Wrobel G, Kokocinski F, Lichter P: AutoPrime: selecting primers for expressed sequences. Genome Biology 2004, 5: P11. 10.1186/gb-2004-5-5-p11
You FM, Huo N, Gu YQ, Luo M, Ma Y, Hane D, Lazo GR, Dvorak J, Anderson OD: BatchPrimer3: a high throughput web application for PCR and sequencing primer design. BMC Bioinformatics 2008, 9: 253. 10.1186/1471-2105-9-253
Pattyn F, Robbrecht P, De Paepe A, Speleman F, Vandesompele J: RTPrimerDB: the real-time PCR primer and probe database, major update 2006. Nucl Acids Res 2006, 34: D684–8. 10.1093/nar/gkj155
Wang X, Seed B: A PCR primer bank for quantitative gene expression analysis. Nucl Acids Res 2003, 31: e154. 10.1093/nar/gng154
Cui W, Taub DD, Gardner K: qPrimerDepot: a primer database for quantitative real time PCR. Nucl Acids Res 2007, 35: D805–9. 10.1093/nar/gkl767
Han S, Kim D: AtRTPrimer: database for Arabidopsis genome-wide homogeneous and specific RT-PCR primer-pairs. BMC Bioinformatics 2006, 7: 179. 10.1186/1471-2105-7-179
Fredslund J: DATFAP: a database of primers and homology alignments for transcription factors from 13 plant species. BMC Genomics 2008, 9: 140. 10.1186/1471-2164-9-140
Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucl Acids Res 1997, 25: 3389–3402. 10.1093/nar/25.17.3389
SantaLucia JJ: A unified view of polymer, dumbbell, and oligonucleotide DNA nearest-neighbor thermodynamics. Proc Natl Acad Sci USA 1998, 95: 1460–1465. 10.1073/pnas.95.4.1460
Sambrook J, Sambrook J, Maniatis T: Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press; 2001.
May P, Wienkoop S, Kempa S, Usadel B, Christian N, Rupprecht J, Weiss J, Recuenco-Munoz L, Ebenhoh O, Weckwerth W, Walther D: Metabolomics- and Proteomics-Assisted Genome Annotation and Analysis of the Draft Metabolic Network of Chlamydomonas reinhardtii. Genetics 2008, 179: 157–166. 10.1534/genetics.108.088336
Kwasniewski M, Szarejko I: Molecular Cloning and Characterization of beta-Expansin Gene Related to Root Hair Formation in Barley. Plant Physiol 2006, 141: 1149–1158. 10.1104/pp.106.078626
Ramakers C, Ruijter JM, Deprez RHL, Moorman AFM: Assumption-free analysis of quantitative real-time polymerase chain reaction (PCR) data. Neurosci Lett 2003, 339: 62–66. 10.1016/S0304-3940(02)01423-4
Childs KL, Hamilton JP, Zhu W, Ly E, Cheung F, Wu H, Rabinowicz PD, Town CD, Buell CR, Chan AP: The TIGR Plant Transcript Assemblies database. Nucl Acids Res 2007, 35: D846–51. 10.1093/nar/gkl785
Wheeler DL, Church DM, Federhen S, Lash AE, Madden TL, Pontius JU, Schuler GD, Schriml LM, Sequeira E, Tatusova TA, Wagner L: Database resources of the National Center for Biotechnology. Nucl Acids Res 2003, 31: 28–33. 10.1093/nar/gkg033
Swarbreck D, Wilks C, Lamesch P, Berardini TZ, Garcia-Hernandez M, Foerster H, Li D, Meyer T, Muller R, Ploetz L, Radenbaugh A, Singh S, Swing V, Tissier C, Zhang P, Huala E: The Arabidopsis Information Resource (TAIR): gene structure and function annotation. Nucl Acids Res 2008, 36: D1009–14. 10.1093/nar/gkm965
Pruitt KD, Tatusova T, Maglott DR: NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucl Acids Res 2007, 35: D61–5. 10.1093/nar/gkl842
Merchant SS, Prochnik SE, Vallon O, Harris EH, Karpowicz SJ, Witman GB, Terry A, Salamov A, Fritz-Laylin LK, Maréchal-Drouard L, Marshall WF, Qu L, Nelson DR, Sanderfoot AA, Spalding MH, Kapitonov VV, Ren Q, Ferris P, Lindquist E, Shapiro H, Lucas SM, Grimwood J, Schmutz J, Cardol P, Cerutti H, Chanfreau G, Chen C, Cognat V, Croft MT, Dent R, Dutcher S, Fernández E, Fukuzawa H, González-Ballester D, González-Halphen D, Hallmann A, Hanikenne M, Hippler M, Inwood W, Jabbari K, Kalanon M, Kuras R, Lefebvre PA, Lemaire SD, Lobanov AV, Lohr M, Manuell A, Meier I, Mets L, Mittag M, Mittelmeier T, Moroney JV, Moseley J, Napoli C, Nedelcu AM, Niyogi K, Novoselov SV, Paulsen IT, Pazour G, Purton S, Ral J, Riaño-Pachón DM, Riekhof W, Rymarquis L, Schroda M, Stern D, Umen J, Willows R, Wilson N, Zimmer SL, Allmer J, Balk J, Bisova K, Chen C, Elias M, Gendler K, Hauser C, Lamb MR, Ledford H, Long JC, Minagawa J, Page MD, Pan J, Pootakham W, Roje S, Rose A, Stahlberg E, Terauchi AM, Yang P, Ball S, Bowler C, Dieckmann CL, Gladyshev VN, Green P, Jorgensen R, Mayfield S, Mueller-Roeber B, Rajamani S, Sayre RT, Brokstein P, Dubchak I, Goodstein D, Hornick L, Huang YW, Jhaveri J, Luo Y, Martínez D, Ngau WCA, Otillar B, Poliakov A, Porter A, Szajkowski L, Werner G, Zhou K, Grigoriev IV, Rokhsar DS, Grossman AR: The Chlamydomonas genome reveals the evolution of key animal and plant functions. Science 2007, 318: 245–250. 10.1126/science.1143609
FluBase: The FlyBase database of the Drosophila genome projects and community literature. Nucl Acids Res 2003, 31: 172–175. 10.1093/nar/gkg094
Yamasaki C, Murakami K, Fujii Y, Sato Y, Harada E, Takeda J, Taniya T, Sakate R, Kikugawa S, Shimada M, Tanino M, Koyanagi KO, Barrero RA, Gough C, Chun H, Habara T, Hanaoka H, Hayakawa Y, Hilton PB, Kaneko Y, Kanno M, Kawahara Y, Kawamura T, Matsuya A, Nagata N, Nishikata K, Noda AO, Nurimoto S, Saichi N, Sakai H, Sanbonmatsu R, Shiba R, Suzuki M, Takabayashi K, Takahashi A, Tamura T, Tanaka M, Tanaka S, Todokoro F, Yamaguchi K, Yamamoto N, Okido T, Mashima J, Hashizume A, Jin L, Lee K, Lin Y, Nozaki A, Sakai K, Tada M, Miyazaki S, Makino T, Ohyanagi H, Osato N, Tanaka N, Suzuki Y, Ikeo K, Saitou N, Sugawara H, O'Donovan C, Kulikova T, Whitfield E, Halligan B, Shimoyama M, Twigger S, Yura K, Kimura K, Yasuda T, Nishikawa T, Akiyama Y, Motono C, Mukai Y, Nagasaki H, Suwa M, Horton P, Kikuno R, Ohara O, Lancet D, Eveno E, Graudens E, Imbeaud S, Debily MA, Hayashizaki Y, Amid C, Han M, Osanger A, Endo T, Thomas MA, Hirakawa M, Makalowski W, Nakao M, Kim N, Yoo H, De Souza SJ, Bonaldo MDF, Niimura Y, Kuryshev V, Schupp I, Wiemann S, Bellgard M, Shionyu M, Jia L, Thierry-Mieg D, Thierry-Mieg J, Wagner L, Zhang Q, Go M, Minoshima S, Ohtsubo M, Hanada K, Tonellato P, Isogai T, Zhang J, Lenhard B, Kim S, Chen Z, Hinz U, Estreicher A, Nakai K, Makalowska I, Hide W, Tiffin N, Wilming L, Chakraborty R, Soares MB, Chiusano ML, Suzuki Y, Auffray C, Yamaguchi-Kabata Y, Itoh T, Hishiki T, Fukuchi S, Nishikawa K, Sugano S, Nomura N, Tateno Y, Imanishi T, Gojobori T: The H-Invitational Database (H-InvDB), a comprehensive annotation resource for human genes and transcripts. Nucl Acids Res 2008, 36: D793–9.
Ouyang S, Zhu W, Hamilton J, Lin H, Campbell M, Childs K, Thibaud-Nissen F, Malek RL, Lee Y, Zheng L, Orvis J, Haas B, Wortman J, Buell CR: The TIGR Rice Genome Annotation Resource: improvements and new features. Nucl Acids Res 2007, 35: D883–7. 10.1093/nar/gkl976
Rensing SA, Lang D, Zimmer AD, Terry A, Salamov A, Shapiro H, Nishiyama T, Perroud P, Lindquist EA, Kamisugi Y, Tanahashi T, Sakakibara K, Fujita T, Oishi K, Shin-I T, Kuroki Y, Toyoda A, Suzuki Y, Hashimoto S, Yamaguchi K, Sugano S, Kohara Y, Fujiyama A, Anterola A, Aoki S, Ashton N, Barbazuk WB, Barker E, Bennetzen JL, Blankenship R, Cho SH, Dutcher SK, Estelle M, Fawcett JA, Gundlach H, Hanada K, Heyl A, Hicks KA, Hughes J, Lohr M, Mayer K, Melkozernov A, Murata T, Nelson DR, Pils B, Prigge M, Reiss B, Renner T, Rombauts S, Rushton PJ, Sanderfoot A, Schween G, Shiu S, Stueber K, Theodoulou FL, Tu H, Peer Y, Verrier PJ, Waters E, Wood A, Yang L, Cove D, Cuming AC, Hasebe M, Lucas S, Mishler BD, Reski R, Grigoriev IV, Quatrano RS, Boore JL: The Physcomitrella genome reveals evolutionary insights into the conquest of land by plants. Science 2008, 319: 64–69. 10.1126/science.1150646
Tuskan GA, Difazio S, Jansson S, Bohlmann J, Grigoriev I, Hellsten U, Putnam N, Ralph S, Rombauts S, Salamov A, Schein J, Sterck L, Aerts A, Bhalerao RR, Bhalerao RP, Blaudez D, Boerjan W, Brun A, Brunner A, Busov V, Campbell M, Carlson J, Chalot M, Chapman J, Chen G, Cooper D, Coutinho PM, Couturier J, Covert S, Cronk Q, Cunningham R, Davis J, Degroeve S, Déjardin A, Depamphilis C, Detter J, Dirks B, Dubchak I, Duplessis S, Ehlting J, Ellis B, Gendler K, Goodstein D, Gribskov M, Grimwood J, Groover A, Gunter L, Hamberger B, Heinze B, Helariutta Y, Henrissat B, Holligan D, Holt R, Huang W, Islam-Faridi N, Jones S, Jones-Rhoades M, Jorgensen R, Joshi C, Kangasjärvi J, Karlsson J, Kelleher C, Kirkpatrick R, Kirst M, Kohler A, Kalluri U, Larimer F, Leebens-Mack J, Leplé J, Locascio P, Lou Y, Lucas S, Martin F, Montanini B, Napoli C, Nelson DR, Nelson C, Nieminen K, Nilsson O, Pereda V, Peter G, Philippe R, Pilate G, Poliakov A, Razumovskaya J, Richardson P, Rinaldi C, Ritland K, Rouzé P, Ryaboy D, Schmutz J, Schrader J, Segerman B, Shin H, Siddiqui A, Sterky F, Terry A, Tsai C, Uberbacher E, Unneberg P, Vahala J, Wall K, Wessler S, Yang G, Yin T, Douglas C, Marra M, Sandberg G, Peer Y, Rokhsar D: The genome of black cottonwood, Populus trichocarpa (Torr. & Gray). Science 2006, 313: 1596–1604. 10.1126/science.1128691
Cherry JM, Ball C, Weng S, Juvik G, Schmidt R, Adler C, Dunn B, Dwight S, Riles L, Mortimer RK, Botstein D: Genetic and physical maps of Saccharomyces cerevisiae. Nature 1997, 387: 67–73. 10.1038/43025
Jaillon O, Aury J, Noel B, Policriti A, Clepet C, Casagrande A, Choisne N, Aubourg S, Vitulo N, Jubin C, Vezzi A, Legeai F, Hugueney P, Dasilva C, Horner D, Mica E, Jublot D, Poulain J, Bruyère C, Billault A, Segurens B, Gouyvenoux M, Ugarte E, Cattonaro F, Anthouard V, Vico V, Del Fabbro C, Alaux M, Di Gaspero G, Dumas V, Felice N, Paillard S, Juman I, Moroldo M, Scalabrin S, Canaguier A, Le Clainche I, Malacrida G, Durand E, Pesole G, Laucou V, Chatelet P, Merdinoglu D, Delledonne M, Pezzotti M, Lecharny A, Scarpelli C, Artiguenave F, Pè ME, Valle G, Morgante M, Caboche M, Adam-Blondon A, Weissenbach J, Quétier F, Wincker P: The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature 2007, 449: 463–467. 10.1038/nature06148
SA is supported through the EU Marie Curie Research Training Network 'VaTEP – Vacuolar Transport Equipment for Growth Regulation of Plants' (MRTN-CT-2006-035833) which the authors greatly acknowledge. MK thanks the DAAD for a fellowship provided through the program 'Modern Applications of Biotechnology' (No. A/06/04209) and the Polish Ministry of Science and Higher Education for financial support (research grant 2 P04C 056 30). DMRP and BMR thank the Interdisciplinary Research Center 'Advanced Protein Technologies' of the University of Potsdam for financial support. BMR thanks the Fonds der Chemischen Industrie for financial support (No 0164389). DMRP acknowledges financial support by the Bundesministerium fuer Bildung und Forschung (BMBF) (GABI-FUTURE grant 0315046). BMR thanks the BMBF for funding of the systems biology research unit 'GoFORSYS – Potsdam-Golm BMBF Forschungseinrichtung zur Systembiologie. Photosynthesis and Growth; a Systems Biology Based Approach' (FKZ 0313924).
Thanks to Luiz Gustavo Guedes Correa (DAAD fellowship) for primer testing, to Raúl Trejos-Espinosa (GoFORSYS) for providing Chlamydomonas reinhardtii cDNA, to Agnieszka Janiak (University of Silesia, Katowice, Poland) for providing Hordeum vulgare RNA and to Anika Wiese and Anup Karwa (VaTEP members at ICG III, FZ-Juelich, Germany) for testing the program.
SA designed and programmed QuantPrime, carried out most of the primer testing and drafted the manuscript. MK designed the graphical user interface and contributed to the design of the program, carried out the tests with barley and revised the manuscript. DMRP helped out to design the program, prepared sequence databases, installed and administrates the public server and revised the manuscript. BMR supervised the group, helped out with the design and testing of the program and helped drafting the manuscript. All authors read and approved the final manuscript.