FCI: an R-based algorithm for evaluating uncertainty of absolute real-time PCR quantification
BMC Bioinformatics volume 9, Article number: 13 (2008)
FCI is an R code for analyzing data from real-time PCR experiments. This algorithm estimates standard curve features as well as nucleic acid concentrations and confidence intervals according to Fieller's theorem.
In order to describe the features of FCI four situations were selected from real data collected during an international external quality assessment program for quantitative assays based on real-time PCR. The code generates a diagnostic figure suitable for assessing the quality of the quantification process.
We have provided a freeware programme using this algorithm specifically designed to increase the information content of the real-time PCR assay.
Real-time PCR is widely used for the quantification of nucleic acids in a wide range of clinical and research applications including the measurement of gene dosage, detection of residual disease in haematological malignancies and detection of bacterial and viral infection.
Real-time PCR typically employs fluorescent probes which generate a signal that accumulates during PCR cycling in a manner proportional to the concentration of amplification products. Absolute quantification of a nucleic acid target can be achieved using a standard curve, which is generated by amplifying known amounts of the target DNA. The standard curve is typically generated using a series of 10-fold dilutions of a control template. For each dilution, replicated determinations of the cycle threshold (ct) are performed and a straight line is fitted to the data by plotting the ct averages as a function of the logarithm of the starting concentration of the standards. By applying a technique known as "inverse regression," the straight line is used as a "calibrator" to estimate the unknown starting DNA concentration in the sample under examination.
As in any titration, evaluating the uncertainty in the estimated concentration of the unknown sample is critical for interpreting the data and optimizing experimental procedures. In addition to the point estimate it is important to calculate the confidence interval of the "true" value of the unknown concentration.
Commercial software specifically designed for generating standard curves and estimating nucleic acid concentrations is now available, however, there are currently no freeware tools available for calculating the uncertainty associated with the concentration estimates.
Several approaches have been proposed for constructing confidence intervals in inverse regression [1–5], but the most frequently used method is due to Fieller . Fieller's theorem provides a general procedure for the construction of confidence limits for certain ratios of parameters, most often applied to ratios of linear combinations of parameters (e.g. inverse estimate from a linear regression model). In this note we propose a statistical tool, FCI (Fieller's Confidence Interval), to estimate the confidence interval (CI) of the "true" value of each unknown concentration according to Fieller's theorem.
FCI code has been tested on the WINDOWS platform with the R Software version 2.5.1 , an open-source statistical programming language. FCI implementation involves data import, FCI running and FCI output.
Data must be imported into FCI as Comma Separated Values (.csv) file [see Additional File 1] using Microsoft Excel '97 or a more recent version. Figure 1 shows a screenshot of real-time PCR data suitable for analysis by FCI.
Before running the FCI code it is necessary to save the file in the new directory C:\DATA and install the 'car' package from the Packages menu of R.
The FCI algorithm works as follows. First, the data are fitted to a linear regression model as reported in equation 1 and the corresponding analysis of variance table (Anova table, Table 1) is displayed in the output. Next, using the regression coefficient (Intercept and Slope) estimates, the FCI code provides the unknown concentration estimate (equation 2) both in (common) logarithmic scale and also in its original scale as copy number. Finally lower and upper limits of the 100(1-α)% Fieller's CI of the unknown concentration in both logarithmic and original scale are estimated as shown in . By default FCI provides a two-tailed 95% confidence interval (significance level, α = 0.05), however users can modify the confidence level (1-α) by replacing 0.95 in the command line for calling the code with the chosen level.
The code performs a "Lack of fit" test on the fitted calibration model, the computation of 95% confidence interval for regression coefficients and, where appropriate, provides messages to assist in the interpretation of the results. In addition it generates a diagnostic plot enabling both an assessment of PCR assay quality and also a visual representation of the Fieller's CI graphical derivation .
Results and Discussion
In order to illustrate the functionality of FCI, we considered the data provided by four participants (laboratories A-D) in an international external quality assessment program (EQUAL-quant) for quantitative assays based on real-time PCR . In this program laboratories received primers, labelled probe master mixture, plasmid standards (containing 10, 102, 103, 104, 105 copies/5 μL), and three samples of unknown concentration (test samples). Participants were required to measure the target copy number in all test samples and provide all ct values in triplicate. For the purposes of this illustration we consider only performance in the analysis of the test sample corresponding to the lowest target copy number (56.21 copies/5 μl calculated as median value of copy number from a total of 92 laboratories).
For laboratory A the linear model underlying the standard curve was inadequate. In this case the warning message "Lack of fit" appears in the output (Figure 2, panel A) as the p-value associated to the Lack of fit test is less than 0.05 (significance level α = 0.05). Although FCI code will provide the Fieller's confidence interval, caution should be taken when interpreting results and attempts should be made to discover where and how the inadequacy occurred. In fact, as we see from Figure 3, panel A, the replicates related to standards 4 and 5 appear inadequate to the linear fit.
For laboratory B the ct mean of the unknown sample (y0) lies outside the range of the standard dilutions used to fit the standard curve. In this situation (i.e. extrapolation) we cannot assume that the linear model on which the calibration is based will hold true outside the range of the standard dilutions. In this case the FCI code will not provide the Fieller's CI in the output (Figure 2, panel B), and the warning message "Out of standards range" appears in the legend of the corresponding figure (Figure 3, panel B).
For laboratory C the 95% CI of the true value of the slope includes the theoretical expected slope value (-3.32193) corresponding to 100% amplification efficiency. In such a situation the message "Optimal efficiency" appears in the output (Figure 2, panel C). This message appears only when no warning messages have been displayed. Figure 3, panel C reports the pertinent graph.
When the 95% confidence interval of the true value of the slope includes zero (an unlikely eventuality in a real-time PCR experiment) the warning message "Not real confidence interval" appears in the FCI output. In this situation, not observed in our real data, the resulting Fieller's confidence interval is infinite in extent and the data have to be considered as valueless for estimating the unknown concentration.
We have described an algorithm for the computation of CIs based on Fieller's theorem in the context of real-time PCR quantification. While the algorithm is designed in this case for real-time PCR experiments, it is easily adapted to other assays based on inverse estimation from a straight line model (calibration). FCI estimates the standard curve, the unknown sample concentration and its uncertainty. Furthermore it provides an insightful diagnostic figure.
The statistical model corresponding to the standard curve is :
yij = β0 + β1xi + εij (1)
where yij specifies the value of ct measured for the j-th replication (j = 1, 2,...,Ji) at the i-th standard (i = 1, 2,...,I), xi defines the logarithm of the starting DNA/cDNA concentration of the i-th standard and εij is the random component assumed to be normally distributed with mean zero and constant error variance σ2. The estimates b0 and b1 of β0 and β1 respectively are obtained by the Ordinary Least Squared method.
The value of interest in a real-time PCR experiment is the logarithm of the unknown DNA/cDNA concentration in the sample under examination. The latter (x0) is usually estimated by resorting to the inverse regression, as:
where is the mean of the K replicated values of ct [ctk = yk, (k = 1, 2,....K)] measured for the sample under examination.
As shown in Verderio et al.  the limits of 100(1-α)% confidence interval of x0 according to Fieller's theorem are obtained as roots of a second degree equation.
Availability and requirements
For the current version of FCI code please see Additional file 2. The commands for importing data, running the code and displaying the output are provided in Additional file 3. Further development as well as extended version of FCI code will be available trough our website: http://www.istitutotumori.mi.it/INT/dipartimenti/UO/biometria. FCI code use R Software http://www.r-project.org, an open-source statistical programming language.
Hoadley B: A Bayesian look at inverse linear regression. J Am Stat Assoc 1970, 65: 356–369. 10.2307/2283598
Cox C: Fieller's theorem, the likelihood and the delta method. Biometrics 1990, 46: 709–718. 10.2307/2532090
Krutchkoff RG: Classical and inverse regression methods of calibration. Technometrics 1967, 9: 425–439. 10.2307/1266511
Grybill FA: Theory and applications of the linear model. Massachussets USA: Duxbury press; 1976.
Wu CFJ: Jackknife, bootstrap and other resampling methods in regression analysis. Ann Statist 1986, 14: 1261–1295. 10.1214/aos/1176350142
Fieller EC: The biological standardization of insulin. J R Stat Soc 1940, (Suppl 7):1–64.
Ihaka R, Gentleman R: R: A language for data analysis and graphics. J Comp Graph Stat 1996, 5: 299–314. 10.2307/1390807
Verderio P, Orlando C, Casini Raggi C, Marubini E: Confidence interval estimation for DNA and mRNA. concentration by real-time PCR: A new environment for an old theorem. Int J Biol Markers 2004, 19: 76–9.
Pizzamiglio S, Verderio P, Orlando C, Marubini E: Confidence Interval for DNA/mRNA concentration by real-time PCR. Int J Biol Markers 2007, 22: 232–6.
Ramsden SC, Daly S, Geilenkeuser WJ, Duncan G, Hermitte F, Marubini E, Neumaier M, Orlando C, Palicka V, Paradiso A, Pazzagli M, Pizzamiglio S, Verderio P: EQUAL-quant: An International External Quality Assessment Scheme for Real-Time PCR. Clin Chem 2006, 52: 1584–91. 10.1373/clinchem.2005.066019
Draper NR, Smith H: Applied Regression Analysis. Third edition. New York: John Wiley & Sons; 1998.
EQUAL-quant has been supported by the EU Sixth Framework Program (Contract 504842). This note was supported by the "Sovvenzione Globale INGENIO" awarded by the "Fondo Sociale Europeo, Ministero del Lavoro e della Previdenza Sociale and Regione Lombardia".
All authors contributed to the work described in the manuscript. The R-implementation of the FCI code as well as its testing was carried out by FG with advice and supervisor from PV and SP. PV provided oversight of the work, finalized the draft and optimized the final version. SR supplied the data for testing the code and improved the writing. All authors read and approved the manuscript.
Electronic supplementary material
Additional file 3: A text file containing the commands for importing data, calling the code and displaying the output. (R 939 bytes)
About this article
Cite this article
Verderio, P., Pizzamiglio, S., Gallo, F. et al. FCI: an R-based algorithm for evaluating uncertainty of absolute real-time PCR quantification. BMC Bioinformatics 9, 13 (2008). https://doi.org/10.1186/1471-2105-9-13