Table 3 Estimates for the Naegleria libraries

From: A Bayesian nonparametric method for prediction in EST analysis

% of n  m     Expected number of new genes in an additional sample of size m  Probability of discovering a new gene at the (n + m + 1)-th read
Naegleria aerobic
50      480   162 (138, 188)                                                  0.318 (0.307, 0.329)
100     959   307 (271, 345)                                                  0.290 (0.277, 0.303)
150     1438  441 (394, 488)                                                  0.270 (0.257, 0.282)
200     1918  566 (510, 624)                                                  0.254 (0.241, 0.267)
250     2398  685 (619, 751)                                                  0.242 (0.229, 0.255)
300     2877  798 (725, 873)                                                  0.231 (0.219, 0.244)
Naegleria anaerobic
50      484   231 (206, 258)                                                  0.450 (0.440, 0.461)
100     969   440 (402, 478)                                                  0.412 (0.400, 0.424)
150     1454  632 (583, 683)                                                  0.384 (0.371, 0.397)
200     1938  812 (753, 873)                                                  0.362 (0.349, 0.375)
250     2422  983 (915, 1053)                                                 0.344 (0.332, 0.357)
300     2907  1146 (1069, 1225)                                               0.330 (0.317, 0.342)
  1. Naegleria aerobic and anaerobic libraries. The first column gives the size of the additional sample as a percentage of the initial sample size n, the second gives the actual size m of the additional sample, the third gives the expected number of new genes, and the fourth gives the probability of discovering a new gene at the (n + m + 1)-th read. The estimates in the third and fourth columns are accompanied by their 95% highest posterior density intervals.
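
The relation between the first two columns can be checked with a short sketch. It rests on two assumptions that the table does not state: the initial sample sizes (n = 959 for the aerobic library, n = 969 for the anaerobic library) are read off the 100% rows, and exact halves are rounded to the nearest even integer, a rule chosen here only because it reproduces the printed values of m. The function name additional_sample_sizes is hypothetical.

```python
# Sketch only: n is inferred from the 100% rows of the table, and the
# rounding rule is an assumption that happens to match the printed m values.
PERCENTAGES = [50, 100, 150, 200, 250, 300]


def additional_sample_sizes(n, percentages=PERCENTAGES):
    """Return the additional sample size m for each percentage of n."""
    # Python's built-in round() sends exact halves to the nearest even integer.
    return [round(n * pct / 100) for pct in percentages]


for library, n in [("aerobic", 959), ("anaerobic", 969)]:
    print(library, additional_sample_sizes(n))
```

Under these assumptions the computed sizes agree with the second column for both libraries. Rounding halves upward instead would give 1439 rather than 1438 for the aerobic 150% row.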