Table 3 Estimates for the Naegleria libraries

From: A Bayesian nonparametric method for prediction in EST analysis

| % of n | m | Expected number of new genes in an additional sample of size m | Probability of discovering a new gene at the (n + m + 1)-th read |
|---|---|---|---|
| Naegleria aerobic | | | |
| 50 | 480 | 162 (138, 188) | 0.318 (0.307, 0.329) |
| 100 | 959 | 307 (271, 345) | 0.290 (0.277, 0.303) |
| 150 | 1438 | 441 (394, 488) | 0.270 (0.257, 0.282) |
| 200 | 1918 | 566 (510, 624) | 0.254 (0.241, 0.267) |
| 250 | 2398 | 685 (619, 751) | 0.242 (0.229, 0.255) |
| 300 | 2877 | 798 (725, 873) | 0.231 (0.219, 0.244) |
| Naegleria anaerobic | | | |
| 50 | 484 | 231 (206, 258) | 0.450 (0.440, 0.461) |
| 100 | 969 | 440 (402, 478) | 0.412 (0.400, 0.424) |
| 150 | 1454 | 632 (583, 683) | 0.384 (0.371, 0.397) |
| 200 | 1938 | 812 (753, 873) | 0.362 (0.349, 0.375) |
| 250 | 2422 | 983 (915, 1053) | 0.344 (0.332, 0.357) |
| 300 | 2907 | 1146 (1069, 1225) | 0.330 (0.317, 0.342) |

  1. Naegleria aerobic and anaerobic libraries: the first column gives the size of the additional sample as a percentage of the initial sample size n, the second the actual size m of the additional sample, the third the expected number of new genes, and the fourth the discovery probability. The estimates in the third and fourth columns are accompanied by their 95% highest posterior density intervals.
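
The relation between the first two columns can be checked with a short sketch. It assumes that the initial sample sizes are n = 959 (aerobic) and n = 969 (anaerobic), which is inferred here from the 100% rows, where m equals n. The helper names `additional_sample_size` and `check_table` are hypothetical. The tabulated values of m are consistent with rounding ties to the even integer, which is what Python's built-in `round` does; this is an observation about the table, not a statement of how the authors computed m.

```python
# Initial sample sizes n, inferred from the 100% rows of the table (m = n there).
# This is an assumption of the sketch, not a value stated in the table itself.
INITIAL_SIZE = {"aerobic": 959, "anaerobic": 969}

# Values of m copied from the second column, keyed by the percentage in the first column.
TABULATED_M = {
    "aerobic": {50: 480, 100: 959, 150: 1438, 200: 1918, 250: 2398, 300: 2877},
    "anaerobic": {50: 484, 100: 969, 150: 1454, 200: 1938, 250: 2422, 300: 2907},
}


def additional_sample_size(n: int, percent: int) -> int:
    """Return m as percent% of n, rounded to an integer (ties go to the even integer)."""
    return round(n * percent / 100)


def check_table() -> bool:
    """Return True if every tabulated m matches the value recomputed from n."""
    return all(
        additional_sample_size(INITIAL_SIZE[library], percent) == m
        for library, rows in TABULATED_M.items()
        for percent, m in rows.items()
    )


if __name__ == "__main__":
    for library, n in INITIAL_SIZE.items():
        sizes = [additional_sample_size(n, p) for p in (50, 100, 150, 200, 250, 300)]
        print(library, sizes)
    print("table consistent:", check_table())
```

With these assumed values of n, every recomputed m agrees with the second column of the table.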