Skip to main content

Table 1 Coverage statistics for Bignorm with Q 0=20, Diginorm, and the raw datasets

From: An improved filtering algorithm for big read datasets and its application to single-cell assembly

Dataset Algorithm \(\mathcal {P}10\) Mean \(\mathcal {P}90\) Max
Aceto Bignorm 6 132 216 6801
  Diginorm 7 171 295 12,020
  Raw 15 9562 17,227 551,000
Alphaproteo Bignorm 10 43 92 884
  Diginorm 7 173 481 6681
  Raw 25 5302 14,070 303,200
Arco Bignorm 1 98 54 2103
  Diginorm 1 362 200 6114
  Raw 3 10,850 4091 220,600
Arma Bignorm 8 23 32 358
  Diginorm 8 79 141 5000
  Raw 17 629 1118 31,260
ASZN2 Bignorm 40 70 83 2012
  Diginorm 23 143 354 3437
  Raw 50 1738 4784 43,840
Bacteroides Bignorm 3 74 90 6768
  Diginorm 3 123 205 7933
  Raw 7 6051 8127 570,900
Caldi Bignorm 25 63 110 786
  Diginorm 15 67 135 3584
  Raw 27 1556 3643 33,530
Caulo Bignorm 7 228 216 10,400
  Diginorm 8 362 491 35,520
  Raw 8 10,220 9737 464,300
Chloroflexi Bignorm 8 72 101 2822
  Diginorm 9 412 878 20,850
  Raw 9 5612 7741 316,900
Crenarch Bignorm 8 104 159 3770
  Diginorm 10 560 1285 29,720
  Raw 10 8086 14,987 316,700
Cyanobact Bignorm 9 144 153 5234
  Diginorm 10 756 1450 26,980
  Raw 10 9478 11,076 356,600
E.coli Bignorm 37 45 56 234
  Diginorm 50 382 922 7864
  Raw 112 2522 6378 56,520
SAR324 Bignorm 24 49 71 1410
  Diginorm 18 53 107 2473
  Raw 26 1086 2761 106,000