Table 1 Coverage statistics for Bignorm with \(Q_0 = 20\), Diginorm, and the raw datasets

From: An improved filtering algorithm for big read datasets and its application to single-cell assembly

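| Dataset | Algorithm | \(\mathcal{P}10\) | Mean | \(\mathcal{P}90\) | Max |
| --- | --- | --- | --- | --- | --- |
| Aceto | Bignorm | 6 | 132 | 216 | 6801 |
| | Diginorm | 7 | 171 | 295 | 12,020 |
| | Raw | 15 | 9562 | 17,227 | 551,000 |
| Alphaproteo | Bignorm | 10 | 43 | 92 | 884 |
| | Diginorm | 7 | 173 | 481 | 6681 |
| | Raw | 25 | 5302 | 14,070 | 303,200 |
| Arco | Bignorm | 1 | 98 | 54 | 2103 |
| | Diginorm | 1 | 362 | 200 | 6114 |
| | Raw | 3 | 10,850 | 4091 | 220,600 |
| Arma | Bignorm | 8 | 23 | 32 | 358 |
| | Diginorm | 8 | 79 | 141 | 5000 |
| | Raw | 17 | 629 | 1118 | 31,260 |
| ASZN2 | Bignorm | 40 | 70 | 83 | 2012 |
| | Diginorm | 23 | 143 | 354 | 3437 |
| | Raw | 50 | 1738 | 4784 | 43,840 |
| Bacteroides | Bignorm | 3 | 74 | 90 | 6768 |
| | Diginorm | 3 | 123 | 205 | 7933 |
| | Raw | 7 | 6051 | 8127 | 570,900 |
| Caldi | Bignorm | 25 | 63 | 110 | 786 |
| | Diginorm | 15 | 67 | 135 | 3584 |
| | Raw | 27 | 1556 | 3643 | 33,530 |
| Caulo | Bignorm | 7 | 228 | 216 | 10,400 |
| | Diginorm | 8 | 362 | 491 | 35,520 |
| | Raw | 8 | 10,220 | 9737 | 464,300 |
| Chloroflexi | Bignorm | 8 | 72 | 101 | 2822 |
| | Diginorm | 9 | 412 | 878 | 20,850 |
| | Raw | 9 | 5612 | 7741 | 316,900 |
| Crenarch | Bignorm | 8 | 104 | 159 | 3770 |
| | Diginorm | 10 | 560 | 1285 | 29,720 |
| | Raw | 10 | 8086 | 14,987 | 316,700 |
| Cyanobact | Bignorm | 9 | 144 | 153 | 5234 |
| | Diginorm | 10 | 756 | 1450 | 26,980 |
| | Raw | 10 | 9478 | 11,076 | 356,600 |
| E.coli | Bignorm | 37 | 45 | 56 | 234 |
| | Diginorm | 50 | 382 | 922 | 7864 |
| | Raw | 112 | 2522 | 6378 | 56,520 |
| SAR324 | Bignorm | 24 | 49 | 71 | 1410 |
| | Diginorm | 18 | 53 | 107 | 2473 |
| | Raw | 26 | 1086 | 2761 | 106,000 |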
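
As a rough illustration of how the four columns above relate to one another, the sketch below computes them for a toy list of coverage values. It makes two assumptions that the table itself does not state: that coverage is recorded as one integer per reference position, and that \(\mathcal{P}10\) and \(\mathcal{P}90\) are the 10th and 90th percentiles under the nearest-rank definition. The function name `coverage_summary` and the toy data are hypothetical and are not taken from the article.

```python
def coverage_summary(coverage):
    """Return P10, mean, P90 and max for a list of coverage values.

    Assumption: percentiles use the nearest-rank definition; the article
    may use a different convention.
    """
    if not coverage:
        raise ValueError("coverage must not be empty")
    ordered = sorted(coverage)
    n = len(ordered)

    def percentile(p):
        # Nearest rank: ceil(p * n / 100), computed with integer arithmetic
        # to avoid floating-point rounding, and clamped to at least 1.
        rank = max(1, (p * n + 99) // 100)
        return ordered[rank - 1]

    return {
        "P10": percentile(10),
        "mean": sum(ordered) / n,
        "P90": percentile(90),
        "max": ordered[-1],
    }


# Toy data (hypothetical): nine low-coverage positions and one extreme outlier.
toy = [1, 2, 3, 4, 5, 6, 7, 8, 9, 100]
summary = coverage_summary(toy)
print(summary)
```

In this toy input a single very high value pulls the mean (14.5) above the 90th percentile (9). The same kind of skew would explain rows such as Arco, where the reported mean exceeds \(\mathcal{P}90\).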