Skip to main content

Table 5 Filter and assembly statistics for Bignorm with Q 0=20, Diginorm, and the raw datasets (Part II)

From: An improved filtering algorithm for big read datasets and its application to single-cell assembly

Dataset

Algorithm

N50

Longest contig length

Genomic fraction

Misassembled contig length

  

abs

% of raw

% of Diginorm

abs

% of raw

% of Diginorm

abs

% of raw

% of Diginorm

abs

% of raw

% of Diginorm

Aceto

Bignorm

2324

79

105

11,525

98

100

91

97

97

52,487

148

178

 

Diginorm

2216

76

 

11,525

98

 

94

100

 

29,539

84

 
 

Raw

2935

  

11,772

  

94

  

35,351

  

Alphaproteo

Bignorm

11,750

94

115

43,977

91

95

98

101

105

52,001

120

89

 

Diginorm

10,213

82

 

46,295

95

 

93

95

 

58,184

134

 
 

Raw

12,446

  

48,586

  

98

  

43,388

  

Arco

Bignorm

3320

81

97

12,808

57

57

85

100

97

76,797

99

91

 

Diginorm

3434

84

 

22,463

100

 

88

103

 

84,613

109

 
 

Raw

4092

  

22,439

  

85

  

77,888

  

Arma

Bignorm

18,432

102

107

108,140

100

100

98

100

100

774,291

91

103

 

Diginorm

17,288

96

 

108,498

100

 

98

100

 

748,560

88

 
 

Raw

18,039

  

108,498

  

98

  

849,085

  

ASZN2

Bignorm

19,788

91

88

72,685

71

88

97

99

99

2,753,167

94

105

 

Diginorm

16,591

76

 

82687

81

 

97

100

 

2,617,095

89

 
 

Raw

21,784

  

102,287

  

97

  

2,941,524

  

Bacteroides

Bignorm

3356

68

100

25,300

100

100

95

98

99

70,206

105

112

 

Diginorm

3356

68

 

25,300

100

 

96

99

 

62,882

94

 
 

Raw

4930

  

25,299

  

98

  

66,626

  

Caldi

Bignorm

50,973

82

83

143,346

89

91

100

100

100

573,836

94

68

 

Diginorm

61,108

98

 

157,479

98

 

100

100

 

839,126

138

 
 

Raw

62,429

  

160,851

  

100

  

609,604

  

Caulo

Bignorm

4515

69

95

20,255

100

107

96

98

98

60,362

86

113

 

Diginorm

4729

72

 

18,907

93

 

98

101

 

53,456

76

 
 

Raw

6562

  

20,255

  

97

  

70,161

  

Chloroflexi

Bignorm

13,418

102

109

79,605

102

102

99

100

100

666,519

95

93

 

Diginorm

12,305

93

 

78,276

100

 

100

100

 

716,473

102

 
 

Raw

13,218

  

78,276

  

99

  

703,171

  

Crenarch

Bignorm

6538

77

91

31,401

81

66

97

99

99

484,354

89

95

 

Diginorm

7148

84

 

47,803

124

 

98

100

 

510,256

94

 
 

Raw

8501

  

38,582

  

98

  

544,763

  

Cyanobact

Bignorm

5833

95

99

33,462

98

100

99

101

100

236,391

113

110

 

Diginorm

5907

96

 

33,516

98

 

99

101

 

214,574

103

 
 

Raw

6130

  

34,300

  

98

  

209,269

  

E. coli

Bignorm

112,393

100

100

268,306

94

94

96

100

100

28,966

65

65

 

Diginorm

112,393

100

 

285,311

100

 

96

100

 

44,465

100

 
 

Raw

112,393

  

285,528

  

96

  

44,366

  

SAR324

Bignorm

135,669

100

114

302,443

100

100

99

100

100

4,259,479

98

100

 

Diginorm

119,529

88

 

302,443

100

 

99

100

 

4,264,234

98

 
 

Raw

136,176

  

302,442

  

99

  

4,342,602

 Â