
Table 3 Compression execution time

From: Compressing DNA sequence databases with coil

     

| Dataset | bz2 | nrdb+ bz2 | 7z | PPMdi | coil: find_edges | coil: encode | coil: tar+bz | coil: other | coil: total |
|---|---|---|---|---|---|---|---|---|---|
| ems1 | 5.6 | 7.7 | 43.3 | 5.6 | 3.5 | 9.3 | 1.2 | 10.8 | 24.8 |
| | 5.5 | 9.2 | 48.2 | 4.7 | 3.6 | 8.1 | 1.0 | 10.4 | 23.1 |
| | 5.7 | 10.2 | 43.5 | 4.3 | 3.6 | 8.2 | 1.2 | 12.5 | 25.5 |
| ems2 | 10.3 | 15.1 | 96.5 | 9.9 | 9.5 | 20.0 | 1.0 | 19.1 | 49.5 |
| | 10.3 | 17.7 | 101.4 | 8.5 | 9.7 | 20.2 | 1.0 | 19.0 | 49.8 |
| | 10.1 | 16.8 | 95.6 | 8.3 | 9.7 | 20.3 | 1.1 | 20.8 | 51.9 |
| ems3 | 15.2 | 22.6 | 154.8 | 14.7 | 17.0 | 36.5 | 1.9 | 27.7 | 83.1 |
| | 16.7 | 24.6 | 162.9 | 12.7 | 17.3 | 35.4 | 2.2 | 28.0 | 82.9 |
| | 15.4 | 22.4 | 154.2 | 12.8 | 17.3 | 34.5 | 3.1 | 29.2 | 84.2 |
| ems4 | 20.3 | 32.0 | 216.5 | 20.0 | 25.8 | 50.6 | 3.6 | 34.3 | 114.3 |
| | 20.5 | 33.4 | 221.1 | 17.4 | 26.7 | 50.3 | 3.0 | 37.1 | 117.1 |
| | 20.0 | 31.1 | 215.0 | 17.0 | 26.0 | 49.1 | 3.3 | 39.3 | 117.7 |
| ems5 | 30.3 | 39.1 | 276.7 | 25.3 | 35.5 | 65.4 | 4.2 | 42.7 | 147.7 |
| | 25.5 | 43.5 | 280.4 | 21.5 | 35.7 | 65.5 | 4.1 | 45.2 | 150.5 |
| | 25.3 | 38.7 | 275.9 | 21.4 | 35.8 | 64.6 | 4.1 | 46.2 | 150.7 |
| ems10 | 62.5 | 85.1 | 573.8 | 49.8 | 100.5 | 179.3 | 8.4 | 100.4 | 388.6 |
| | 60.7 | 88.0 | 580.8 | 45.5 | 102.1 | 176.0 | 9.0 | 84.4 | 371.4 |
| | 50.6 | 80.1 | 575.4 | 43.5 | 100.5 | 160.8 | 9.4 | 87.7 | 358.4 |
| ems15 | 94.3 | 117.8 | 871.1 | 69.0 | 197.9 | 271.8 | 12.6 | 118.2 | 600.6 |
| | 76.7 | 136.5 | 876.5 | 64.7 | 198.5 | 276.9 | 13.7 | 130.5 | 619.5 |
| | 89.8 | 119.6 | 869.5 | 64.5 | 196.1 | 275.8 | 13.8 | 133.4 | 619.1 |
| ems20 | 101.5 | 169.9 | 1163.0 | 92.8 | 317.1 | 393.5 | 16.7 | 176.0 | 903.5 |
| | 101.7 | 179.7 | 1169.3 | 86.9 | 321.6 | 393.5 | 18.5 | 212.2 | 945.8 |
| | 120.5 | 158.8 | 1161.3 | 84.9 | 319.6 | 399.7 | 16.9 | 215.6 | 951.8 |
| ems25 | 133.0 | 207.7 | 1482.2 | 116.0 | 471.7 | 503.2 | 22.5 | 280.8 | 1278.1 |
| | 152.3 | 220.7 | 1438.9 | 105.9 | 470.9 | 467.2 | 22.3 | 218.4 | 1178.8 |
| | 171.0 | 196.3 | 1456.4 | 106.0 | 468.0 | 504.4 | 23.4 | 248.0 | 1243.8 |
| ems50 | 306.2 | 411.2 | 2882.0 | 215.4 | 1657.4 | 1172.3 | 105.4 | 716.3 | 3651.4 |
| | 340.0 | 452.4 | 2893.3 | 209.2 | 1658.2 | 1170.7 | 104.5 | 583.7 | 3517.1 |
| | 291.1 | 411.6 | 2888.3 | 207.2 | 1655.5 | 1174.7 | 107.9 | 671.8 | 3609.9 |
| ems75 | 500.7 | 712.4 | 4328.8 | 314.4 | 3517.1 | 1814.9 | 167.8 | 1173.4 | 6673.2 |
| | 506.8 | 618.1 | 4304.5 | 311.9 | 3502.1 | 1810.2 | 164.7 | 992.9 | 6469.8 |
| | 508.7 | 593.5 | 4298.9 | 317.7 | 3490.7 | 1798.6 | 165.0 | 1116.4 | 6570.6 |
| ems100 | 668.6 | | 5760.8 | 408.5 | 6064.4 | 2552.4 | 223.9 | 1421.8 | 10262.6 |
| | 634.1 | | 5707.5 | 404.1 | 6042.3 | 2524.8 | 219.3 | 1429.0 | 10215.3 |
| | 689.2 | | 5773.6 | 403.3 | 6114.1 | 2496.4 | 217.5 | 1546.2 | 10374.1 |
| ems100* | | | | | 6446.3 | 2515.4 | 218.6 | 1505.8 | 10686.1 |
| rfam_full | 32.8 | | 75.8 | 7.9 | 114.8 | 12.8 | 4.0 | 40.7 | 172.3 |
| | 29.6 | | 75.3 | 7.9 | 113.9 | 12.3 | 4.3 | 38.2 | 168.7 |
| | 29.6 | | 75.5 | 7.8 | 114.5 | 12.4 | 4.2 | 36.0 | 167.1 |

  1. All durations are in seconds. The rightmost five columns break down the execution of coil by its main component programs; the "other" column includes the time needed for the programs extract_seqs, make_index and select_lines.
  2. *This row shows the result of using the Pentium 4-optimised version of find_edges – surprisingly, this version of find_edges is actually about 6% slower than the original version on this CPU.
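
The two notes can be checked against the table with a little arithmetic. The sketch below is illustrative only: it assumes the "about 6%" figure compares the ems100* find_edges time with the mean of the three standard ems100 runs (the source does not say which baseline was used), and the function and variable names are made up for this example.

```python
from statistics import mean

# find_edges times in seconds for the three standard ems100 runs (from the table)
original_runs = [6064.4, 6042.3, 6114.1]
# find_edges time in seconds for the Pentium 4-optimised run (row ems100*)
optimised_run = 6446.3

def relative_slowdown(baseline, candidate):
    """Fractional increase of `candidate` over the mean of `baseline`."""
    base = mean(baseline)
    return (candidate - base) / base

# Assumed baseline: the mean of the three standard runs
slowdown = relative_slowdown(original_runs, optimised_run)
print(f"mean original find_edges time: {mean(original_runs):.1f} s")
print(f"slowdown of optimised version: {slowdown:.1%}")

# Note 1: the coil total is the sum of its four component columns.
# Values below are the ems100* row: find_edges, encode, tar+bz, other.
components = [6446.3, 2515.4, 218.6, 1505.8]
component_sum = sum(components)
print(f"sum of components: {component_sum:.1f} s (table total: 10686.1 s)")
```

Under that assumption the slowdown comes out at roughly 6%, in line with the note, and the four component times for ems100* add up to the reported total. In some other rows the sum differs from the total by 0.1 s, presumably because each column was rounded independently.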