Skip to main content

Table 4 Runtimes (in seconds) and corresponding speedups of SparkEC over CloudEC for all datasets and k-mer values

From: SparkEC: speeding up alignment-based DNA error correction tools

Dataset

K

#Nodes

CloudEC

SparkEC

Speedup

D1

24

5

29,862

11,951

2.5

9

13,697

4429

3.1

13

9150

2731

3.4

55

5

21,909

9693

2.3

9

10,135

2792

3.6

13

6216

1785

3.5

D2

24

5

13,307

5289

2.5

9

5351

1659

3.2

13

3309

971

3.4

55

5

8688

4035

2.2

9

3889

1250

3.1

13

2594

700

3.7

D3

24

5

11,609

4865

2.4

9

4831

1885

2.6

13

3167

1113

2.8

55

5

8502

4892

1.7

9

3679

1348

2.7

13

2616

756

3.5

D4

24

5

43,506

7473

5.8

9

20,383

2484

8.2

13

14,334

1511

9.5

55

5

21,723

3987

5.4

9

11,543

1146

10.1

13

7617

648

11.8

D5

24

5

26,959

8607

3.1

9

12,611

6486

1.9

13

7729

3927

2.0

55

5

14,374

3698

3.9

9

5827

2233

2.6

13

3577

1437

2.5

D6

24

5

31,146

3286

9.5

9

18,511

2047

9.0

13

11,232

942

11.9

55

5

17,806

2105

8.5

9

10,021

1180

8.5

13

6514

687

9.5