From: Coupling SIMD and SIMT architectures to boost performance of a phylogeny-aware alignment kernel
Execution times | GCUPS | Speedup GPU vs | ||||||
---|---|---|---|---|---|---|---|---|
Reference length | Seq | SSE(4) | GPU | Seq | SSE(4) | GPU | Seq | SSE(4) |
500 | 40.79 | 1.11 | 1.89 | 0.49 | 18.01 | 8.4 | 21.58 | 0.59 |
1000 | 90.45 | 2.58 | 2.5 | 0.44 | 15.52 | 14.4 | 36.18 | 1.03 |
5000 | 494.65 | 14.30 | 9.23 | 0.40 | 14.30 | 21.2 | 53.59 | 1.55 |
10000 | 1006.1 | 29.27 | 18.31 | 0.40 | 13.66 | 21.6 | 54.95 | 1.60 |
50000 | 5103.4 | 319.92 | 90.95 | 0.39 | 6.25 | 21.9 | 56.11 | 3.52 |
100000 | 10369 | 1785.8 | 181.31 | 0.38 | 2.24 | 22.0 | 57.19 | 9.85 |
500000 | 51448 | 9005.3 | 906.21 | 0.39 | 2.22 | 22.1 | 56.77 | 9.94 |