Table 2 Benchmark of elapsed time (ms) for each step of runtime compilation

From: CUDAMPF: a multi-tiered parallel framework for accelerating protein sequence search in HMMER on CUDA-enabled GPU

| Steps of Runtime Compilation | MSV/SSV kernel, Intel i5 (a) | MSV/SSV kernel, Intel Xeon (b) | P7Viterbi kernel, Intel i5 (a) | P7Viterbi kernel, Intel Xeon (b) |
| --- | --- | --- | --- | --- |
| file into string | 0.05/0.04 | 0.07/0.08 | 0.05 | 0.09 |
| Create and compile nvrtcProgram | 450/431 | 855/812 | 422 | 836 |
| Get PTX string and kernel handle | 0.35/0.77 | 5.38/5.63 | 0.27 | 7.64 |
a. Intel i5-3570K quad-core 3.4 GHz CPU and 64-bit Ubuntu Linux
b. Intel Xeon E5620 octa-core 2.4 GHz CPU and 64-bit CentOS Linux
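
The per-step times in Table 2 can be summed to see how the total runtime-compilation cost splits across steps. The following is a minimal sketch, not part of the original article. It assumes that each slash-separated pair in the MSV/SSV columns lists the MSV value first and the SSV value second, and it reproduces the first row label as it appears in the table. The names `times_ms` and `summarize` are hypothetical helpers introduced only for this illustration.

```python
# Per-step elapsed times in ms, copied from Table 2.
# Assumption: in the MSV/SSV columns, "x/y" means MSV = x and SSV = y.
steps = [
    "file into string",
    "Create and compile nvrtcProgram",
    "Get PTX string and kernel handle",
]

times_ms = {
    ("MSV", "Intel i5"): [0.05, 450, 0.35],
    ("SSV", "Intel i5"): [0.04, 431, 0.77],
    ("MSV", "Intel Xeon"): [0.07, 855, 5.38],
    ("SSV", "Intel Xeon"): [0.08, 812, 5.63],
    ("P7Viterbi", "Intel i5"): [0.05, 422, 0.27],
    ("P7Viterbi", "Intel Xeon"): [0.09, 836, 7.64],
}


def summarize(times):
    """Return {(kernel, cpu): (total_ms, compile_share)}.

    compile_share is the fraction of the total spent in the
    'Create and compile nvrtcProgram' step (index 1 in each list).
    """
    summary = {}
    for key, values in times.items():
        total = sum(values)
        summary[key] = (round(total, 2), values[1] / total)
    return summary


summary = summarize(times_ms)
for (kernel, cpu), (total, share) in summary.items():
    print(f"{kernel:10s} {cpu:11s} total = {total:7.2f} ms, compile share = {share:.2%}")
```

Under these assumptions, the "Create and compile nvrtcProgram" step accounts for more than 99% of the summed time for every kernel and CPU combination, so the other two steps contribute only a few milliseconds at most.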