Skip to main content

Table 2 Benchmark of elapsed time (ms) for each steps of runtime compilation

From: CUDAMPF: a multi-tiered parallel framework for accelerating protein sequence search in HMMER on CUDA-enabled GPU

Steps of Runtime

MSV/SSV kernel

P7Viterbi kernel

Compliation

Intel i5a

Intel Xeonb

Intel i5

Intel Xeon

Read.cu.cuh file into string

0.05/0.04

0.07/0.08

0.05

0.09

Create and compile nvrtcProgram

450/431

855/812

422

836

Get PTX string and kernel handle

0.35/0.77

5.38/5.63

0.27

7.64

  1. aIntel i5-3570K quad-core 3.4 GHz CPU and 64-bit Ubuntu Linux
  2. bIntel Xeon E5620 octa-core 2.4 GHz CPU and 64-bit Centos Linux