Call cublas in a kernel

后端 未结 1 1645

I want to use Zgemv in parallel.

__global__ void S_Cphir(cuDoubleComplex *S,cuDoubleComplex *A,cuDoubleComplex *B, int n,int l)
{
    ....
cublasZgemv(handl         


        
相关标签:
1条回答
  • 2021-01-20 06:04

    Use the CUBLAS Device Library sample code as your reference. On a standard CUDA 5.5 install, you'll find it at:

    /usr/local/cuda/samples/7_CUDALibraries/simpleDevLibCUBLAS
    

    Referring to the Makefile in that directory, your compile commands should be like this:

    nvcc -arch=sm_35 -rdc=true -o main.v2 time_propagation_cublas.cu -lcublas -lcublas_device -lcudadevrt
    
    0 讨论(0)
提交回复
热议问题