Converting this function from C++ to CUDA

I'm not a CUDA expert, but I would like to run some code on the GPU to speed up my program. I've already used AVX2 intrinsics, but that is not enough for this critical part.
