Is it possible to collect performance metrics in CUDA per region inside the kernel?

粉色の甜心 2021-01-24 18:38

I have the following fragment of code for which I want to count the number of executed instructions.

#include 
#include 

__global__ v         


        