Difference on creating a CUDA context

后端 未结 1 1616
你的背包
你的背包 2021-01-21 07:53

I\'ve a program that uses three kernels. In order to get the speedups, I was doing a dummy memory copy to create a context as follows:

__global__ void warmStart(         


        
相关标签:
1条回答
  • 2021-01-21 08:29

    Each CUDA context has memory allocations that are required to execute a kernel that are not required to be allocated to syncrhonize, allocate memory, or free memory. The initial allocation of the context memory and resizing of these allocations is deferred until a kernel requires these resources. Examples of these allocations include the local memory buffer, device heap, and printf heap.

    0 讨论(0)
提交回复
热议问题