I am trying to figure out if using cudaHostAlloc (or cudaMallocHost?) is appropriate.
I am trying to run a kernel where my input data is larger than the amount of memory available on the device.
Using host memory would be orders of magnitude slower than using on-device memory. It has both very high latency and very limited throughput. For example, the capacity of PCIe x16 is a mere 8 GB/s, while the bandwidth of device memory on a GTX 460 is 108 GB/s.
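For reference, here is a minimal sketch of what mapped pinned (zero-copy) memory looks like, since that is what `cudaHostAlloc` enables. The kernel name `scale`, the array size, and the error handling (omitted) are just placeholders; the point is that every access from the kernel travels over PCIe, so this path is bandwidth-bound by the bus, not by device memory:

    #include <stdio.h>
    #include <cuda_runtime.h>

    // Hypothetical kernel reading directly from mapped (zero-copy) host memory.
    __global__ void scale(const float *in, float *out, int n, float factor)
    {
        int i = blockIdx.x * blockDim.x + threadIdx.x;
        if (i < n)
            out[i] = in[i] * factor;   // each read/write crosses PCIe
    }

    int main(void)
    {
        // Must be set before any allocation if you want mapped host memory
        // (required on older hardware such as the GTX 460 era).
        cudaSetDeviceFlags(cudaDeviceMapHost);

        const int n = 1 << 20;                 // arbitrary example size
        size_t bytes = n * sizeof(float);

        // Pinned host allocations that are mapped into the device address space.
        float *h_in, *h_out;
        cudaHostAlloc((void **)&h_in,  bytes, cudaHostAllocMapped);
        cudaHostAlloc((void **)&h_out, bytes, cudaHostAllocMapped);

        for (int i = 0; i < n; ++i)
            h_in[i] = (float)i;

        // Device pointers that alias the host allocations.
        float *d_in, *d_out;
        cudaHostGetDevicePointer((void **)&d_in,  h_in,  0);
        cudaHostGetDevicePointer((void **)&d_out, h_out, 0);

        // Throughput here is limited by PCIe (~8 GB/s on x16 gen2),
        // not by the ~100 GB/s of on-board device memory.
        scale<<<(n + 255) / 256, 256>>>(d_in, d_out, n, 2.0f);
        cudaDeviceSynchronize();

        printf("out[42] = %f\n", h_out[42]);

        cudaFreeHost(h_in);
        cudaFreeHost(h_out);
        return 0;
    }

In practice, if the working set exceeds device memory, it is usually faster to split the input into chunks, copy each chunk to the device (overlapping copies and kernels with streams), and process it there, rather than have the kernel read host memory directly.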