How to make parallel cudaMalloc fast?

前端 未结 0 463
孤城傲影
孤城傲影 2021-02-08 08:17

When allocating a lot of memory on 4 distinct NVIDIA V100 GPUs, I observe the following behavior with regards to parallelization via OpenMP:

Using the #pra

相关标签:
回答
  • 消灭零回复
提交回复
热议问题