Multiple processes launching CUDA kernels in parallel

后端 未结 3 1648
逝去的感伤
逝去的感伤 2021-02-04 07:02

I know that NVIDIA gpus with compute capability 2.x or greater can execute u pto 16 kernels concurrently. However, my application spawns 7 \"processes\" and each of these 7 proc

3条回答
  •  时光取名叫无心
    2021-02-04 07:46

    To add to the answer of @talonmies

    In the newer architectures, by the use of MPS multiple processes can launch multiple kernels concurrently. So, now it is definitely possible which was not sometime before. For a detailed understanding read this article.

    https://docs.nvidia.com/deploy/pdf/CUDA_Multi_Process_Service_Overview.pdf

    Additionally, you can also see maximum number of concurrent kernels allowed per cuda compute capability type supported by different GPUs. Here is a link to that:

    https://en.wikipedia.org/wiki/CUDA#Version_features_and_specifications

    For example a GPU with cuda compute capability of 7.5 can have maximum of 128 Cuda kernels launched to it.

提交回复
热议问题