Why is my inclusive scan code 2x faster on CPU than on a GPU?

前端 未结 1 1917
南笙
南笙 2020-12-12 06:42

I wrote a short CUDA program that uses the highly-optimized CUB library to demonstrate that one core from an old, quad-core Intel Q6600 processor (all four are supposedly ca

相关标签:
1条回答
  • 2020-12-12 07:09

    Thanks to Robert Crovella, it turns out I was using the "Debug" mode that is notoriously slow instead of "Release" mode.

    0 讨论(0)
提交回复
热议问题