CUDA to solve many “small/moderate” linear systems

后端 未结 2 529
日久生厌
日久生厌 2021-01-16 05:59

Some background info on the problem I am trying to speed up using CUDA:

I have a large number of small/moderate same-sized linear systems I need to solve independent

相关标签:
2条回答
  • 2021-01-16 06:38

    MATLAB provides a way to call the cublas batch interface for GPU arrays using pagefun.

    0 讨论(0)
  • 2021-01-16 06:43

    Try using two or more parallel streams (with one linear system each) on the GPU, possibly this helps utilizing a bigger part of the GPU.

    For timing measurments and hardware utilization use the visual profiler instead of CPU time measurements.

    Another point is, that the GTX (consumer) GPUs perform pretty bad on double preision. If you have the chance, try to use a Tesla GPU instead.

    0 讨论(0)
提交回复
热议问题