GPU library that implements Image Convolution using cuFFT?
Question: I've been using the image convolution function from NVIDIA Performance Primitives (NPP). However, my kernel is fairly large relative to the image size, and I've heard rumors that NPP's convolution is a direct convolution rather than an FFT-based one. (I don't think the NPP source code is available, so I'm not sure how it's implemented.) I'd like to see how fast a cuFFT-based convolution function could run in the image processing application that I'm working on. You might say "hey,