Cuda Fortran 4D array
问题 My code is being slowed down by a my 4D arrays access in global memory. I am using PGI compiler 2010. The 4D array I am accessing is read only from the device and the size is known at run time. I wanted to allocate to the texture memory and found that my PGI version does not support texture. As the size is known only at run time, it is not possible to use constant memory too. Only One dimension is known at compile time like this MyFourD(100, x,y,z) where x,y,z are user input. My first idea is