Concurrent kernel execution not working in AMD A10 APU
问题 I have an AMD A10 APU with Radeon R7 GPU. I believe this device supportes concurrent kernel execution. But when i wrote the following code and obtained profiling information it doesnt seem like the kernels are executing concurrently. My openCL code is given below (The kernels within each iteration is added to the same queue and kernels in different iteration are added to different queues and hence should be running in parallel). for(j = 0; j < 8; j++){ cl_err = clEnqueueNDRangeKernel(queue[4