Efficient CUDA file organization: .cpp, .h, .cu, .cuh, and .curnel files
What is the easiest-to-understand and most efficient way to organize code for CUDA? After some investigation, I found that CUDA function declarations should go in .cuh files, their implementations in .cu files, and kernel function implementations in .curnel files. The other C++ code goes in .cpp and .h files as usual. Recently I posted a question, "visual studio .cu file shows syntax error but compile successfully". Is this organization correct, where the .cpp file calls into the .cu file, which in turn calls the kernel function in the .curnel file?

h, cpp, c, hpp, inc - files that don't contain CUDA C code (e.g. __device__ and other