I have developing a project using CUDA and openmp.
void function() { float *data_h = (float *)malloc(data_size); omp_set_num_threads(n); #pragma omp