I want to use cudaMemcpyAsyn in the copying data from host to the device. To do it, memory in the host should be allocated non-pagable by using cudaMallocHost
cudaMemcpyAsyn
cudaMallocHost