cuda 5.0 dynamic parallelism error: ptxas fatal . unresolved extern function 'cudaLaunchDevice

前端 未结 2 2069
执笔经年
执笔经年 2020-12-06 20:32

I am using tesla k20 with compute capability 35 on Linux with CUDA 5.With a simple child kernel call it gives a compile error : Unresolved extern function cudaLaunchDe

相关标签:
2条回答
  • 2020-12-06 20:43

    Perhaps I'm somewhat off topic, but I would like to mention that I had the same issue under Windows/Visual Studio 2010 and I have solved the problem using the last comment by talonmies in few steps.

    1) View -> Property Pages
    2) Configuration Properties -> CUDA C/C++ -> Common -> Generate Relocatable Device Code -> Yes (-rdc=true)
    3) Configuration Properties -> CUDA C/C++ -> Device -> Code Generation -> compute_35,sm_35
    4) Configuration Properties -> Linker -> Input -> Additional Dependencies -> cudadevrt.lib
    

    I hope that this information is useful.

    0 讨论(0)
  • 2020-12-06 20:51

    You must explicitly compile with relocatable device code enabled and link the device runtime library in order to use dynamic parallelism. So your compilation command must include --relocatable-device-code true and the linking command (which you haven't shown us) should include -lcudadevrt.

    This procedure is described in detail in the "TOOLKIT SUPPORT FOR DYNAMIC PARALLELISM" section of the Dynamic Parallelism Programming Guide pdf, available here.

    0 讨论(0)
提交回复
热议问题