I am really curious about the performance of dynamic parallelism and the recursion abillity that gives you. So i make a simple benchmark of a cpu recursion and a device(gpu) rec