In my kernel function I defined:
__shared__ float shared_var;
which is not a pointer/array (as it is in the examples in the NVIDIA docs). Then I initia