For this reinforcement learning problem I am using the Reinforce.jl package. I am very new to programming and RL. I am attempting here to create an RL method for the cake eat