I want to develop an end-to-end machine learning application where data will be in gpu-memory and computations will run on the gpu. A stateless RESTfull service with a datab