I am building a data engineering platform for my organization (as a POC, for now), which will be used only in-house by at most 20 people. The idea is to use: Spark (on Ku