Currently, I am doing a project that downloads images from a source and stores them to HDFS for image processing. So, I am quite new to Hadoop and Spark. A general structure