I am preprocessing data with Spark (Scala) on AWS. Preprocessing reduced the sample data from 5 GB to 2.7 GB, and I want to finally store it in