The purpose of this is in order to manipulate and save a copy of each data file in a second location in HDFS. I will be using
RddName.coalesce(1).saveAsTex
sc.wholeTextFiles(path) should help. It gives an rdd of (filepath, filecontent).