I'm using Python on Spark and would like to get a CSV into a DataFrame.
The documentation for Spark SQL strangely does not provide explanations for CSV as a source.
For PySpark, assuming the first row of the CSV file contains a header:
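    from pyspark.sql import SparkSession

    # Create (or reuse) a session, then read the CSV.
    # header=True takes column names from the first row, inferSchema=True guesses column types,
    # and mode="DROPMALFORMED" skips rows that cannot be parsed.
    spark = SparkSession.builder.appName('chosenName').getOrCreate()
    df = spark.read.csv('fileNameWithPath', mode="DROPMALFORMED", inferSchema=True, header=True)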
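If you already know the column types, you could pass an explicit schema instead of inferSchema=True, which avoids the extra pass over the data that inference needs. Here is a minimal sketch, assuming a hypothetical two-column file (name, age). The helper names, the sample rows, and the app name are made up for illustration and are not from the original answer:

```python
import csv
import os
import tempfile


def write_sample_csv(path):
    """Write a tiny CSV with a header row (illustrative data only)."""
    rows = [["name", "age"], ["alice", "34"], ["bob", "45"]]
    with open(path, "w", newline="") as f:
        csv.writer(f).writerows(rows)
    return path


def read_csv_with_schema(path):
    """Read the CSV at `path` into a DataFrame using an explicit schema."""
    # PySpark is imported inside the function so the file-writing helper
    # above can be used even where Spark is not installed.
    from pyspark.sql import SparkSession
    from pyspark.sql.types import IntegerType, StringType, StructField, StructType

    spark = SparkSession.builder.appName("csvSchemaExample").getOrCreate()

    # Assumed layout: a string column followed by an integer column.
    schema = StructType([
        StructField("name", StringType(), True),
        StructField("age", IntegerType(), True),
    ])

    # header=True skips the first row as column names;
    # DROPMALFORMED asks Spark to skip rows that do not fit the schema.
    return spark.read.csv(path, schema=schema, header=True, mode="DROPMALFORMED")
```

To try it, write the sample file with write_sample_csv(os.path.join(tempfile.mkdtemp(), "people.csv")), pass the returned path to read_csv_with_schema, and call printSchema() or show() on the result.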