I would like to read in a file with the following structure with Apache Spark.
628344092\\t20070220\\t200702\\t2007\\t2007.1370
The delimiter
Use spark.read.option("delimiter", "\t").csv(file) or sep instead of delimiter.
spark.read.option("delimiter", "\t").csv(file)
sep
delimiter
If it's literally \t, not tab special character, use double \: spark.read.option("delimiter", "\\t").csv(file)
\t
\
spark.read.option("delimiter", "\\t").csv(file)