I am a beginner in Spark and I am trying to create a DataFrame based on the content of JSON file using PySpark by following the guide: http://spark.apache.org/docs/1.6.1/sql-pro
You must specify the file system protocol:
But also, the path must exist where the Spark driver and worker(s) are executed.