I created an Amazon EMR cluster with Spark preinstalled. When I SSH into the cluster and run pyspark from the terminal, it opens the PySpark shell.
I uploaded a
Try using findspark. Install it from the shell with pip:
pip install findspark
Sample code:
# Import package(s).
import findspark
findspark.init()
from pyspark import SparkContext
from pyspark.sql import SQLContext
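For what it's worth, findspark.init() mostly just locates the Spark install and puts its Python directories on sys.path so that a plain Python session can import pyspark. Below is a rough stdlib-only sketch of that idea, not findspark's actual code. The function name add_spark_to_path is made up for illustration, and the /usr/lib/spark fallback is my assumption about where EMR installs Spark; check SPARK_HOME on your cluster.

```python
import glob
import os
import sys


def add_spark_to_path(spark_home=None):
    """Make pyspark importable by adding Spark's Python dirs to sys.path.

    Hypothetical helper that approximates the idea behind findspark.init().
    """
    # Assumption: fall back to /usr/lib/spark if SPARK_HOME is not set.
    spark_home = spark_home or os.environ.get("SPARK_HOME", "/usr/lib/spark")
    spark_python = os.path.join(spark_home, "python")

    # Spark bundles py4j as a zip under python/lib; it must be on the path too.
    py4j_zips = glob.glob(os.path.join(spark_python, "lib", "py4j-*.zip"))

    added = []
    for path in [spark_python] + py4j_zips:
        if path not in sys.path:
            sys.path.insert(0, path)
            added.append(path)

    # Export SPARK_HOME so that pyspark can find the Spark install later.
    os.environ["SPARK_HOME"] = spark_home
    return added
```

If the paths are right, from pyspark import SparkContext should then work in the same session. In practice findspark handles more cases than this, so I would still just use it.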