AttributeError: 'NoneType' object has no attribute 'setCallSite'

前端 未结 3 551
借酒劲吻你
借酒劲吻你 2021-01-23 05:47

In PySpark, I want to calculate the correlation between two dataframe vectors, using the following code (I do not have any problem in importing pyspark or createDataFrame):

3条回答
  •  旧时难觅i
    2021-01-23 06:02

    There's an open resolved issue around this:

    https://issues.apache.org/jira/browse/SPARK-27335?jql=text%20~%20%22setcallsite%22

    [Note: as it's resolved, if you're using a more recent version of Spark than October 2019, please report to Apache Jira if you're still encountering this issue]

    The poster suggests forcing to sync your DF's backend with your Spark context:

    df.sql_ctx.sparkSession._jsparkSession = spark._jsparkSession
    df._sc = spark._sc
    

    This worked for us, hopefully can work in other cases as well.

提交回复
热议问题