How to create an empty DataFrame? Why “ValueError: RDD is empty”?

后端 未结 11 1090
孤城傲影
孤城傲影 2021-02-01 03:48

I am trying to create an empty dataframe in Spark (Pyspark).

I am using similar approach to the one discussed here enter link description here, but it is not working.

11条回答
  •  北恋
    北恋 (楼主)
    2021-02-01 04:19

    At the time this answer was written it looks like you need some sort of schema

    from pyspark.sql.types import *
    field = [StructField("field1", StringType(), True)]
    schema = StructType(field)
    
    sc = spark.sparkContext
    sqlContext.createDataFrame(sc.emptyRDD(), schema)
    

提交回复
热议问题