发表新帖

发表新帖

Spark scala remove columns containing only null values

前端未结

关注

 3  1890

南笙 2021-01-12 13:47

Is there a way to remove the columns of a spark dataFrame that contain only null values ? (I am using scala and Spark 1.6.2)

At the moment I am doing this:

3条回答

走了就别回头了 (楼主)

2021-01-12 14:48
If the dataframe is of reasonable size, I write it as json then reload it. The dynamic schema will ignore null columns and you'd have a lighter dataframe.

scala snippet:
```
originalDataFrame.write(tempJsonPath)
val lightDataFrame = spark.read.json(tempJsonPath)
```
0 讨论(0)

查看其它3个回答
发布评论:

提交评论
- 加载中...

热议问题