In pyspark, after reading data from impala/hive, how is it partitioned?

前端 未结 0 1540
隐瞒了意图╮
隐瞒了意图╮ 2020-12-13 06:03

I would like to make sure my assumption is right about partitions right after reading data.

By default, a RDD in my cluster has 200 partitions.

I read data fro

相关标签:
回答
  • 消灭零回复
提交回复
热议问题