I am working with AWS and I have workflows that use Spark and Hive. My data is partitioned by date, so every day I get a new partition in my S3 storage.
My problem is when a day's load goes wrong and I have to re-run it: how can I overwrite only that specific partition without touching the rest of the table?
I would suggest running SQL through the SparkSession. You can run an `INSERT OVERWRITE ... PARTITION (...)` query, selecting the columns from your existing dataset. This approach overwrites only the target partition and leaves the others intact.
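For illustration, here is a minimal PySpark sketch of that suggestion. The table name `events`, the column names, the S3 staging path, and the example date are all placeholders, not from the question:

```python
# Minimal sketch: overwrite a single date partition via Spark SQL.
# `events`, the column names, the S3 path, and the date are hypothetical.
from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    .appName("overwrite-single-partition")
    .enableHiveSupport()  # lets Spark SQL see the Hive metastore
    .getOrCreate()
)

# Load the corrected data for the bad day and expose it to SQL.
fixed_df = spark.read.parquet("s3://my-bucket/staging/2019-01-01/")
fixed_df.createOrReplaceTempView("fixed_day")

# INSERT OVERWRITE with a static PARTITION clause rewrites only that
# partition; every other date partition in the table is left untouched.
# The SELECT lists only the non-partition columns, because the partition
# value is fixed in the PARTITION clause.
spark.sql("""
    INSERT OVERWRITE TABLE events
    PARTITION (`date` = '2019-01-01')
    SELECT col1, col2, col3
    FROM fixed_day
""")
```

As a side note, on Spark 2.3+ you can get similar behavior from the DataFrame writer itself by setting `spark.sql.sources.partitionOverwriteMode` to `dynamic`, so that `mode("overwrite")` replaces only the partitions present in the incoming data.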