Spark job runs very slowly when the same data written to S3 is read back and ingested into Snowflake

终归单人心 2021-02-11 23:47

I am running a PySpark job that performs the following steps:

  1. Read data from Cassandra.
  2. Calculate an MD5 hash over the entire row and add a column containing the MD5 value.
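
For reference, step 2 could look roughly like the sketch below. This is not the code from the job in question, only one common way to hash a whole row. The names `add_row_md5`, `row_md5`, the output column `row_md5` and the `||` separator are all made up for illustration. It assumes every column can be cast to a string, and it replaces NULLs with an empty string because `concat_ws` would otherwise skip them.

```python
import hashlib


def add_row_md5(df, sep="||", out_col="row_md5"):
    """Return df with one extra column holding the MD5 of the whole row.

    Hypothetical helper: the function name, separator and column name are
    assumptions, not taken from the original job.
    """
    # Imported here so the pure-Python helper below works without Spark.
    from pyspark.sql import functions as F

    # Cast every column to string; map NULL to "" so that concat_ws
    # does not silently drop the column from the hashed string.
    parts = [F.coalesce(F.col(c).cast("string"), F.lit("")) for c in df.columns]
    return df.withColumn(out_col, F.md5(F.concat_ws(sep, *parts)))


def row_md5(values, sep="||"):
    """Pure-Python version of the same rule, for spot-checking a single row.

    Python's str() and Spark's cast("string") can format some types
    differently (floats, booleans, timestamps), so compare string or
    integer columns only.
    """
    joined = sep.join("" if v is None else str(v) for v in values)
    return hashlib.md5(joined.encode("utf-8")).hexdigest()
```

Usage would be something like `df_hashed = add_row_md5(df)` on the DataFrame read from Cassandra. Note that the hash depends on column order, since it follows `df.columns`.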