Slow write to S3 when logic included in existing code

前端 未结 0 1398
面向向阳花
面向向阳花 2021-01-29 11:34

I have a PySpark code whose last step is to write data to S3 in parquet format. It looks something like this

df = generated_by_some_logic
df.cache()
df.count()
df         


        
相关标签:
回答
  • 消灭零回复
提交回复
热议问题