Using set_index() on a Dask DataFrame and writing to Parquet causes memory explosion

遥遥无期 2020-11-22 06:44

I have a large set of Parquet files that I am trying to sort on a column. Uncompressed, the data is around 14 GB, so Dask seemed like the right tool for the job. All I'm do
