Dask dataframe split partitions based on a column or function

前端 未结 2 1050
情话喂你
情话喂你 2021-02-14 14:37

I have recently begun looking at Dask for big data. I have a question on efficiently applying operations in parallel.

Say I have some sales data like this:

cu         


        
2条回答
  •  旧巷少年郎
    2021-02-14 15:07

    Setting index to the required column and map_partitions works much efficient compared to groupby

提交回复
热议问题