specify how to partition dask dataframe?

自闭症网瘾萝莉.ら 提交于 2021-01-28 01:51:30

问题


I have a pandas df that's indexed by id and date. I would like to run some regressions for each id in parallel using dask. I know dask splits the df into N partitions but is there a way to force it to split by id column? This way when I do map_partitions I can simply apply my rolling regression function to each partition.

来源:https://stackoverflow.com/questions/51698459/specify-how-to-partition-dask-dataframe

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!