“Large data” work flows using pandas

前端 未结 16 1778
被撕碎了的回忆
被撕碎了的回忆 2020-11-21 07:32

I have tried to puzzle out an answer to this question for many months while learning pandas. I use SAS for my day-to-day work and it is great for it\'s out-of-core support.

16条回答
  •  余生分开走
    2020-11-21 08:00

    It is worth mentioning here Ray as well,
    it's a distributed computation framework, that has it's own implementation for pandas in a distributed way.

    Just replace the pandas import, and the code should work as is:

    # import pandas as pd
    import ray.dataframe as pd
    
    #use pd as usual
    

    can read more details here:

    https://rise.cs.berkeley.edu/blog/pandas-on-ray/

提交回复
热议问题