pyarrow version 1.0 bug throws Out Of Memory exception while reading large number of files using ParquetDataset (works fine with version 0.13)

前端 未结 0 1108
礼貌的吻别
礼貌的吻别 2021-02-13 14:34

I have a dataframe split and stored in more than 5000 files. I use ParquetDataset(fnames).read() to load all files. I updated the pyarrow to latest version 1.0.1 from 0.13.0 and

相关标签:
回答
  • 消灭零回复
提交回复
热议问题