Methods for writing Parquet files using Python?

再見小時候 2021-02-02 09:30

I'm having trouble finding a library that allows Parquet files to be written using Python. Bonus points if I can use Snappy or a similar compression mechanism in conjunction wi

6 answers
  •  滥情空心
    2021-02-02 10:23

    PySpark seems to be the best option right now for writing Parquet from Python. It may seem like using a sword in place of a needle, but that's how it is at the moment.

    • It supports most compression codecs, such as LZO and Snappy. Zstd support should arrive soon.
    • It has complete schema support (nested types, structs, etc.).

    Simply run pip install pyspark and you are good to go.

    https://spark.apache.org/docs/latest/sql-data-sources-parquet.html
