Parquet without Hadoop?

后端 未结 6 1589
北海茫月
北海茫月 2021-01-01 12:21

I want to use parquet in one of my projects as columnar storage. But i dont want to depends on hadoop/hdfs libs. Is it possible to use parquet outside of hdfs? Or What is th

6条回答
  •  野趣味
    野趣味 (楼主)
    2021-01-01 12:42

    Investigating the same question I found that apparently it's not possible for the moment. I found this git issue, which proposes decoupling parquet from the hadoop api. Apparently it has not been done yet.

    In the Apache Jira I found an issue, which asks for a way to read a parquet file outside hadoop. It is unresolved by the time of writing.

    EDIT:

    Issues are not tracked on github anymore (first link above is dead). A newer issue I found is located on apache's Jira with the following headline:

    make it easy to read and write parquet files in java without depending on hadoop

提交回复
热议问题