Hive: parsing JSON

前端 未结 4 2049
遥遥无期
遥遥无期 2021-02-04 04:11

I am trying to get some values out of nested JSON for millions of rows (5 TB+ table). What is the most efficient way to do this?

Here is an example:

{\"c         


        
4条回答
  •  时光说笑
    2021-02-04 04:39

    Implementing a SerDe to parse your data in JSON is a better way for your case.

    A tutorial on how to implement SerDe for parsing JSON can be found here

    http://blog.cloudera.com/blog/2012/12/how-to-use-a-serde-in-apache-hive/

    You can use the following sample SerDe implementation as well

    https://github.com/rcongiu/Hive-JSON-Serde

提交回复
热议问题