In a hadoop cluster, should hive be installed on all nodes?

后端 未结 3 1267
伪装坚强ぢ
伪装坚强ぢ 2021-02-07 10:55

I am a newbie to Hadoop / Hive and I have just started reading the docs. There are lots of blogs on installing Hadoop in cluster mode. Also, I know that Hive runs on top of Hado

相关标签:
3条回答
  • 2021-02-07 11:14

    No, it is not something you install on worker nodes. Hive is a Hadoop client. Just run Hive according to the instructions you see at the Hive site.

    0 讨论(0)
  • 2021-02-07 11:16

    Hive is basically used for processing structured and semi-structured data in Hadoop. We can also perform Analysis of large datasets which is present in HDFS and also in Amazon S3 filesystem using Hive. In order to query data hive also provides query language known as HiveQL which is similar to SQL. Using Hive one can easily run Ad-hoc queries for the data analysis. Using Hive we don’t need to write complex Map-Reduce jobs, we just need to submit SQL queries. Hive converts these SQL queries into MapReduce jobs.

    Finally Hive SQL will get converted to MapReduce jobs and we don't have to submit MapReduce job from all node in a Hadoop cluster, in the same way we don't need Hive to be installed in all node of Hadoop cluster

    0 讨论(0)
  • 2021-02-07 11:34

    From Cloudera's Hive installation Guide:

    Install Hive on your client machine(s) from which you submit jobs; you do not need to install it on the nodes in your Hadoop cluster.

    0 讨论(0)
提交回复
热议问题