How to use Hive without hadoop

后端 未结 5 660
孤街浪徒
孤街浪徒 2021-01-05 08:30

I am a new to NoSQL solutions and want to play with Hive. But installing HDFS/Hadoop takes a lot of resources and time (maybe without experience but I got no time to do this

相关标签:
5条回答
  • 2021-01-05 09:05

    yes you can run hive without hadoop 1.create your warehouse on your local system 2. give default fs as file:/// than you can run hive in local mode with out hadoop installation

    In Hive-site.xml

    <?xml version="1.0"?> <?xml-stylesheet type="text/xsl" href="configuration.xsl"?> 
    <configuration>
          <property>
             <name>hive.metastore.schema.verification</name> 
             <value>false</value> 
          </property> 
         <property> 
          <!-- this should eventually be deprecated since the metastore should supply this --> 
            <name>hive.metastore.warehouse.dir</name> 
            <value>file:///tmp</value>
            <description></description> 
         </property>
         <property> 
            <name>fs.default.name</name> 
            <value>file:///tmp</value> 
         </property> 
    </configuration>
    
    0 讨论(0)
  • 2021-01-05 09:11

    Update This answer is out-of-date : with Hive on Spark it is no longer necessary to have hdfs support.


    Hive requires hdfs and map/reduce so you will need them. The other answer has some merit in the sense of recommending a simple / pre-configured means of getting all of the components there for you.

    But the gist of it is: hive needs hadoop and m/r so in some degree you will need to deal with it.

    0 讨论(0)
  • 2021-01-05 09:13

    If you are just talking about experiencing Hive before making a decision you can just use a preconfigured VM as @Maltram suggested (Hortonworks, Cloudera, IBM and others all offer such VMs)

    What you should keep in mind that you will not be able to use Hive in production without Hadoop and HDFS so if it is a problem for you, you should consider alternatives to Hive

    0 讨论(0)
  • 2021-01-05 09:17

    You cant, just download Hive, and run:

    ./bin/hiveserver2                                                                                                                                        
    Cannot find hadoop installation: $HADOOP_HOME or $HADOOP_PREFIX must be set or hadoop must be in the path
    

    Hadoop is like a core, and Hive need some library from it.

    0 讨论(0)
  • 2021-01-05 09:23

    Although, there are some details that you have to keep in mind it's completely normal to use Hive without HDFS. There are a few details one should keep in mind.

    1. As a few commenters mentioned above you'll still need some .jar files from hadoop common.
    2. As of today(XII 2020) it's difficult to run Hive/hadoop3 pair. Use stable hadoop2 with Hive2.
    3. Make sure POSIX permissions are set correctly, so your local hive can access warehouse and eventually derby database location.
    4. Initialize your database by manual call to schematool

    You can use site.xml file pointing to local POSIX filesystem, but you can also set those options in HIVE_OPTS environmen variable. I covered that with examples of errors I've seen on my blog post

    0 讨论(0)
提交回复
热议问题