Hadoop : how to start my first project

橙三吉。 提交于 2019-12-11 04:38:54

问题


I'm starting to work with Hadoop but I don't know where and how do it. I'm working on OS X and I follow some tutorial to install Hadoop, it's done and it's work but now I don't know what to do.

Is there an IDE to install (maybe eclipse)? I find some codes but nothing works and I don't know what I have to add in my project etc ...

Can you give me some informations or guide me to a complete tutorial ?


回答1:


If you want to learn Hadoop framework then i recomend to just start with installing Cloudera QuickStart virtual machine on your OSX system provided your system has all the prerequisites:

http://www.cloudera.com/downloads/quickstart_vms/5-8.html

Cloudera QuickStart virtual machines include everything you need to try Hadoop, MapReduce, Hive, Pig, Impala, etc. and Eclipse IDE as well.

Above will do perfect if you are interested in perusing career as Hadoop Developer however, if you are interested in Hadoop systems administrator then follow the @Alvaro recommendation.

Then there is a intro to Hadoop and MapReduce course on Udacity would be a good start for beginners: https://www.udacity.com/course/intro-to-hadoop-and-mapreduce--ud617

Hadoop: The Definitive Guide By Tom White could be a great comprehensive book to refer: http://shop.oreilly.com/product/0636920033448.do




回答2:


I would recommend you install the Cloudera pseudo distributed example on a virtual machine, the latest LTS Ubuntu. That way, you don't messed up with your laptop and it would be a environment closer to anything you would do in production. Have you checked vagrantup.com?

When you have it installed, you could choose on work directly on Java or chose a framework like MrJob (python) to execute some custom programs.

Best, Alvaro.



来源:https://stackoverflow.com/questions/42002420/hadoop-how-to-start-my-first-project

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!