What Are the Pros and Cons of Running a Job in Hadoop Using Various Languages?

时光取名叫无心 2021-02-06 10:37

So far I've used only Pig or Java MapReduce to run jobs against a Hadoop cluster. I recently tried Python MapReduce through the H

3 Answers
  •  孤独总比滥情好
    2021-02-06 11:17

    Scala is one option: it lets you write much more concise code for your jobs. For an example, see Scoobi: https://github.com/NICTA/scoobi

    C++ may be worth considering for tasks that are especially memory- or CPU-intensive. Hypertable explains why they chose C++ over Java here: http://code.google.com/p/hypertable/wiki/WhyWeChoseCppOverJava

    Java can also be problematic on the serialization side, because Java Serialization creates a new object for every object it reads from an input stream. Be careful not to default to Java Serialization just because your implementation is in Java.
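
    To make the serialization point concrete, here is a rough sketch using only the JDK. The Point class and the SerializationSketch helpers are hypothetical names invented for illustration; the write/readFields pair merely imitates the shape of Hadoop's Writable interface and is not Hadoop's own code. It writes the same records once as raw fields and once with Java Serialization, then reads the raw form back into a single reused object.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.io.UncheckedIOException;

class SerializationSketch {

    // Hypothetical record type. write/readFields imitate the shape of
    // Hadoop's Writable, but this class does not depend on Hadoop.
    static class Point implements Serializable {
        private static final long serialVersionUID = 1L;
        int x;
        int y;

        void write(DataOutput out) throws IOException {
            out.writeInt(x);
            out.writeInt(y);
        }

        // Overwrites this instance's fields instead of allocating a new object.
        void readFields(DataInput in) throws IOException {
            x = in.readInt();
            y = in.readInt();
        }
    }

    // Writes n points as raw fields: two 4-byte ints per record.
    public static byte[] compactBytes(int n) {
        try {
            ByteArrayOutputStream buf = new ByteArrayOutputStream();
            DataOutputStream out = new DataOutputStream(buf);
            Point p = new Point();
            for (int i = 0; i < n; i++) {
                p.x = i;
                p.y = -i;
                p.write(out);
            }
            out.flush();
            return buf.toByteArray();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Writes the same n points with Java Serialization for comparison.
    public static byte[] javaSerializedBytes(int n) {
        try {
            ByteArrayOutputStream buf = new ByteArrayOutputStream();
            try (ObjectOutputStream out = new ObjectOutputStream(buf)) {
                for (int i = 0; i < n; i++) {
                    Point p = new Point();
                    p.x = i;
                    p.y = -i;
                    out.writeObject(p);
                }
            }
            return buf.toByteArray();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Reads n records back into one reused Point and sums the x fields.
    public static long sumX(byte[] data, int n) {
        try {
            DataInputStream in = new DataInputStream(new ByteArrayInputStream(data));
            Point reused = new Point();
            long sum = 0;
            for (int i = 0; i < n; i++) {
                reused.readFields(in);
                sum += reused.x;
            }
            return sum;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        int n = 1000;
        byte[] compact = compactBytes(n);
        System.out.println("raw field bytes: " + compact.length);
        System.out.println("Java Serialization bytes: " + javaSerializedBytes(n).length);
        System.out.println("sum of x: " + sumX(compact, n));
    }
}
```

    The reading loop allocates one Point no matter how many records it processes, whereas ObjectInputStream.readObject would hand back a fresh object per record. How much this matters for a real job depends on record size and volume, so treat the sketch as an illustration rather than a benchmark.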
