Experience with Hadoop?

前端 未结 9 1907
傲寒
傲寒 2021-02-08 21:49

Have any of you tried Hadoop? Can it be used without the distributed filesystem that goes with it, in a Share-nothing architecture? Would that make sense?

I\'m also inte

9条回答
  •  别跟我提以往
    2021-02-08 22:23

    As Joe said, you can indeed use Hadoop without HDFS. However, throughput depends on the cluster's ability to do computation near where data is stored. Using HDFS has 2 main benefits IMHO 1) computation is spread more evenly across the cluster (reducing the amount of inter-node communication) and 2) the cluster as a whole is more resistant to failure due to data unavailability.

    If your data is already partitioned or trivially partitionable, you may want to look into supplying your own partitioning function for your map-reduce task.

提交回复
热议问题