What does the shuffling phase actually do?

终归单人心 2021-01-14 23:23


Possibility - A

As shuffling is the process of bringing the mapper o/p to the reducer o/p, it just brings t

1 answer
  • 2021-01-15 00:10

    Mappers and Reducers are not separate machines, just separate code. Both the mapping code and the reducing code run on the same set of machines in the cluster.


    So, after all machines in the cluster have run the Mapper code, the results are:

    1. Binned locally on the node (Consider it a "local-grouping"); and,
    2. Shuffled/Redistributed across all nodes on the cluster.

    Consider step 2 a "global-grouping", because it is done in such a way that all values belonging to one key go to the single node assigned to that key.

    Now, each node runs the Reducer code on the (key, value) pairs it holds after the shuffle.
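
The flow described above can be sketched as a small single-process simulation. This is only an illustration under stated assumptions, not how Hadoop itself is implemented: the "nodes" are plain Python lists, the job is a word count, and the names `map_words`, `local_group`, `assigned_node`, `shuffle`, `reduce_counts` and `run_job` are hypothetical helpers. The key-to-node assignment uses a CRC32 hash modulo the node count, which is an assumption chosen because it is deterministic across runs.

```python
from collections import defaultdict
import zlib


def map_words(line):
    """Mapper (hypothetical word-count job): emit (word, 1) for each word."""
    return [(word, 1) for word in line.split()]


def local_group(pairs):
    """Step 1, "local-grouping": bin the mapper output by key on one node."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return dict(grouped)


def assigned_node(key, num_nodes):
    """Pick the single node responsible for a key (assumed hash partitioning)."""
    return zlib.crc32(key.encode("utf-8")) % num_nodes


def shuffle(per_node_groups, num_nodes):
    """Step 2, "global-grouping": send every key's values to its assigned node."""
    inboxes = [defaultdict(list) for _ in range(num_nodes)]
    for groups in per_node_groups:
        for key, values in groups.items():
            inboxes[assigned_node(key, num_nodes)][key].extend(values)
    return [dict(inbox) for inbox in inboxes]


def reduce_counts(inbox):
    """Reducer: sum the values that arrived for each key on this node."""
    return {key: sum(values) for key, values in inbox.items()}


def run_job(node_inputs):
    """Run map, local grouping, shuffle and reduce over simulated nodes."""
    num_nodes = len(node_inputs)
    per_node_groups = [
        local_group(pair for line in lines for pair in map_words(line))
        for lines in node_inputs
    ]
    inboxes = shuffle(per_node_groups, num_nodes)
    return [reduce_counts(inbox) for inbox in inboxes]


# Three simulated nodes, each starting with its own slice of the input.
node_inputs = [["a b a"], ["b c"], ["a c c"]]
results = run_job(node_inputs)
```

Here `results` holds one dictionary per simulated node. Which node ends up with which word depends on the hash, but every word appears on exactly one node, and its count there is the total over the whole input.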
