Why is the number of combiner input records greater than the number of map output records?
Question

A Combiner runs after the Mapper and before the Reducer. It receives as input all data emitted by the Mapper instances on a given node, and then emits output to the Reducers. So the number of combiner input records should be less than the number of map output records.

12/08/29 13:38:49 INFO mapred.JobClient: Map-Reduce Framework
12/08/29 13:38:49 INFO mapred.JobClient: Reduce input groups=8649
12/08/29 13:38:49 INFO mapred.JobClient: Map output materialized bytes=306210
12/08/29 13:38:49 INFO mapred.JobClient:
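One possible explanation is that Hadoop does not guarantee the combiner runs exactly once: it may be invoked on each spill of the map-side buffer and again when the spill files are merged, and the combine input counter adds up the records seen in every pass. The toy simulation below is only a sketch of that idea, not Hadoop code. The names `combine`, `run_map_side` and `spill_size` are hypothetical, and the word-count data is made up for illustration.

```python
from collections import Counter


def combine(records, stats):
    """Sum values per key (word-count style) and tally records in and out."""
    stats["combine_input"] += len(records)
    totals = Counter()
    for key, value in records:
        totals[key] += value
    out = sorted(totals.items())
    stats["combine_output"] += len(out)
    return out


def run_map_side(map_output, spill_size):
    """Simulate one map task: combine each spill, then combine the merged spills."""
    stats = {"map_output": len(map_output), "combine_input": 0, "combine_output": 0}
    # First pass: the combiner sees every map output record once, spill by spill.
    spills = [
        combine(map_output[i:i + spill_size], stats)
        for i in range(0, len(map_output), spill_size)
    ]
    # Second pass (assumed here to always happen): the already-combined spill
    # records are merged and fed through the combiner again.
    merged = [record for spill in spills for record in spill]
    final = combine(merged, stats)
    return final, stats


words = ["a", "b", "a", "c", "b", "a", "c", "c", "a", "b", "a", "d"]
map_output = [(word, 1) for word in words]
final, stats = run_map_side(map_output, spill_size=4)
print(stats)
```

In this sketch the first pass alone already consumes all 12 map output records, so any additional pass pushes the combine input count above the map output count, even though the records that finally reach the reducers are fewer.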