The reduce phase of the job fails with:
The reason why each task fails is:
Task attempt_201301251556_163
The reason for the timeouts might be a long-running computation in your reducer without reporting the progress back to the Hadoop framework. This can be resolved using different approaches:
I. Increasing the timeout in mapred-site.xml
:
mapred.task.timeout
1200000
The default is 600000 ms = 600 seconds
.
II. Reporting progress every x records as in the Reducer example in javadoc:
public void reduce(K key, Iterator values,
OutputCollector output,
Reporter reporter) throws IOException {
// report progress
if ((noValues%10) == 0) {
reporter.progress();
}
// ...
}
optionally you can increment a custom counter as in the example:
reporter.incrCounter(NUM_RECORDS, 1);