I wrote a mapreduce job to extract some info from a dataset. The dataset is users\' rating about movies. The number of users is about 250K and the number of movies is about 300k
In newer versions, the name of the parameter has been changed to mapreduce.task.timeout
as described in this link (search for task.timeout
). In addition, you can also disable this timeout as described in the above link:
The number of milliseconds before a task will be terminated if it neither reads an input, writes an output, nor updates its status string. A value of 0 disables the timeout.
Below is an example setting in the mapred-site.xml
:
mapreduce.task.timeout
0