I have a problem executing MapReduce Python files on Hadoop using the Hadoop streaming JAR.
My setup: Windows 10 64-bit, Python 3.6, Spyder 3.2.6 as my IDE, and Hadoop 2.3.
Solution: I removed the line
#!/usr/bin/python3
from the top of my Python code. I also changed my command to:
D:\hadoop\bin\hadoop jar
D:\hadoop\share\hadoop\tools\lib\hadoop-streaming-2.7.2.jar
-file /in/wordcount-mapper.py -mapper "python wordcount-mapper.py"
-file /in/wordcount-reducer.py -reducer "python wordcount-reducer.py"
-input /in/mahsa.txt -output /output
With that, the job ran and I could view the result:
hadoop fs -cat /output/part-00000
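
For anyone who wants to see what such scripts can look like, here is a minimal sketch of word-count logic for Hadoop streaming. It is not my actual wordcount-mapper.py or wordcount-reducer.py; the function names map_lines and reduce_records are made up for illustration, and both steps are shown in one file only to keep the example short. Note that there is no #!/usr/bin/python3 line at the top, since the command above starts each script with "python" explicitly.

```python
# Hypothetical sketch, not the original scripts from this post.
# No shebang line: the streaming job runs each script as "python <script>".
from itertools import groupby


def map_lines(lines):
    """Mapper step: yield one "word<TAB>1" record per whitespace-separated word."""
    for line in lines:
        for word in line.split():
            yield f"{word}\t1"


def reduce_records(records):
    """Reducer step: sum the counts for each word.

    Assumes records arrive sorted by key, which is how Hadoop streaming
    delivers mapper output to a reducer.
    """
    pairs = (record.rstrip("\n").split("\t", 1) for record in records)
    for word, group in groupby(pairs, key=lambda pair: pair[0]):
        total = sum(int(count) for _, count in group)
        yield f"{word}\t{total}"


# In a real mapper script you would feed standard input through the function:
#     import sys
#     for record in map_lines(sys.stdin):
#         print(record)
# and do the same with reduce_records in the reducer script.
```

You can try the logic locally before submitting a job by passing a list of lines to map_lines, sorting the result, and passing it to reduce_records.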