I have a problem that would be solved by Hadoop Streaming in "typedbytes" or "rawbytes" mode, which allow one to analyze binary data in a language other than Java. (Without
We solved the binary data issue by hex-encoding the data at the split level, as it is streamed down to the Mapper. That way the encoding runs in parallel across the splits as part of the job, instead of as a separate transformation pass over your data before any processing starts on a node.
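For the Mapper side, here is a minimal Python sketch of what consuming hex-encoded records could look like. It assumes each input line carries exactly one hex-encoded binary record. The function names (`encode_record`, `decode_line`, `run_mapper`) and the emitted key/value are hypothetical placeholders, not what we actually ran. The split-level encoder itself is not shown.

```python
import sys


def encode_record(record: bytes) -> str:
    """Hex-encode one binary record so it contains no tab or newline bytes."""
    return record.hex()


def decode_line(line: str) -> bytes:
    """Recover the original bytes from one hex-encoded input line."""
    return bytes.fromhex(line.strip())


def run_mapper(lines, out):
    """Decode each hex line and emit '<key>\\t<value>' for Hadoop Streaming.

    Assumption: the key (first byte of the record) and the value (record
    length) are stand-ins for whatever analysis the real job performs.
    """
    for line in lines:
        if not line.strip():
            continue  # ignore blank lines
        record = decode_line(line)
        out.write(f"{record[0]}\t{len(record)}\n")


# In an actual streaming mapper script you would finish with:
#     run_mapper(sys.stdin, sys.stdout)
```

Hex encoding doubles the size of the data on the wire (two characters per byte), but the records no longer contain raw tab or newline bytes, so the default line-oriented text streaming can carry them safely.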