It looks like I am again stuck on running a packaged Spark app jar using spark-submit. The following is my pom file:
The dependencies between Hadoop and the AWS SDK are very sensitive, and you should stick to the exact versions that your Hadoop dependency was built with.
The first problem you need to solve is to pick one version of Hadoop. I see you're mixing versions 2.8.3 and 2.8.0.
When I look at the dependency tree for org.apache.hadoop:hadoop-aws:2.8.0, I see that it is built against version 1.10.6 of the AWS SDK (same for hadoop-aws:2.8.3).
This is probably what's causing the mismatches (you're mixing incompatible versions). So: include hadoop-aws with the version compatible with your Hadoop.
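As a rough illustration of the advice above, the relevant part of a pom.xml might look like the sketch below. It pins a single Hadoop version in a property and reuses it for every Hadoop artifact. The hadoop.version property name, the choice of 2.8.3 and the presence of hadoop-client are assumptions made for the example, since the original pom is not shown here; adapt them to the artifacts you actually use.

```xml
<!-- Sketch only: 2.8.3 is an example choice, and hadoop-client is assumed
     to be one of the Hadoop artifacts your project already depends on. -->
<properties>
  <!-- Single source of truth for the Hadoop version -->
  <hadoop.version>2.8.3</hadoop.version>
</properties>

<dependencies>
  <dependency>
    <groupId>org.apache.hadoop</groupId>
    <artifactId>hadoop-client</artifactId>
    <version>${hadoop.version}</version>
  </dependency>
  <dependency>
    <groupId>org.apache.hadoop</groupId>
    <artifactId>hadoop-aws</artifactId>
    <version>${hadoop.version}</version>
  </dependency>
  <!-- No explicit com.amazonaws dependency here: hadoop-aws brings in the
       AWS SDK version it was built against (1.10.6 for 2.8.x) transitively. -->
</dependencies>
```

After a change like this, running mvn dependency:tree -Dincludes=com.amazonaws should show whether only one AWS SDK version is left on the classpath.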