Trying to run Spark on EMR using the AWS SDK for Java, but it skips the remote JAR stored on S3
问题 I'm trying to run Spark on EMR using the SDK for Java, but I'm having issues getting the spark-submit to use a JAR that I have stored on S3. Here is the relevant code: public String launchCluster() throws Exception { StepFactory stepFactory = new StepFactory(); // Creates a cluster flow step for debugging StepConfig enableDebugging = new StepConfig().withName("Enable debugging") .withActionOnFailure("TERMINATE_JOB_FLOW") .withHadoopJarStep(stepFactory.newEnableDebuggingStep()); // Here is the