apark spark map does not parallel well

后端未结

关注

 0  720

I want to use spark to do a 2-stage job. pseudo-code like this.

# aggregate job
line = sc.textFile(input_file);
agg_result = lines.aggregate(initialValue, add


                      
              相关标签: