I want to use spark to do a 2-stage job. pseudo-code like this.
# aggregate job line = sc.textFile(input_file); agg_result = lines.aggregate(initialValue, add