I have an rdd with different fields namely a, b, c, d. I would like to filter on one of the field which has duplicate values in it. For example
inputRdd = [(1,2,3