I have a DataFrame which contains a lot of repeated values. An aggregated, distinct count of it looks like below
> df.groupby(\'fruits\').count().sort(F.de