Using Pyspark - I have a dataframe with with a Name column.
| Name | | --- | | 4013 | | 4013 | | 4013 | | 4013 | | 4013 | | 4019 | | 4019 | | 4010 | | 4010 | |