I have this dataset:
df = pd.DataFrame({\'scientist\':["Wendelaar Bonga"," Sjoerd E.", "Grätzel"," Michael", "Wil