I have data like:
df.columns
[ \'entity_id\', \'gross_sales\'] # etc..., those are the relevant ones
and also correlated data abo