When I do a simple aggregation avg and max on a large input dataframe, it seems to give starkly different results depnding on the data source. The orig
avg
max