问题
I need to calculate some rolling forward averages in a dataframe and really don't know where to start.
I know if I wanted to select a cell 10 days ahead say I would do df.shift(-10)
, but what I'm looking to do is calculate the average between 10 and 15 days ahead say.
So what I'm kind of thinking is df.rolling(-10,-15).mean()
, if I was trying to calculate just a moving average going backing in time df.rolling(15, 10).mean() would work perfectly and I did think about just calculating the averages like that, and then somehow shifting the data.
Any help would be great
Many thanks
回答1:
You could calculate the rolling mean 5 days ahead, and then shift that for 10 more periods. Since negative values in rolling are not allowed, you can invert the axis, calculate backwards, and then invert again (see How to use Pandas rolling_* functions on a forward-looking basis):
df = pd.DataFrame(np.random.rand(100, 2))
df[::-1].rolling(5).mean()[::-1].shift(-10)
回答2:
The above answer doesn't look right. IMHO you musn't reverse and shift.
df = pd.DataFrame(np.random.randint(low=0, high=10, size=(100, 2))) # int easier to interpret
df[::-1].rolling(window=5, min_periods=1).mean()[::-1]
this also works but you lose the last 4 values:
df.rolling(window=5, min_periods=1).mean().shift(-5)
The more difficult problem of a rolling window that is arbitrarily shifted (offset) probably needs to use .shift() in some way.
来源:https://stackoverflow.com/questions/55758035/calculating-rolling-forward-averages-with-pandas