Predicting a multiple forward time step of a time series using LSTM

后端 未结 3 1440
栀梦
栀梦 2020-12-07 23:03

I want to predict certain values that are weekly predictable (low SNR). I need to predict the whole time series of a year formed by the weeks of the year (52 values - Figure

相关标签:
3条回答
  • 2020-12-07 23:38

    I'd like to add to this question

    I added a TimeDistributed layer. My thought was that the algorithm will predict the values as a time series instead of isolated values (am I correct?)

    as I myself had quite a hard time understanding the functionality behind the Keras TimeDistributed layer.

    I'd argue that your motivation is right to not isolate the calculations for a Time Series Prediction. You specifically do wan't to get the characteristics and interdependencies of the whole series thrown together, when predicting its future shape.

    However, that's exactly the opposite of what the TimeDistributed Layer is for. It is for isolating the calculations on each timestep. Why is this useful, you might ask? For completely different tasks, e.g. sequence labelling where you have a sequential input (i1, i2, i3, ..., i_n) and aim at outputting the labels (label1, label2, label1, ..., label2) for each timestep separately.

    Imho the best explanation can be found in this post and in the Keras Documentation.

    For this reason, I'd claim, that against all intuition adding a TimeDistributed layer is likely never a good idea for Time Series Prediction. Open and happy to hear other opinions about that!

    0 讨论(0)
  • 2020-12-07 23:43

    I have data from 10 years. If my training dataset are: values from 4 weeks to predict the 5th and I keep shifting, I can have almost 52 X 9 examples to train the model and 52 to predict (last year)

    This actually means you have only 9 training examples with 52 features each (unless you want to train on highly overlapping input data). Either way, I don't think this is nearly enough to merit training an LSTM.

    I would suggest trying a much simpler model. Your input and output data is of fixed size, so you could try sklearn.linear_model.LinearRegression which handles multiple input features (in your case 52) per training example, and multiple targets (also 52).

    Update: If you must use an LSTM then take a look at LSTM Neural Network for Time Series Prediction, a Keras LSTM implementation which supports multiple future predictions all at once or iteratively by feeding each prediction back in as input. Based on your comments this should be exactly what you want.

    The architecture of the network in this implementation is:

    model = Sequential()
    
    model.add(LSTM(
        input_shape=(layers[1], layers[0]),
        output_dim=layers[1],
        return_sequences=True))
    model.add(Dropout(0.2))
    
    model.add(LSTM(
        layers[2],
        return_sequences=False))
    model.add(Dropout(0.2))
    
    model.add(Dense(
        output_dim=layers[3]))
    model.add(Activation("linear"))
    

    However, I would still recommend running a linear regression or maybe a simple feed forward network with one hidden layer and comparing accuracy with the LSTM. Especially if you are predicting one output at a time and feeding it back in as input your errors could easily accumulate giving you very bad predictions further on.

    0 讨论(0)
  • 2020-12-07 23:57

    Sharing the same concerns about having too little data, you can do that like this.

    First, it's a good idea to keep your values between -1 and +1, so I'd normalize them first.

    For the LSTM model, you must make sure you're using return_sequences=True.
    There is nothing "wrong" with your model, but it may need more or less layers or units to achieve what you desire. (There is no clear answer to this, though).

    Training the model to predict the next step:

    All you need is to pass Y as a shifted X:

    entireData = arrayWithShape((samples,52,1))
    X = entireData[:,:-1,:]
    y = entireData[:,1:,:]
    

    Train the model using these.

    Predicting the future:

    Now, for predicting the future, since we need to use predicted elements as input for more predicted elements, we are going to use a loop and make the model stateful=True.

    Create a model equal to the previous one, with these changes:

    • All LSTM layers must have stateful=True
    • The batch input shape must be (batch_size,None, 1) - This allows variable lengths

    Copy the weights of the previously trained model:

    newModel.set_weights(oldModel.get_weights())
    

    Predict only one sample at a time and never forget to call model.reset_states() before starting any sequence.

    First predict with the sequence you already know (this will make sure the model prepares its states properly for predicting the future)

    model.reset_states()
    predictions = model.predict(entireData)
    

    By the way we trained, the last step in predictions will be the first future element:

    futureElement = predictions[:,-1:,:]
    
    futureElements = []
    futureElements.append(futureElement)
    

    Now we make a loop where this element is the input. (Because of stateful, the model will understand it's a new input step of the previous sequence instead of a new sequence)

    for i in range(howManyPredictions):
        futureElement = model.predict(futureElement)
        futureElements.append(futureElement)
    

    This link contains a complete example predicting the future of two features: https://github.com/danmoller/TestRepo/blob/master/TestBookLSTM.ipynb

    0 讨论(0)
提交回复
热议问题