To implement attention in an encoder-decoder model, we take the hidden vector of an LSTM unit of the decoder and perform several operations on it, together with the encoder outputs, to compute the attention weights.
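
As a minimal sketch of those operations, the snippet below computes attention weights from the decoder's current hidden state and the encoder outputs using Bahdanau-style additive scoring; the framework (PyTorch), the class name `AdditiveAttention`, and the dimension arguments are illustrative assumptions, not the article's specific implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class AdditiveAttention(nn.Module):
    """Additive (Bahdanau-style) attention: one common way to get the weights."""
    def __init__(self, enc_dim, dec_dim, attn_dim):
        super().__init__()
        self.W_enc = nn.Linear(enc_dim, attn_dim, bias=False)  # projects encoder outputs
        self.W_dec = nn.Linear(dec_dim, attn_dim, bias=False)  # projects decoder hidden state
        self.v = nn.Linear(attn_dim, 1, bias=False)            # scores each source position

    def forward(self, dec_hidden, enc_outputs):
        # dec_hidden:  (batch, dec_dim)          current decoder LSTM hidden state
        # enc_outputs: (batch, src_len, enc_dim) encoder LSTM output at every source step
        scores = self.v(torch.tanh(
            self.W_enc(enc_outputs) + self.W_dec(dec_hidden).unsqueeze(1)
        )).squeeze(-1)                            # (batch, src_len) unnormalised scores
        weights = F.softmax(scores, dim=-1)       # attention weights, sum to 1 over source
        context = torch.bmm(weights.unsqueeze(1), enc_outputs).squeeze(1)  # weighted sum
        return context, weights
```

The resulting weights say how much each source position should contribute, and the context vector (the weighted sum of encoder outputs) is then fed back into the decoder at that step.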