I am currently building a BiLSTM model with attention in TensorFlow 2. However, when I train this model, I run into a vanishing gradient problem.
Here is the code of th