I am trying to use the keras.layers.Attention layer for channeled data of dimensions (batch_size, seq_len, num_features, num_channels) = (None, 200, 8, 2).
(batch_size, seq_len, num_features, num_channels) = (None, 200, 8, 2)