attention calculation at matrix level

死守一世寂寞 2021-01-30 16:04

I am confused about the calculation in self-attention, or in attention in general.

Let's talk about self-attention first. I have:

x -> [batch_size, query_len, em         


        