Periodical loss increase in the learning curve

后端 未结 0 613
臣服心动
臣服心动 2020-11-30 22:18

I am training a transformers-based machine translation (NMT) model.

The size of the parallel corpus is 4.5 million sentence pairs in two languages. What I am observin

相关标签:
回答
  • 消灭零回复
提交回复
热议问题