Why does the quantization after pruning result in worse performance?

前端 未结 0 2009
余生分开走
余生分开走 2021-01-14 09:03

I quantized resent18 model of mine, using graph quantization which lead to a forward pass of about 100 ms (on my cpu) and a size reduction of 40MBs (from initial 85 to 45 MB

相关标签:
回答
  • 消灭零回复
提交回复
热议问题