I have recently been learning deep learning, and a friend recommended Caffe. After installing it with OpenBLAS, I followed the MNIST tutorial in the docs. But later I found it was running very slowly, apparently using only a single CPU core.
@Karthik: That also works for me. One interesting discovery I made is that using 4 threads roughly halves the forward/backward pass time in the caffe timing test. However, increasing the thread count to 8 or even 24 gives forward/backward times slower than with OPENBLAS_NUM_THREADS=4. Here are the times for a few thread counts, tested on the Network-in-Network model (a minimal repro command is sketched after the table):
Threads | f/b time (ms)
------- | -------------
   1    |     223
   2    |     150
   4    |     113
   8    |     125
  12    |     144
For comparison, on a Titan X GPU the f/b pass took 1.87 ms.
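If anyone wants to reproduce this, here is a minimal sketch of the timing sweep I mean. It assumes a standard Caffe build under ./build and a model definition at models/nin/train_val.prototxt; both paths are placeholders for your own checkout and model, not something from this thread.

    # Placeholder paths: adjust to your own Caffe build and model prototxt.
    # OPENBLAS_NUM_THREADS controls how many threads OpenBLAS uses for BLAS calls,
    # which is what dominates Caffe's CPU forward/backward time.
    for t in 1 2 4 8 12; do
      echo "== OPENBLAS_NUM_THREADS=$t =="
      OPENBLAS_NUM_THREADS=$t ./build/tools/caffe time \
        --model=models/nin/train_val.prototxt \
        --iterations=50
    done

Watching htop while this runs is a quick way to confirm that the extra cores are actually being used at each thread count.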