TensorFlow: InternalError: Blas SGEMM launch failed

后端 未结 16 2296
清酒与你
清酒与你 2020-12-04 15:13

When I run sess.run(train_step, feed_dict={x: batch_xs, y_: batch_ys}) I get InternalError: Blas SGEMM launch failed. Here is the full error and st

相关标签:
16条回答
  • 2020-12-04 15:58

    I encountered this error when running Keras CuDNN tests in parallel with pytest-xdist. The solution was to run them serially.

    0 讨论(0)
  • 2020-12-04 15:59

    For me, I got this error when using Keras, and Tensorflow was the the backend. It was because the deep learning environment in Anaconda was not activated properly, as a result, Tensorflow didn't kick in properly either. I noticed this since the last time I activated my deep learning environment (which is called dl), the prompt changed in my Anaconda Prompt to this:

    (dl) C:\Users\georg\Anaconda3\envs\dl\etc\conda\activate.d>set "KERAS_BACKEND=tensorflow"

    While it only had the dl before then. Therefore, what I did to get rid of the above error was to close my jupyter notebook and Anaconda prompt, then relaunch, for several times.

    0 讨论(0)
  • 2020-12-04 15:59

    I encountered this error after changing OS to Windows 10 recently, and I never encountered this before when using windows 7.

    The error occurs if I load my GPU Tensorflow model when an another GPU program is running; it's my JCuda model loaded as socket server, which is not large. If I close my other GPU program(s), this Tensorflow model can be loaded very successfully.

    This JCuda program is not large at all, just around 70M, and in comparison this Tensorflow model is more than 500M and much larger. But I am using 1080 ti, which has much memory. So it would be probably not an out-of-memory progblem, and it would perhaps be some tricky internal issue of Tensorflow regarding OS or Cuda. (PS: I am using Cuda version 8.0.44 and haven't downloaded a newer version.)

    0 讨论(0)
  • 2020-12-04 16:01

    I got this error when running Tensorflow Distributed. Did you check if any of the workers were reporting CUDA_OUT_OF_MEMORY errors? If this is the case it may have to do with where you place your weight and bias variables. E.g.

    with tf.device("/job:paramserver/task:0/cpu:0"):
       W = weight_variable([input_units, num_hidden_units])       
       b = bias_variable([num_hidden_units])             
    
    0 讨论(0)
提交回复
热议问题