keep_prob in TensorFlow MNIST tutorial


I can't understand the following code in the Deep MNIST for Experts tutorial.

train_step.run(feed_dict={x: batch[0], y_: batch[1], keep_prob: 0.5})
1 Answer

醉话见心 · 2021-02-04 13:58

The keep_prob value controls the dropout applied while training the neural network. Concretely, each unit in the layer that dropout is applied to (here, the last densely connected layer, just before the readout layer) is kept with probability 0.5 during training and zeroed out otherwise; the surviving activations are scaled up by 1/keep_prob so the expected output stays the same. This reduces overfitting. For the theory behind dropout, see the original paper by Srivastava et al.; to see how to use it in TensorFlow, see the documentation for the tf.nn.dropout() operator.
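For illustration, here is a minimal sketch of how the dropout op is wired into the graph (TF 1.x API; the name h_fc1 and the 1024-unit shape follow the tutorial, but treat them as assumptions):

import tensorflow as tf  # assumes TensorFlow 1.x, as in the tutorial

# Stand-in for the tutorial's dense-layer activations.
h_fc1 = tf.placeholder(tf.float32, shape=[None, 1024])

# keep_prob is itself a placeholder, so its value can be chosen per run() call.
keep_prob = tf.placeholder(tf.float32)

# Each element of h_fc1 is kept with probability keep_prob (and scaled up
# by 1/keep_prob); dropped elements are set to zero.
h_fc1_drop = tf.nn.dropout(h_fc1, keep_prob)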

The keep_prob value is fed in via a placeholder so that the same graph can be used for training (with keep_prob = 0.5) and evaluation (with keep_prob = 1.0). An alternative way to handle these cases is to build different graphs for training and evaluation: see the use of dropout in the current convolutional.py model for an example.
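A sketch of the two modes with one graph (the names x, y_, batch, train_step, and accuracy are assumed to be defined as in the tutorial's code):

# Training step: dropout is active; each dense-layer unit survives with
# probability 0.5 on this run.
train_step.run(feed_dict={x: batch[0], y_: batch[1], keep_prob: 0.5})

# Evaluation: keep_prob = 1.0 keeps every unit, effectively disabling
# dropout, so the full network is used deterministically.
acc = accuracy.eval(feed_dict={x: batch[0], y_: batch[1], keep_prob: 1.0})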
