How to use tf.nn.ctc_loss in a CNN+CTC network


时光取名叫无心 2021-01-16 14:53

Recently, I have been trying to use TensorFlow to implement a CNN+CTC network based on the paper Towards End-to-End Speech Recognition with Deep Convolutional Neural Networks.

1 answer

星月不相逢 (original poster)

2021-01-16 15:42
The fully connected layer should be applied per time step, just as the same dense layer is applied at every time step in a recurrent neural network. For the output of a convolution layer, the time axis is the width.

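So, for example, the output shapes would be:
1. convolution: (10, 120, 155, 3) = (batch, height, width, channels)
2. flatten: (10, 155, 120*3) = (batch, max_time, features), after moving width ahead of height so that it becomes the time axis
3. fully connected: (10, 155, 1024) (the same dense layer applied at every time step)
4. output layer: (10, 155, num_classes)
This batch-major shape, (batch, max_time, num_classes), is what tf.nn.ctc_loss expects when time_major=False (logits_time_major=False in TensorFlow 2); with the default time-major setting it expects (max_time, batch, num_classes) instead.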
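The shape walk-through above can be checked with a small NumPy sketch. It is only an illustration of the axis handling, not the original poster's network: the random weights, the ReLU activation and num_classes = 29 are assumptions made for the example. In TensorFlow the same steps would typically be a transpose, a reshape and two dense layers.

```python
import numpy as np

# Shapes taken from the answer; num_classes = 29 is a hypothetical value
# (for instance 28 output symbols plus one CTC blank).
batch, height, width, channels = 10, 120, 155, 3
hidden, num_classes = 1024, 29

rng = np.random.default_rng(0)
conv_out = rng.standard_normal((batch, height, width, channels)).astype(np.float32)

# 1 -> 2: bring width to axis 1 first, then merge height and channels.
# Reshaping without the transpose would mix values from different columns.
width_as_time = conv_out.transpose(0, 2, 1, 3)                 # (10, 155, 120, 3)
flat = width_as_time.reshape(batch, width, height * channels)  # (10, 155, 360)

# 2 -> 3: one weight matrix shared by every time step.
# The matrix product is applied independently to each (batch, time) position.
w1 = (rng.standard_normal((height * channels, hidden)) * 0.01).astype(np.float32)
b1 = np.zeros(hidden, dtype=np.float32)
fc = np.maximum(flat @ w1 + b1, 0.0)                           # (10, 155, 1024)

# 3 -> 4: per-time-step class scores (logits), batch-major.
w2 = (rng.standard_normal((hidden, num_classes)) * 0.01).astype(np.float32)
b2 = np.zeros(num_classes, dtype=np.float32)
logits = fc @ w2 + b2                                          # (10, 155, num_classes)

print(flat.shape, fc.shape, logits.shape)
```

Each row flat[b, t] holds exactly the features of image column t, which is why the width axis can serve as max_time for the CTC loss.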