How to use tf.nn.ctc_loss in cnn+ctc network

前端 未结 1 1603
时光取名叫无心
时光取名叫无心 2021-01-16 14:53

Recently, I try to use tensorflow to implement a cnn+ctc network base on the article Towards End-to-End Speech Recognition with Deep Convolutional Neural Networks.

I

1条回答
  •  星月不相逢
    2021-01-16 15:42

    The fully connected layer should be applied per time step. It's like applying same dense layer per time step in recurrent neural network. For output of convolution layer, time step is width.

    So for example, output shape would be:

    1. convolution: (10,120,155,3) = (batch, height, width, channels)
    2. flatten: (10, 155, 120*3) = (batch, max_time, features)
    3. fully connected: (10, 155, 1024), (same dense layer applied per time step)
    4. (10, 155, num_classes)

    It is expected shape for ctc_loss in tensorflow.

    0 讨论(0)
提交回复
热议问题