How to disable sentence-level auto correction in Google Cloud Speech-to-Text API

问题

I am working on a speech recognition task, which involves the detection of children's speaking capability, improvement over time...

I'd like to use the Google Cloud Speech to Text API for the ASR part of the detection. Then I would use the transcripts of different measurements to estimate the advancement.

But! The sentence level autocorrect of Google Speech API consistently rewrites the previous limb of the spoken sentence...

Is there a way to disable the autocorrect of this ASR?

I can't bypass this problem with the "speechContext", "single_utterance" or "maxAlternatives" options.

"single_utterance" may work with words, but it corrects the misspells..

Any advice in this field?

回答1:

If you use streaming instead of batch recognize, you should receive an answer as soon as that part of the audio is transcribed, it does not wait for the rest of the sentence. You should then just store the first answer provided by the stream, not the further corrections.

This means that you don't have to wait until isFinal=True.

For a quick and dirty example of what I mean, go tho the speech API page, and run the streaming test with the developer tools open. There you'll see the streaming data received as the words are being spoken:

来源：https://stackoverflow.com/questions/48932010/how-to-disable-sentence-level-auto-correction-in-google-cloud-speech-to-text-api

标签

google-cloud-platform

speech-recognition

speech-to-text

google-speech-api

易学教程内所有资源均来自网络或用户发布的内容，如有违反法律规定的内容欢迎反馈！
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!