Mp3 / Wav to Text

爱⌒轻易说出口 提交于 2019-12-11 17:53:11

问题


I currently have a mobile application that can record speech as either a WAV or MP3 and would like to convert it to text. I have looked around - Microsoft Speech, UCMA, etc -- but haven't seen any good examples of how to do it. Can someone help out here?

FYI - We have access to MS Lync.

Look forward to any responses, James


回答1:


There's a sample of using the UCMA 3.0 SDK to perform speech recognition, available here. However, from experience (and I've love to be proved wrong here) you can only recognise a specific response from a pre-determined set of options. So, you can ask "How was your day" and tell UCMA that valid responses are "Good", "Fair", "Bad" and it will then attempt to match any of those. I've not seen it be able to 'just' do free-speech conversion. (a much harder problem)

This might be something the Microsoft Speech API can do better. Try this post for a head start.



来源:https://stackoverflow.com/questions/11123588/mp3-wav-to-text

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!