sphinx4

Dictation Application using Sphinx4

醉酒当歌 提交于 2019-11-27 02:31:46
问题 My requirements are similar to this question since the question is now 3 years old I am re-posting the question with information specific to mine, I want to create an application which takes a .wav (or any other standard audio file format) and converts it to text. For Speech Recognition I have decided to use sphinx4, I am trying to enhance the Transcriber demo provided with sphinx. Its good but That only works for a specific Grammar (written in .gram and .gxml files). EDIT To be able to use

Build NEW Acoustic model, Dictionary , Language model for uncommon language speech recognition

半世苍凉 提交于 2019-11-26 14:10:06
问题 I want to build NEW Acoustic model ,New Dictionary ,New Language model for " Sinhala Language speech recognition " Sinhala language Characters are Unicode based. for an example A=අ,I=ඉ,U=උ,KA=ක,BA=බ. I did go through CMUSphinx Tutorial For Developers. But it did not help me. It works for English language. Language model should be ARPA model. and How can I map Sinhala Unicode with English phonemes and how to train Language model with Different voices. Is there any tool available for generate