speech-to-text

Speech to text conversion for non-english language

两盒软妹~` 提交于 2019-12-18 07:24:13
问题 I am trying to implement naive speech to text conversion for non-english language. where user speaks in other language and text is also in the same language .Is that possible ? If yes can anyone help with the idea of how to implement it ? Any Help would be greatly appreciated.Thanks in advance. 回答1: It is unlikely any commercial speech recognition solution will support Sanskrit, so the only choice you have is to add support for Sanskrit into open source engine like CMUSphinx. This is pretty

Google-speech-api transcribing spoken numbers incorrectly

守給你的承諾、 提交于 2019-12-18 05:08:48
问题 I started using google speech api to transcribe audio. The audio being transcribed contains many numbers spoken one after the other. E.g. 273 298 But the transcription comes back 270-3298 My guess is that it is interpreting it as some sort of phone number. What i want is unparsed output e.g. "two seventy three two ninety eight' which i can deal with and parse on my own. Is there a setting or support for this kind of thing? thanks 回答1: So I had this exact same problem and I think we found a

Does Anyone Uses Google Speech API in Production?

拟墨画扇 提交于 2019-12-17 20:56:05
问题 I have found some articles (http://mikepultz.com/2011/03/accessing-google-speech-api-chrome-11/) describing how to use the Google speech API. However, is this reliable? Does anyone uses it in production? Thanks 回答1: My understanding is that Google's speech web services are only legitimately used by their Chrome browser or Android operating system. You can certainly build a production Chrome or Android app that relies on these services. Both Chrome and Android documentation describe how to add

Google speech to text API in C#

大兔子大兔子 提交于 2019-12-17 19:52:59
问题 My program get a correct respon from google when the flac file recorded manual by using windows's sound recorder and convert it using a software converter. But when I use the file that recorded by my program, I got "{"result":[]} " from google. What should I do? here is my code : the sender : private static void CopyStream(FileStream fileStream, Stream requestStream) { var buffer = new byte[32768]; int read; while ((read = fileStream.Read(buffer, 0, buffer.Length)) > 0) { requestStream.Write

Can CMU Sphinx be set up to recognize ~200 words

倖福魔咒の 提交于 2019-12-17 19:25:28
问题 I have a client who needs an Android App that can recognize spoken commands. From what I understand the built-in voice to text functionality actually sends data to Google's servers which then sends back a text translation. This is a major problem, as the voice data is extremely sensitive (unless if the data is encrypted when it is sent to and from Google - but I doubt it is encrypted). There are 2 options that I can think of. First is to convert speech-to-text on the Android, though this

good Speech recognition API

旧时模样 提交于 2019-12-17 07:05:51
问题 I am working on a college project in which I am using speech recognition. Currently I am developing it on Windows 7 and I'm using system.speech API package which comes along with .net and I am doing it on C#. The problem I am facing is dictation recognition is not accurate enough. Then whenever I start my application the desktop speech recognition starts automatically. This is a big nuicance to me. As already the words I speak are not clear enough and conflicting recognition are interpreted

Watson speech to text live stream C# code example

烂漫一生 提交于 2019-12-13 17:15:50
问题 I'm trying to build an app in C# that will take an audio stream (from a file for now, but later it will be a web stream) and return transcriptions from Watson in real time as they become available, similar to the demo at https://speech-to-text-demo.mybluemix.net/ Does anyone know where I can find some sample code, preferably in C#, that could help me get started? I tried this, based on the limited documentation at https://github.com/watson-developer-cloud/dotnet-standard-sdk/tree/development

How can I Toast after Text to Speech finish speaking Android

拟墨画扇 提交于 2019-12-13 15:19:39
问题 How can I Toast after Text to Speech finish speak. Actually I want to do someting more than Log. This is my code. public class MainActivity extends AppCompatActivity implements TextToSpeech.OnInitListener, TextToSpeech.OnUtteranceCompletedListener { private TextToSpeech mTts; Button btnSpeak; EditText editTextTTS; @Override protected void onCreate(Bundle savedInstanceState) { super.onCreate(savedInstanceState); setContentView(R.layout.activity_main); mTts = new TextToSpeech(this,this);

Improving accuracy of Google Cloud Speech API

天涯浪子 提交于 2019-12-13 14:12:32
问题 I am currently recording audio from a web page on my Mac OS computer and running it through the cloud speech api to produce a transcript. However, the results aren't that accurate and there are chunks of missing words in the results. Are there any steps that would help me yield more accurate results? Here are the steps I am taking to convert audio to text: Use Soundflower to channel audio output from my soundcard to mic in. Play audio from website Use quickTime player to record audio which is

how to convert text from audio clip? [duplicate]

可紊 提交于 2019-12-13 08:18:39
问题 This question already has answers here : Is there any way to send audio file to the speech-to-text recognition (2 answers) Closed last year . I have an audio file link came from json file now I need to convert the audio clip to text and this text should compare with a speech to text in android programming. how to convert this audio file to text, I had googled a lot, still didn't find the correct answer, any suggestions will be greatly appreciated. 回答1: use two powerful natural language