speech-to-text

Speech recognition for recorded audio files in .3gp or wav format [duplicate]

℡╲_俬逩灬. submitted on 2019-12-07 02:56:27
Question: This question already has answers here. Closed 6 years ago. Possible duplicate: Voice recognition on android with recorded sound clip? I am developing an Android app that will record audio to a file and then use speech recognition to get text from the recorded speech. Is there a speech recognition library available for this sort of task? How can I get started? Answer 1: Get started with this tutorial, where you can learn speech-to-text conversion in Android with its speech API. Also see: Speech

Android Speech to Text Api Google - notification

◇◆丶佛笑我妖孽 submitted on 2019-12-06 22:04:28
I followed this tutorial: https://jbinformatique.com/2018/02/16/android-speech-to-text-api-google-tutoriel/ It works nicely! It uses the android.speech.RecognizerIntent class, it's free, and it works without an Internet connection, as mentioned here: Difference between Android Speech to Text API (Recognizer Intent) and Google Cloud Speech API? However, when I start speech recognition, I get the following notification. Translated as best I can, it says: "Your audio recordings will be sent to Google and used for speech recognition. The transcription will be shared with this application. File recorded on "....@gmail

Microsoft speech API 5.1, 5.3?

早过忘川 submitted on 2019-12-06 14:52:19
I'm a little confused by the different SAPI versions available. First of all, I can only find the SDK for version 5.1. Is there an SDK for version 5.3, and if not, why? Which version can I use if I'm developing with version 3.5 of the .NET Framework? Is there a good tutorial? The only ones I found are pretty old (they use the 2003 version of Visual Studio): http://msdn.microsoft.com/en-us/library/ms986944.aspx Is there any way to use the speech API directly in an ASP.NET web site in speech-to-text mode? Thanks! Wikipedia tells me that SAPI 5.3 was included in

Android SpeechRecognizer set recognize engine?

五迷三道 submitted on 2019-12-06 11:56:53
Question: I'm using SpeechRecognizer to convert speech to text. I'm on a Samsung device, and I encountered an error when the voice recognizer was set to 'Samsung powered by Vlingo'; after I changed it to 'Google', the error disappeared. Is there any way to change the recognition engine programmatically? Thanks. Answer 1: I've found the answer to my question. I can use the method 'public static SpeechRecognizer createSpeechRecognizer (Context context, ComponentName serviceComponent)' to choose which recognition

Getting WAV file transcription to work with Sphinx4

人走茶凉 submitted on 2019-12-06 11:26:23
I've got Sphinx-4 installed on my Windows XP system and JSAPI set up. I'd like to transcribe a WAV (or MP3) file of spoken English to text. When I run the "WavFile" demo, it runs successfully:
java -jar WavFile.jar
But when I pass my own WAV file like this:
java -jar WavFile.jar c:\test.wav
I get:
Loading Recognizer as defined in 'jar:file:/C:/sphinx4-1.0beta3-bin/sphinx4-1.0beta3/bin/WavFile.jar!/edu/cmu/sphinx/demo/wavfile/config.xml'...
Decoding jar:file:/C:/sphinx4-1.0beta3-bin/sphinx4-1.0beta3/bin/WavFile.jar!/edu/cmu/sphinx/demo/wavfile/12345.wav
Result: one two three four five
It seems

Speech to text button in textbox on a webpage

走远了吗. submitted on 2019-12-06 09:20:46
Question: Is it possible to have a working Google speech-to-text button in a textbox on a webpage? How can it be done? Answer 1: It's an HTML5 thingy ... <input type="text" id="txtSearch" x-webkit-speech /> so it will only work in browsers that support it. Live code: http://jsbin.com/afifaw/3 Answer 2: This is a really crude implementation of TTS that you might find acceptable. It's going to need some tender love and care, but it's a starting point. <p id="tts">The maximum number of characters, including punctuation

how to pass language in speech recognition on android apps?

拈花ヽ惹草 submitted on 2019-12-06 09:01:58
I've been working with the speech recognition API on Android and found that the results vary a lot when the language settings are changed. Is there a way to set the language programmatically? Or is there an intent to launch the speech language settings screen? Or something else? Note: I tried to use this intent extra: intent.putExtra(RecognizerIntent.EXTRA_LANGUAGE_PREFERENCE, "en-US"); and Intent detailsIntent = new Intent(RecognizerIntent.ACTION_GET_LANGUAGE_DETAILS); sendOrderedBroadcast(detailsIntent, null, new LanguageDetailsChecker(), null, Activity.RESULT_OK, null, null); Yes hanifs, that
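One option relevant to the question above is the `RecognizerIntent.EXTRA_LANGUAGE` extra, which is documented as taking an IETF BCP 47 language tag such as "en-US". The sketch below is plain Java so that it runs outside Android; it only shows how such a tag can be derived from `java.util.Locale`. The class name `LanguageTags` and its method are hypothetical, the Android call appears only in a comment, and whether a particular recognizer honors the extra is an assumption to check on the device.

```java
import java.util.Locale;

// Hypothetical helper for illustration; not part of the Android SDK.
final class LanguageTags {
    private LanguageTags() {}

    // Returns the BCP 47 language tag for a locale.
    static String tagFor(Locale locale) {
        return locale.toLanguageTag();
    }

    public static void main(String[] args) {
        String tag = tagFor(Locale.US);
        System.out.println(tag);
        // On Android, the tag would then be attached to the recognition intent:
        //   intent.putExtra(RecognizerIntent.EXTRA_LANGUAGE, tag);
    }
}
```

Deriving the tag from a `Locale` rather than hard-coding a string makes it straightforward to follow the device locale, for example with `Locale.getDefault()`.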

How can I access IBM speech-to-text api with curl?

删除回忆录丶 submitted on 2019-12-06 08:40:13
I cannot access the speech-to-text API on IBM Bluemix with curl! I tried the example from the documentation for a sessionless request with curl and it didn't work; I got an invalid userID/password message. Here is the error I got: { "code": 401, "error": "Not Authorized", "description": "2016-10-08T15:22:37-04:00, Error ERCDPLTFRM-DNLKUPERR occurred when accessing https://158.85.132.94:443/speech-to-text/api/v1/recognize?timestamps=true&word_alternatives_threshold=0.9&continuous=true, Invalid UserId and/or Password. Please confirm that your credentials match the end-point you are trying
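The 401 above is an HTTP Basic authentication failure, and Basic authentication is what curl's `-u username:password` option sends. As a minimal sketch of what that header contains, the plain-Java snippet below builds the `Authorization` value from a username and password. The class name `BasicAuth` and the credentials are hypothetical placeholders; which credentials the service actually expects is not shown in the excerpt.

```java
import java.nio.charset.StandardCharsets;
import java.util.Base64;

// Hypothetical helper for illustration only.
final class BasicAuth {
    private BasicAuth() {}

    // Builds the value of the Authorization header for HTTP Basic auth:
    // the literal "Basic " followed by base64("username:password").
    static String headerValue(String username, String password) {
        String pair = username + ":" + password;
        String encoded = Base64.getEncoder()
                .encodeToString(pair.getBytes(StandardCharsets.UTF_8));
        return "Basic " + encoded;
    }

    public static void main(String[] args) {
        // Placeholder credentials, not real service credentials.
        System.out.println("Authorization: " + headerValue("user", "pass"));
    }
}
```

Printing the header this way and comparing it with the output of `curl -v` can help confirm that the credentials reaching the server are the ones intended.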

How to detect that speech recognition is in progress

纵饮孤独 submitted on 2019-12-06 08:24:14
Question: Problem: I have a UITextField side by side with a UIButton that provides send functionality. When the user presses the send button, I perform a simple action:
- (IBAction)sendMessage: (id)sender {
    [self.chatService sendMessage: self.messageTextField.text];
    self.messageTextField.text = @""; // here I get exception
}
Now, when the user starts dictation from the keyboard, then presses Done on the dictation view (keyboard) and immediately presses the send button, I get the exception "Range or index out of bounds". Possible

Disable speech to text button (microphone) on soft input keyboard in Android programmatically

怎甘沉沦 submitted on 2019-12-06 05:31:42
Question: Thanks in advance for the help. I am developing an Android application for research purposes and need to disable the speech-to-text button on the soft input keyboard. The reason is that concurrency issues arise, since the application I am developing uses the microphone. I understand that for a general application, disabling keys is usually considered impossible (since users may change default keyboards). I know for a fact that the default keyboard will be used. With this in mind