speech-to-text

Android SpeechRecognizer should only be used from the application's main thread

牧云@^-^@ posted on 2019-12-02 07:44:42
I am trying to integrate some of Android's Speech APIs into my AndEngine-based game. I placed my code within the BaseGame activity, but this error appears at runtime:
05-06 23:51:28.955: ERROR/AndroidRuntime(553): java.lang.RuntimeException: SpeechRecognizer should be used only from the application's main thread
How do I access the application's main thread, and how can I ensure my speech code runs in it? The setup is as follows: a main class (BaseGameActivity) and a SpeechClass. The main class instantiates a new SpeechClass and calls its method, and then I get the above error. Please advise on how I can do

C# system.speech.recognition alternate words

感情迁移 posted on 2019-12-02 06:17:30
I am currently using the Microsoft.Speech API to dictate utterances into text, but what I really need are the alternative dictations the program could use. I am using this for my honours thesis, and for it I wish to know the top 10 interpretations of any utterance. A very similar, if not identical, question was asked in 2011 ("C# system.speech.recognition alternates") but was never answered. My question is thus: how does one get the alternatives to an interpretation of a dictation using the Microsoft.Speech API?
This MSDN page handles what you're asking quite nicely. For reference, I'll post the

Transcribing WMA/MP3 audio in an automated fashion?

偶尔善良 posted on 2019-12-02 05:29:26
Question: I've got a lot of speech audio in WMA format and I'd like to machine-transcribe it; even if the transcription is not 100% accurate, I think it could help quite a bit as an "index" to some of the audio. I'm willing to write some code to make this happen, but can Microsoft's Speech APIs help me here? Is there already an app that can do this for me?
Answer 1: SAPI can certainly do what you want. Start with an in-proc recognizer, connect up your audio as a file stream (you'll probably need to

Watson Speech to Text Response Times

风流意气都作罢 posted on 2019-12-02 05:24:34
Question: I am having some trouble with the service while pushing to Bluemix. The push request I am sending via php-curl is:
POST https://stream.watsonplatform.net/speech-to-text/api/v1/recognize
with the following parameters:
model: en-US_NarrowbandModel
Content-Type: audio/wav; rate: 8000
Transfer-Encoding: chunked
timestamps: true
word_confidence: true
continuous: true
The response timeout is set to 2h (7200000 milliseconds). We are getting timeouts from curl for some of our files, without any apparent

What is this error: "IOError: [Errno 2] No such file or directory: 'audio.flac'"? I am trying to use the Google voice recognition API for Python

≡放荡痞女 posted on 2019-12-02 03:17:34
Question: Basically, I want to convert speech to text, so I am trying to use the Google voice recognition API for Python. This is the code I'm trying to run:
    from pygsr import Pygsr
    speech = Pygsr()
    speech.record(3)  # duration in seconds (3)
    phrase, complete_response = speech.speech_to_text('es_ES')
    print phrase  # This is the required output
I've installed all the modules correctly, so probably nothing is wrong with the modules, but I am getting the following error:
Traceback (most recent call last):

Watson Speech to Text Response Times

断了今生、忘了曾经 posted on 2019-12-02 01:48:37
I am having some trouble with the service while pushing to Bluemix. The push request I am sending via php-curl is:
POST https://stream.watsonplatform.net/speech-to-text/api/v1/recognize
with the following parameters:
model: en-US_NarrowbandModel
Content-Type: audio/wav; rate: 8000
Transfer-Encoding: chunked
timestamps: true
word_confidence: true
continuous: true
The response timeout is set to 2h (7200000 milliseconds). We are getting timeouts from curl for some of our files, without any apparent reason. At first I thought it was related to the duration of the recordings, but I've got in logs with
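For readers who want to reproduce the request outside PHP, here is a rough Python sketch using only the standard library. It is not the poster's code: `build_recognize_request` is a hypothetical helper, authentication and chunked transfer are left out, and the request is only constructed, not sent. The endpoint and parameter names are the ones quoted in the question.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Endpoint quoted in the question.
RECOGNIZE_URL = "https://stream.watsonplatform.net/speech-to-text/api/v1/recognize"


def build_recognize_request(audio_bytes, content_type="audio/wav"):
    """Build (but do not send) a POST request mirroring the question's parameters.

    Hypothetical helper for illustration; credentials are deliberately left out.
    """
    query = urlencode({
        "model": "en-US_NarrowbandModel",
        "timestamps": "true",
        "word_confidence": "true",
        "continuous": "true",
    })
    return Request(
        RECOGNIZE_URL + "?" + query,
        data=audio_bytes,
        headers={"Content-Type": content_type},
        method="POST",
    )


if __name__ == "__main__":
    request = build_recognize_request(b"placeholder audio bytes")
    print(request.get_method(), request.full_url)
```

When actually sending such a request, passing an explicit `timeout` to `urllib.request.urlopen` bounds the client-side wait, which plays the same role as the curl timeout mentioned in the question.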

Transcribing WMA/MP3 audio in an automated fashion?

眉间皱痕 posted on 2019-12-02 01:13:17
I've got a lot of speech audio in WMA format and I'd like to machine-transcribe it; even if the transcription is not 100% accurate, I think it could help quite a bit as an "index" to some of the audio. I'm willing to write some code to make this happen, but can Microsoft's Speech APIs help me here? Is there already an app that can do this for me?
SAPI can certainly do what you want. Start with an in-proc recognizer, connect up your audio as a file stream (you'll probably need to transcode your WMA files to a WAV stream, as SAPI only takes WAV input, but you can do the transcoding on the fly),

What is this error: "IOError: [Errno 2] No such file or directory: 'audio.flac'"? I am trying to use the Google voice recognition API for Python

筅森魡賤 posted on 2019-12-01 22:51:28
Basically, I want to convert speech to text, so I am trying to use the Google voice recognition API for Python. This is the code I'm trying to run:
    from pygsr import Pygsr
    speech = Pygsr()
    speech.record(3)  # duration in seconds (3)
    phrase, complete_response = speech.speech_to_text('es_ES')
    print phrase  # This is the required output
I've installed all the modules correctly, so probably nothing is wrong with the modules, but I am getting the following error:
Traceback (most recent call last):
  File "C:/Python/google_voice.py", line 4, in <module>
    phrase, complete_response = speech.speech_to_text
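An `IOError: [Errno 2]` like the one in the title means Python could not find `audio.flac` when it tried to open it; a relative name such as `audio.flac` is resolved against the current working directory. The sketch below is not part of pygsr: `describe_path` is a hypothetical helper, written for Python 3, that reports where a relative file name resolves to and whether a file exists there. That may help narrow down whether the recording step ever produced the file.

```python
import os


def describe_path(filename):
    """Return (absolute_path, exists) for a possibly relative file name.

    Hypothetical helper for illustration; not part of pygsr.
    """
    # Relative names are resolved against the current working directory.
    absolute_path = os.path.abspath(filename)
    return absolute_path, os.path.isfile(absolute_path)


if __name__ == "__main__":
    path, exists = describe_path("audio.flac")
    print("Looking for:", path)
    print("File exists:", exists)
```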

408 Request timed out Microsoft Speech to Text

时光总嘲笑我的痴心妄想 posted on 2019-12-01 21:50:53
Question: My .wav file is just 4 seconds long. Even after multiple retries and running it in the cloud, I constantly get the following error:
* upload completely sent off: 12 out of 12 bytes
< HTTP/1.1 408 Request timed out (> 14000 ms)
< Transfer-Encoding: chunked
< Content-Type: text/plain
< Server: Microsoft-IIS/8.5
< X-MSEdge-Ref:
Has anybody faced this issue? This is my request:
`curl -v "https://speech.platform.bing.com/recognize?scenarios=catsearch&appid=D4D52672-91D7-4C74-8AD8-42B1D98141A5

408 Request timed out Microsoft Speech to Text

我只是一个虾纸丫 posted on 2019-12-01 20:20:29
My .wav file is just 4 seconds long. Even after multiple retries and running it in the cloud, I constantly get the following error:
* upload completely sent off: 12 out of 12 bytes
< HTTP/1.1 408 Request timed out (> 14000 ms)
< Transfer-Encoding: chunked
< Content-Type: text/plain
< Server: Microsoft-IIS/8.5
< X-MSEdge-Ref:
Has anybody faced this issue? This is my request:
`curl -v "https://speech.platform.bing.com/recognize?scenarios=catsearch&appid=D4D52672-91D7-4C74-8AD8-42B1D98141A5&locale=en-US&device.os=wp7&version=3.0&format=json&requestid=1d4b6030-9099-12e0-91e4-0800200c9a67