speech-recognition

Using Gstreamer with Google speech API (Streaming Transcribe) in C++

荒凉一梦 提交于 2020-02-02 13:42:51
问题 I am using the Google Speech API from cloud platform for getting speech-to-text of a streaming audio. I have already done the REST API calls using curl POST requests for a short audio file using GCP. I have seen the documentation of the Google Streaming Recognize, which says "Streaming speech recognition is available via gRPC only." I have gRPC (also protobuf) installed in my OpenSuse Leap 15.0 . Here is the screenshot of the directory. Next I am trying to run the streaming_transcribe example

Using Gstreamer with Google speech API (Streaming Transcribe) in C++

强颜欢笑 提交于 2020-02-02 13:38:30
问题 I am using the Google Speech API from cloud platform for getting speech-to-text of a streaming audio. I have already done the REST API calls using curl POST requests for a short audio file using GCP. I have seen the documentation of the Google Streaming Recognize, which says "Streaming speech recognition is available via gRPC only." I have gRPC (also protobuf) installed in my OpenSuse Leap 15.0 . Here is the screenshot of the directory. Next I am trying to run the streaming_transcribe example

Partial results using speech recognition

回眸只為那壹抹淺笑 提交于 2020-02-01 08:26:06
问题 I created a simple application inspired by this example in order to test all the available options (ie extra). I read about the EXTRA_PARTIAL_RESULTS extra and if I enable this option I should receive from the server any partial results related to a speech recognition. However, when I add this extra to the ACTION_RECOGNIZE_SPEECH intent, the voice recognition does not work anymore: the list does not display any results. protected void onActivityResult(int requestCode, int resultCode, Intent

Partial results using speech recognition

笑着哭i 提交于 2020-02-01 08:26:05
问题 I created a simple application inspired by this example in order to test all the available options (ie extra). I read about the EXTRA_PARTIAL_RESULTS extra and if I enable this option I should receive from the server any partial results related to a speech recognition. However, when I add this extra to the ACTION_RECOGNIZE_SPEECH intent, the voice recognition does not work anymore: the list does not display any results. protected void onActivityResult(int requestCode, int resultCode, Intent

Partial results using speech recognition

瘦欲@ 提交于 2020-02-01 08:26:04
问题 I created a simple application inspired by this example in order to test all the available options (ie extra). I read about the EXTRA_PARTIAL_RESULTS extra and if I enable this option I should receive from the server any partial results related to a speech recognition. However, when I add this extra to the ACTION_RECOGNIZE_SPEECH intent, the voice recognition does not work anymore: the list does not display any results. protected void onActivityResult(int requestCode, int resultCode, Intent

Speech recognition with Microsoft Cognitive Speech API and non-microphone real-time audio stream

纵然是瞬间 提交于 2020-01-31 06:17:04
问题 Problem My project consists of a desktop application that records audio in real-time, for which I intend to receive real-time recognition feedback from an API. With a microphone , a real-time implementation using Microsoft's new Speech-to-Text API is trivial, with my scenario differing from that only in the sense that my data is written to a MemoryStream object. API Support This article explains how to implement the API's Recognizer (link) with custom audio streams , which invariably requires

Speech recognition with Microsoft Cognitive Speech API and non-microphone real-time audio stream

你。 提交于 2020-01-31 06:17:04
问题 Problem My project consists of a desktop application that records audio in real-time, for which I intend to receive real-time recognition feedback from an API. With a microphone , a real-time implementation using Microsoft's new Speech-to-Text API is trivial, with my scenario differing from that only in the sense that my data is written to a MemoryStream object. API Support This article explains how to implement the API's Recognizer (link) with custom audio streams , which invariably requires

Getting started with speech recognition programming questions

三世轮回 提交于 2020-01-31 03:57:49
问题 So, you've all probably seen Iron Man where Tony interacts with an AI system called Jarvis. Demo clip here (Sorry it's a commercial). I'm very familiar with C#, C++ and Visual Basic, but I am unsure what options I have available for me to program something like this. Ideally, I'd like to have it assist me while working on some projects by automating a few things. After doing a bit of research, I saw that a lot of people where using apple script. Well, I'm a windows developer and I work on

Speech Recognition Limits for iOS 10

跟風遠走 提交于 2020-01-30 22:02:41
问题 Does anyone know are there limits for the speech recognition in iOS 10 (per device or per app)? 回答1: Yes, there are limits, but I don't think Apple has issued many specific numbers. Apple released a supplementary video during WWDC 2016 which said the following: Now just a quick talk about some best practices. We're making speech recognition available for free to all apps but we do have some reasonable limits in place so that the service remains available to everyone. Individual devices may be

SpeechRecognizer ERROR_SERVER when running not offline languages

怎甘沉沦 提交于 2020-01-24 23:28:25
问题 Everything is fine when I run this having English set as default language, but when I run it on any language that is not available offline I keep getting error 4 (ERROR_SERVER), even if I turn on Internet connection. I fixed it some time ago by changing language model to LANGUAGE_MODEL_WEB_SEARCH. But I added some other features and it is not working again no matter what I change here. What I have already tried to do: Read all other related questions on Stack Overflow. Manually set speech