speech-recognition | 易学教程

Using Gstreamer with Google speech API (Streaming Transcribe) in C++

Question: I am using the Google Speech API from Google Cloud Platform to get speech-to-text for streaming audio. I have already made REST API calls using curl POST requests for a short audio file on GCP. I have read the documentation for Google Streaming Recognize, which says "Streaming speech recognition is available via gRPC only." I have gRPC (and protobuf) installed on my openSUSE Leap 15.0 system. Here is the screenshot of the directory. Next I am trying to run the streaming_transcribe example

Partial results using speech recognition

Question: I created a simple application inspired by this example in order to test all the available options (i.e. extras). I read about the EXTRA_PARTIAL_RESULTS extra; if I enable this option, I should receive partial speech recognition results from the server. However, when I add this extra to the ACTION_RECOGNIZE_SPEECH intent, voice recognition no longer works: the list does not display any results. protected void onActivityResult(int requestCode, int resultCode, Intent

Speech recognition with Microsoft Cognitive Speech API and non-microphone real-time audio stream

Question: Problem: My project consists of a desktop application that records audio in real time, for which I intend to receive real-time recognition feedback from an API. With a microphone, a real-time implementation using Microsoft's new Speech-to-Text API is trivial; my scenario differs only in that my data is written to a MemoryStream object. API Support: This article explains how to implement the API's Recognizer (link) with custom audio streams, which invariably requires

Getting started with speech recognition programming questions

Question: So, you've all probably seen Iron Man, where Tony interacts with an AI system called Jarvis. Demo clip here (sorry, it's a commercial). I'm very familiar with C#, C++ and Visual Basic, but I am unsure what options I have for programming something like this. Ideally, I'd like it to assist me while I work on some projects by automating a few things. After doing a bit of research, I saw that a lot of people were using AppleScript. Well, I'm a Windows developer and I work on

Speech Recognition Limits for iOS 10

Question: Does anyone know whether there are limits on speech recognition in iOS 10 (per device or per app)? Answer 1: Yes, there are limits, but I don't think Apple has issued many specific numbers. Apple released a supplementary video during WWDC 2016 which said the following: Now just a quick talk about some best practices. We're making speech recognition available for free to all apps but we do have some reasonable limits in place so that the service remains available to everyone. Individual devices may be

SpeechRecognizer ERROR_SERVER when running not offline languages

Question: Everything is fine when I run this with English set as the default language, but when I run it with any language that is not available offline I keep getting error 4 (ERROR_SERVER), even if I turn on the Internet connection. I fixed it some time ago by changing the language model to LANGUAGE_MODEL_WEB_SEARCH. But I have since added some other features, and it is not working again no matter what I change here. What I have already tried: Read all other related questions on Stack Overflow. Manually set speech