speech-to-text | 易学教程

SAPI: Speech to Text example

阅读更多关于 SAPI: Speech to Text example

I am new to SAPI, and I would really appreciate if any of you can provide me a speech to text Hello World example in SAPI. I know MS got some examples like "Dictation" etc, but I would like to start with a very small one. Glad if you can help. I played a bit with Windows Voice Recognition using SAPI, it really isn't user friendly. Here is an example of code I wrote (in C++) : #include <sphelper.h> #include <sapi.h> #include <iostream> #include <string> const ULONGLONG grammarId = 0; const wchar_t* ruleName1 = L"ruleName1"; int start_listening(const std::string& word); ISpRecoGrammar* init

Android extras about speech recognition does not work

阅读更多关于 Android extras about speech recognition does not work

I am currently busy with speech recognition, and I want speech recognizer to listen me continously, forever. The purpose of the app is, it is going to wait for speech of a specific keyword: when the user says this keyword, the service becomes ready to receive user voice commands After research I found that we can manupulate the time that the recognizer listens. By these constants: By the way the default recognizer service listens about 10 seconds. (The google api) String EXTRA_SPEECH_INPUT_COMPLETE_SILENCE_LENGTH_MILLIS The amount of time that it should take after we stop hearing speech to

Python pocketsphinx RequestError: missing PocketSphinx module: ensure that PocketSphinx is set up correctly

阅读更多关于 Python pocketsphinx RequestError: missing PocketSphinx module: ensure that PocketSphinx is set up correctly

I am trying to make a Python app that can record audio and translate it into english text using PyAudio, SpeechRecognition and PocketSphinx. I'm running on a Mac OS X El Capitan, version 10.11.2. Following a tutorial like this one and others, I've downloaded PyAudio version 0.2.9, SpeechRecognition as well as PocketSphinx. I've installed them into a Conda environment. I have followed the instructions from this site to use brew install swig git python on my OS X, hoping it would help. This is my code: # Load packages import speech_recognition as sr import sphinxbase import pocketsphinx # obtain

Getting started with speech recognition and python

阅读更多关于 Getting started with speech recognition and python

问题 I would like to know where one could get started with speech recognition. Not with a library or anything that is fairly "Black Box'ed" But instead, I want to know where I can Actually make a simple speech recognition script. I have done some searching and found, not much, but what I have seen is that there are dictionaries of 'sounds' or syllables that can be pieced together to form text. So basically my question is where can I get started with this? Also, since this is a little optimistic, I

x-webkit-speech is deprectated. A JS replacement for simple speech input for <input>?

阅读更多关于 x-webkit-speech is deprectated. A JS replacement for simple speech input for ?

I noticed this warning in Chrome devtool's console: The x-webkit-speech input field attribute is deprecated. Please use the JavaScript API instead. So what's the best JS API replacement for simple speech input on input fields? Perhaps there exists a drop-in JS module that will handle it for me? Answering my own question: Just made a Github project for this purpose: https://github.com/Daniel-Hug/speech-input 0e4ef622 There is a JavaScript SpeechRecognition API that does text-to-speech and speech-to-text. Not many browsers support it but Chrome should have support. 来源： https://stackoverflow.com

Android Speech to Text Example

阅读更多关于 Android Speech to Text Example

I've looked at the Android example for VoiceRecognition, but I don't really understand what it is suppose to do or how it works. In the manifest there isn't any sort of main activity to run and so when I install the app on my phone I can't run it. I'm also trying to find a simple example of Speech to text that takes speech as input and outputs the text on the screen. Just so I can study it to see how it works, but I haven't been able to find any sort of example on the web that shows it. I did it like that: in onCreate: List<ResolveInfo> activities = pm.queryIntentActivities(new Intent

How do I convert speech to text?

阅读更多关于 How do I convert speech to text?

问题 How could I take MP3 and convert the speech to text? I've got some recorded notes from a conference and from meetings (there is a single voice on the recording, which is my voice). I thought it would be easier and intellectually interesting to convert to text using speech to text tools rather than simply transcribe by hand. I know there are technologies out there, especially for VoIP applications using Asterisk and Podcasts, but what are they and how can I use them? 回答1: Open Source: CMU

pyspeech (python) - Transcribe mp3 files?

阅读更多关于 pyspeech (python) - Transcribe mp3 files?

I'd like to transcribe mp3 (speech-to-text) using the pyspeech API. I don't know if this is possible, though. Is it? How? pyspeech seems to be merely a python interface to the regular Windows speech APIs. Most likely you'd create some method of treating mp3 playback as an audio source for that speech API to listen to. Michael Levy I don't know about pyspeech, but if it is a Python wrapper around the Microsoft speech APIs, then some other posts may be helpful. Microsoft Speech engines do not require microphone input. They can accept audio files. If you are doing transcription, you will need a

Speech to text API for iphone? [closed]

阅读更多关于 Speech to text API for iphone? [closed]

Can anyone provide sample integration or please let me know any 3rd party api for speech to text conversion in iphone ? Thanks & Regards, Tariq You could look into CMU Sphinx and VocalKit (on github). Using pocketsphinx it would be easier http://cmusphinx.sourceforge.net/ OpenEars – iPhone Voice Recognition and Text-To-Speech http://www.politepix.com/openears/ 来源： https://stackoverflow.com/questions/3598549/speech-to-text-api-for-iphone

Google speech to text API in C#

阅读更多关于 Google speech to text API in C#

My program get a correct respon from google when the flac file recorded manual by using windows's sound recorder and convert it using a software converter. But when I use the file that recorded by my program, I got "{"result":[]} " from google. What should I do? here is my code : the sender : private static void CopyStream(FileStream fileStream, Stream requestStream) { var buffer = new byte[32768]; int read; while ((read = fileStream.Read(buffer, 0, buffer.Length)) > 0) { requestStream.Write(buffer, 0, read); } } private static void ConfigureRequest(HttpWebRequest request) { request.KeepAlive