speech-to-text

Speech Recognition Service in Android

别等时光非礼了梦想. 提交于 2019-12-03 16:34:39
I have an Android application that uses speech recognition in an Activity. The GUI doesn't do anything except for contain the speech recognition objects. I would like to port this over to a service so I can talk to the application while it's running in the background. However, as far as I know, the speech recognition service has to use onActivityResult, which is unavailable for Services. Is there a way to either contain an Activity in a Service such that its GUI is not displayed, or perform speech recognition in a service instead of an activity? Michael Levy See Google's voice search speech

Speech to text in emulator: No Activity found to handle Intent

时光毁灭记忆、已成空白 提交于 2019-12-03 15:53:43
I want to ask how I can use speech to text code on my emulator. My codes work on real device but not work on emulator. The error said : No Activity found to handle Intent { act=android.speech.action.RECOGNIZE_SPEECH (has extras) } What can I do? package net.viralpatel.android.speechtotextdemo; import java.util.ArrayList; import android.app.Activity; import android.content.ActivityNotFoundException; import android.content.Intent; import android.os.Bundle; import android.speech.RecognizerIntent; import android.view.Menu; import android.view.View; import android.widget.ImageButton; import android

C#: transcribe WAV file to text (speech-to-text) with System.Speech namespaces

梦想与她 提交于 2019-12-03 13:04:00
问题 How do you use the .NET speech namespace classes to convert audio in a WAV file to textual form which I can display on the screen or save to file? I am looking for some tutorial samples. UPDATE Found a code sample here. But when I tried it it gives incorrect results. Below is the vb code sample I've adopted. (Actually I don't mind the lang as long as its either vb/c#...). It is not giving me proper results. I assume if we put the right grammar - i.e. the words we expect in the recording - we

How to hide toast message “Your audio will be sent to google to provide speech recognition service.” in Android?

半城伤御伤魂 提交于 2019-12-03 10:17:22
问题 I am using google speech recognizer for integrating voice services in Android but while pressing on mic button this annoying toast message is showing. Please suggest me a way to hide this toast message. This is my java code public class FormActivity extends AppCompatActivity { AppCompatEditText mFeedbackView; ImageView mFeedbackVoiceView; private final int REQ_CODE_SPEECH_INPUT_FEEDBACK = 100; @Override protected void onCreate(Bundle savedInstanceState) { super.onCreate(savedInstanceState);

Burmese speech to text conversion in android?

删除回忆录丶 提交于 2019-12-03 08:02:45
Can we add custom language for RecognizerIntent? I have search many SO Question like https://stackoverflow.com/questions/2080401/is-there-a-speech-to-text-api-by-google That solve my problem of using limited number of language during Speech to Text conversion. My problem is that, I need to used Burmese(Local Language of Burma) Speech and convert it to text. any other help can be appreciated. UPDATE : Google's servers currently support English, Mandarin Chinese, and Japanese. Speech Input API for android Google will unlikely support small languages any time soon. The only way to custom language

Open Source Software For Transcribing Speech in Audio Files

左心房为你撑大大i 提交于 2019-12-03 06:57:30
问题 Can anyone recommend reliable open source software for transcribing English speech in wav files? The two main programs I've researched are Sphinx and Julius, but I've never been able to get either to work, and the documentation with each on transcribing files is sketchy at best. I'm developing on 64-bit Ubuntu 10.04, whose repos include sphinx2 and julius, as well as voxforge's julius acoustic modal for English. I'm focussing on transcribing files, instead of directly processing sound from a

How to convert human voice into digital format?

£可爱£侵袭症+ 提交于 2019-12-03 06:57:29
I am working on a project where biometric system is used to secure the system. We are planning to use human voice to secure the system. Idea is to allow the person to say some words or sentences and system will store that voice in digital format. Next time person wants to enter the system, he/she has to speak some words which may or may not be different from the words used earlier. We don't want to match words but want to match voice frequency. I have read some research papers regarding this system but those papers don't have any implementation details. So just want to know whether there is

How to translate live streaming using google speech api?

我的未来我决定 提交于 2019-12-03 05:59:07
this is the google speech API docs: https://cloud.google.com/speech/docs/sync-recognize i try this API for 2 weeks. but still cant solved my main purpose (translate live streaming). I'm using PHP. (other languange suggestion is allowed, i will find by myself) What i can do in my 2 weeks: Synchronous Speech Recognition (<=1min) Asynchronous Speech Recognition (>1min and <=80min). Note : i can modify this to accept 3hours video. Live speech recognition from mic : https://www.google.com/intl/en/chrome/demos/speech.html UPDATE: Perform streaming API with audio less than 6sec duration. What can't i

Voice Input to Populate Edit Text in android?

半城伤御伤魂 提交于 2019-12-03 05:46:51
问题 I am working on voice input in android. I used the sample from http://developer.android.com/resources/samples/ApiDemos/src/com/example/android/apis/app/VoiceRecognition.html And while testing on Xperia X10, I got the "Speak now" dialog but before I input some voice it gets closed. I am trying to implement voice search e.g. If voice input is James Bond then I want to populate the James in first name Edit Text and Bond in Last name Edit Text. Which will search in database for the name. But

How to use CMU Sphinx 4 for speech to text with english voxforge models

£可爱£侵袭症+ 提交于 2019-12-03 04:33:10
问题 I'm trying to figure out how to use sphinx4 or pocketsphinx with the english voxforge model but I can't get it working. I have tried to read doc pages (like this one http://cmusphinx.sourceforge.net/sphinx4/doc/UsingSphinxTrainModels.html ) but it does not help me. What I want is an executable where I can specify which model to use and which audio file to use as source and have the executable print out it's best guess about what the voice on the recording says. I hade some luck with: