speech-to-text

How can I transcribe a speech file with the Bing Speech API in Python?

有些话、适合烂在心里 提交于 2019-12-06 02:25:59
How can I transcribe a speech file with the Bing Speech API in Python? My speech file is longer than 15 seconds. I'm aware that one may use the Bing Speech REST API in Python. https://gist.github.com/jellis505/973ea6de12508c7c720da4a074e7d065 gives an example in Python 2: #!/usr/bin/env python # -*- coding: utf-8 -*- import requests import httplib import uuid import json class Microsoft_ASR(): def __init__(self): self.sub_key = 'YourKeyHere' self.token = None pass def get_speech_token(self): FetchTokenURI = "/sts/v1.0/issueToken" header = {'Ocp-Apim-Subscription-Key': self.sub_key} conn =

How to convert speech to text during call with different text colors for caller and call receiver?

笑着哭i 提交于 2019-12-06 01:51:42
问题 I want to convert speech to text during call. I also want the text to display in different colors: the call initiator's in red and the call receiver's green. During my tests, I converted speech to text during call but was unable to distinguish between the voice of the call initiator and that of the call receiver. Thanks in advance Please Help me out... 来源: https://stackoverflow.com/questions/20964359/how-to-convert-speech-to-text-during-call-with-different-text-colors-for-caller

Using c++ to call and use Windows Speech Recognition [closed]

假装没事ソ 提交于 2019-12-05 22:05:31
I am making an application that involves the use of windows speech recognition. I am thinking of using c++ to do this since i have some experience with this language. The way i want to use the speech recognition is so that it works internally. If i upload an audio file into my program, i want speech recognition to write this audio up as a text file, but all this should be done internally. Please provide some help with this and if i have not explained my question properly please let me know and i will try to explain again. Thanks in advance, Divs Michael Levy Windows provides speech recognition

Web Speech API Custom Words

点点圈 提交于 2019-12-05 21:04:00
问题 I read through the W3C docs on this and I'm thinking that custom words come from custom grammar, but I tried going to this demo and in the console entered the following javascript: recognition.grammars.addFromString('foo'); Which ran fine and recognition.grammars[0].src returns: "data:application/xml,foo" Note : 'foo' is not the word I'm interested in, but the word I'm interested in isn't an english word, using 'foo' for the example. When I speak my custom word normally, it thinks I'm saying

Android Speech Recognition API does not work in Android 7 Nougat

拜拜、爱过 提交于 2019-12-05 14:07:51
问题 I am using the android.speech.SpeechRecognizer API for speech. I works great in Android 4-5, In Android 6 it now has a bunch of bugs, like the chime that occurs when the mic turns on is detected as speech, so it exists (and loops indefinitely when it restarts because to speech was detected, we have a hack workaround for this that sets the volume to 0 before the chime is played...) In Android 6 the speech also dies with no error or anything after 5 seconds. We have another hack workaround for

Sample example for Speech to Text in iOS

懵懂的女人 提交于 2019-12-05 10:41:59
I am new to iOS programming.Can you please tell me how to convert the speech to text in iOS?Is there any API called?Please suggest me how to proceed? There are several libraries for this kind of conversion - I host two of those on GitHub: libsprec (this uses the Google speech recognition APIs, so it supports multiple languages) and VocalKit which uses the high-quality opensource PocketSphinx library (however, it currently supports English only). OpenEars uses PocketSphinx as well. There are some commercial solutions as well: the Dragon Dication SDK is a popular, however quite expensive

ios speech to text conversion [duplicate]

痞子三分冷 提交于 2019-12-05 08:17:12
问题 This question already has answers here : iPhone App › Add voice recognition? [closed] (4 answers) API or SDK for speech to text(speech recognition ) iphone (3 answers) Closed 6 years ago . I would like to convert spoken words into text so I can use nslinguistictagger in my app. How can I convert speech to text? What are the options? Does openears support voice to text conversion? 回答1: Openears will support free speech recognition and text-to-speech functionalities in offline mode. They have

can i use google speech recognition api in my desktop application

与世无争的帅哥 提交于 2019-12-05 04:02:40
问题 I want to know whether i can use speech recognition api of google for my desktop application. I have seen some example in which i have to convert the speech to a file and send to a url. But it will be little cumbersome task because in my application the user have to continuously submit his voice. So is there any other alternative to use google speech api. I am least interested to go with sphinx because its accuracy is very less and i dont know how to add new words in the dictionary and

Speech to text in emulator: No Activity found to handle Intent

被刻印的时光 ゝ 提交于 2019-12-04 23:47:04
问题 I want to ask how I can use speech to text code on my emulator. My codes work on real device but not work on emulator. The error said : No Activity found to handle Intent { act=android.speech.action.RECOGNIZE_SPEECH (has extras) } What can I do? 回答1: package net.viralpatel.android.speechtotextdemo; import java.util.ArrayList; import android.app.Activity; import android.content.ActivityNotFoundException; import android.content.Intent; import android.os.Bundle; import android.speech

Android SpeechRecognizer set recognize engine?

前提是你 提交于 2019-12-04 15:11:29
I'm using SpeechRecognizer to convert speech to text. I'm using Samsung and I encounterd error while I set Voice recognizer to 'Samsung powered by Vlingo', after I changed to 'Google' and the error disappear. I was wondering is there any method to change recognize engine programmatically? Thanks. I've found the answer of my question. I can use 'public static SpeechRecognizer createSpeechRecognizer (Context context, ComponentName serviceComponent)' the method to choose which recognition service to bind. Here is the implementation. SpeechRecognizer.createSpeechRecognizer(mContext, ComponentName