htk

HTK installing in windows10 / Not able to find VC98

こ雲淡風輕ζ submitted on 2021-01-29 15:49:51
Question: I am trying to install the HTK toolkit on my Windows 10 machine. It has this prerequisite: "Ensure that your PATH contains C:\Program Files\Microsoft Visual Studio\VC98\bin". I installed Microsoft Visual Studio, but I cannot find the VC98 folder in the location where Visual Studio is installed. I have searched for it many times without success. Can someone please help me solve this? My final goal is to install HTK (see "Installing HTK on Microsoft Windows").
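The prerequisite quoted above comes down to a membership test on the PATH variable. A minimal sketch of that test follows, assuming Windows-style paths separated by semicolons; the helper name `path_contains` is hypothetical and not part of HTK or Visual Studio. Note that VC98 is the directory name used by Visual C++ 6.0, so a newer Visual Studio installation would not be expected to contain it, and the check would then simply report False.

```python
import ntpath
import os

# Directory named in the HTK Windows instructions quoted in the question.
VC98_BIN = r"C:\Program Files\Microsoft Visual Studio\VC98\bin"


def path_contains(path_value, target, sep=";"):
    """Return True if `target` is one of the `sep`-separated entries of `path_value`.

    Hypothetical helper: entries are compared case-insensitively after
    normalising slashes and trailing separators, as Windows would treat them.
    """
    def norm(p):
        return ntpath.normcase(ntpath.normpath(p.strip().strip('"')))

    wanted = norm(target)
    return any(norm(entry) == wanted
               for entry in path_value.split(sep) if entry.strip())


if __name__ == "__main__":
    # On Windows this inspects the real PATH of the current process.
    print(path_contains(os.environ.get("PATH", ""), VC98_BIN))
```

If the check fails, the blog excerpt further down the page suggests adjusting the path to wherever the Visual C++ compiler is actually installed.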

Getting Started with HTK for Speech Recognition (Part 3): Preparing Training Files

為{幸葍}努か submitted on 2020-02-04 02:37:18
Contents: operations in the config folder; operations in the lists folder; operations in the dict folder; operations in the labels folder; operations in the scripts folder; operations in the main directory tree.
This section records the preparation of the various files needed before training; the steps are somewhat tedious.
Operations in the config folder
Run the following terminal commands in the config folder:
    touch grammer
    touch global.ded
    touch mkphones_color.led
    touch codetrain.scp
    touch config_HCopy
    touch proto
    touch train.scp
    touch config_color
Contents of the grammer file:
    $color = RED | BLUE | YELLOW;
    ( SENT-START ( $color ) SENT-END )
Contents of global.ded:
    AS sp
    RS cmu
    MP sil sil sp
Contents of mkphones_color.led:
    EX
    IS sil sil
    DE sp
Contents of codetrain.scp:
    data/train/speech/s01.wav data/train/mfc/s01.mfc
    data/train/speech/s02.wav data/
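Writing codetrain.scp by hand gets repetitive as the number of recordings grows. The sketch below generates its lines, assuming every entry follows the pattern of the one complete line shown in the excerpt (a .wav path followed by the matching .mfc path). The function name `codetrain_lines` and the list of utterance IDs are hypothetical, and the excerpt is cut off before confirming how the second line ends.

```python
def codetrain_lines(utterance_ids,
                    wav_dir="data/train/speech",
                    mfc_dir="data/train/mfc"):
    """Build one 'source.wav target.mfc' line per utterance ID.

    Assumption: all entries follow the pattern of the first line
    in the excerpt (s01.wav -> s01.mfc).
    """
    return [f"{wav_dir}/{uid}.wav {mfc_dir}/{uid}.mfc" for uid in utterance_ids]


if __name__ == "__main__":
    # Hypothetical IDs s01..s03; replace with the real recording names.
    ids = [f"s{i:02d}" for i in range(1, 4)]
    print("\n".join(codetrain_lines(ids)))
```

Redirecting the printed output into config/codetrain.scp would replace the manual editing step.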

Android offline voice recognition

前提是你 submitted on 2020-01-01 19:22:56
Question: I already use HTK (the Hidden Markov Model Toolkit) to recognize specific commands that control my Android application, but this requires passing voice data to a server, which may take more time. To avoid this latency, I am thinking about using pocketsphinx to recognize the voice data locally in the Android application, so that I won't need to pass the audio to the server. If this is a good idea, is it easy to learn pocketsphinx from scratch? Also, what are advantages

Building HTK without X11

强颜欢笑 submitted on 2019-12-31 07:20:57
Question: I am trying to install HTK using Cygwin (32-bit) on an x86_64 computer. This is part of the error I get when running ./configure and make all. Versions: gcc = 5.4.0, g++ = 5.4.0, perl = 5. I have an X11 folder in C:/cygwin/usr/share. Let me know if I should post the entire error output from make.
    HSLab.c: In function ‘FileExists’:
    HSLab.c:1209:12: warning: variable ‘isEXF’ set but not used [-Wunused-but-set-variable]
       Boolean isEXF; /* File name is extended */
               ^
    /tmp/ccZO1TK3.o: In function `InitBar':

HTK compile error: make[1]: Nothing to be done for 'all'

非 Y 不嫁゛ submitted on 2019-12-13 03:26:57
Question: Following the official HTK Unix/Linux installation guide, I had issues similar to the ones described in this thread when trying to run make all on HTK. I had initially been looking at the wrong Makefile (/htk-3.4.1/Makefile), as suggested by MadScientist. After fixing the extra indentation on line 77 in the correct Makefile (/htk-3.4.1/HLMTools/Makefile), generated by running ./configure --prefix=/tmp, the HTK compile error: Makefile:77: *** missing separator (did you mean TAB instead of
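GNU make requires recipe lines to begin with a TAB character, and the "missing separator" message quoted above is what it reports when it cannot parse a line, which often means a recipe line was indented with spaces. A rough way to locate candidates is sketched below. The function name `space_indented_lines` is hypothetical, and the check is only a heuristic: it flags every non-blank line that starts with a space, which can include legitimate lines such as indented variable assignments.

```python
def space_indented_lines(makefile_text):
    """Return 1-based numbers of non-blank lines whose first character is a space.

    Heuristic only: such lines are candidates for the 'missing separator'
    error, but not every hit is necessarily a broken recipe line.
    """
    hits = []
    for number, line in enumerate(makefile_text.splitlines(), start=1):
        if line.startswith(" ") and line.strip():
            hits.append(number)
    return hits


if __name__ == "__main__":
    # Small made-up Makefile: the recipe under 'clean' uses spaces, not a TAB.
    sample = "all:\n\techo ok\nclean:\n        rm -f x\n"
    print(space_indented_lines(sample))
```

To apply it to the file named in the question, read /htk-3.4.1/HLMTools/Makefile into a string and pass it to the function; any reported line can then be re-indented with a single TAB.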

Converting from one MFCC type to another - HTK

北战南征 submitted on 2019-12-13 02:10:21
Question: I am working with the HTK toolkit on a word-spotting task and have a classic mismatch between training and test data. The training data consisted only of "clean" data (recorded over a microphone). The data was converted to MFCC_E_D_A parameters, which were then modelled by phone-level HMMs. My test data was recorded over landline and mobile phone channels (introducing distortions and the like). Using the MFCC_E_D_A parameters with HVite results in incorrect output. I want to make use of cepstral mean
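The excerpt breaks off at "cepstral mean". Assuming it refers to cepstral mean normalization, the idea is that a fixed channel effect shows up as a roughly constant offset on each cepstral coefficient, so subtracting the per-utterance mean of every coefficient reduces the mismatch. The sketch below illustrates only the arithmetic on a list of frames; the function name is hypothetical and it is not HTK's own implementation. Within HTK, the `_Z` qualifier on the parameter kind (for example MFCC_E_D_A_Z) requests zero-mean static coefficients.

```python
def cepstral_mean_normalize(frames):
    """Subtract the per-coefficient mean, taken over all frames of one utterance.

    `frames` is a list of equal-length lists of floats (one list per frame).
    Hypothetical helper for illustration only.
    """
    if not frames:
        return []
    count = len(frames)
    dims = len(frames[0])
    # Mean of each coefficient across the whole utterance.
    means = [sum(frame[d] for frame in frames) / count for d in range(dims)]
    return [[frame[d] - means[d] for d in range(dims)] for frame in frames]


if __name__ == "__main__":
    # Two made-up frames with two coefficients each.
    print(cepstral_mean_normalize([[1.0, 10.0], [3.0, 14.0]]))
```

For the normalization to help, the same treatment would have to be applied to both training and test features, which in this question would mean re-coding the training data as well.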

Process audio packets decoded with ffmpeg

白昼怎懂夜的黑 submitted on 2019-12-11 12:46:45
Question: Following my other post, I am wondering whether it is possible to do some processing, such as MFCC extraction, on the decoded audio packets. The code I use decodes audio and video from an MPEG-2 file using ffmpeg. Video processing is done with OpenCV, as this library allows frames to be grabbed one by one. I need to process the corresponding audio samples at the same time. Thanks. Answer 1: I've created a C++ audio engine named "Crosstalk". Although it's referred to as an "audio engine", it's really just a real-time C++

A problem with forced alignment in speech recognition - HTK

删除回忆录丶 submitted on 2019-12-06 14:15:57
Question: I have a system where a user is asked to repeat a sentence after a prompt. It uses HTK to force-align the sentence spoken by the user to the predefined word-level label file (of the sentence) to get a time-aligned phone-level file. The HMMs have been trained on a large amount of data and give very accurate time-aligned files with HVite. My problem arises when the user does not speak the exact sentence that is required. Let me illustrate with an example: Word level label file of the

“fatal error: bits/libc-header-start.h: No such file or directory” while compiling HTK

三世轮回 submitted on 2019-11-29 04:06:01
I'm getting the following issue when trying to run make on the HTK library:
    (cd HTKLib && make HTKLib.a) \
      || case "" in *k*) fail=yes;; *) exit 1;; esac;
    make[1]: Entering directory '/home/william/speech/htk/HTK-3.4.1/htk/HTKLib'
    gcc -m32 -ansi -D_SVID_SOURCE -DOSS_AUDIO -D'ARCH="x86_64"' -Wall -Wno-switch -g -O2 -I. -DPHNALG -c -o HGraf.o HGraf.c
    In file included from HShell.h:40:0,
                     from HGraf.c:54:
    /usr/include/stdio.h:27:10: fatal error: bits/libc-header-start.h: No such file or directory
     #include <bits/libc-header-start.h>
              ^~~~~~~~~~~~~~~~~~~~~~~~~~
    compilation terminated.
    <builtin>:

Problems downloading and installing the HTK tools

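我只是一个虾纸丫 submitted on 2019-11-28 07:25:15
Recently I have been working on ASR optimization at my company and used HTK to measure how often the ASR makes errors, in order to choose a better ASR service and make some optimizations. However, I ran into quite a few pitfalls while installing HTK. HTK's official website: http://htk.eng.cam.ac.uk/ After downloading, create a new directory named htk on drive D and extract the archive into it. Make sure the Path environment variable contains C:\Program Files\Microsoft Visual Studio\VC98\bin, adjusting the path to match the directory where VC is installed. Open a cmd command window. In the command window, enter cd htk to go into the htk folder, then create a new folder named bin.win32 inside it by entering: mkdir bin.win32 Then run vcvars32 in the DOS window. Steps for resolving "vcvars32 is not an internal command" are reposted from: https://www.cnblogs.com/mingzhao810/archive/2012/08/03/2617674.html If errors still occur after that, such as: fatal error U1077: "cl.exe" return code 0x1; stdio.h: No such file or directory; fatal error C1083: Cannot open include file: "stdio.h"; then don't worry, there is a solution for these as well. Click the link: https://blog.csdn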