Synchronizing text and audio. Is there a NLP/speech-to-text library to do this?

后端未结

关注

 1  1269

I would like to synchronize a spoken recording against a known text. Is there a speech-to-text / natural language processing library that would facilitate this? I imagine I\'d

相关标签:

1条回答

爱一瞬间的悲伤

2021-01-31 22:32

Forced Alignment

It sounds like you want to do forced alignment between your audio and the known text.

Pretty much all research/industry grade speech recognition systems will be able to do this, since forced alignment is an important part of training a recognition system on data that doesn't have phone level alignments between the audio and the transcript.

Alignment CMUSphinx

The Sphinx4-1.0 beta 5 release of CMU's open source speech recognition system now includes a demo on how to do alignment between a transcript and long speech recordings.

0 讨论(0)
发布评论:

提交评论
- 加载中...