Synchronizing text and audio. Is there a NLP/speech-to-text library to do this?

后端 未结 1 1269
忘了有多久
忘了有多久 2021-01-31 22:14

I would like to synchronize a spoken recording against a known text. Is there a speech-to-text / natural language processing library that would facilitate this? I imagine I\'d

相关标签:
1条回答
  • 2021-01-31 22:32

    Forced Alignment

    It sounds like you want to do forced alignment between your audio and the known text.

    Pretty much all research/industry grade speech recognition systems will be able to do this, since forced alignment is an important part of training a recognition system on data that doesn't have phone level alignments between the audio and the transcript.

    Alignment CMUSphinx

    The Sphinx4-1.0 beta 5 release of CMU's open source speech recognition system now includes a demo on how to do alignment between a transcript and long speech recordings.

    0 讨论(0)
提交回复
热议问题