Is there a fast way to find (not necessarily recognize) human speech in an audio file?

感动是毒 2021-02-01 08:59

I want to write a program that automatically syncs out-of-sync subtitles. One solution I thought of is to algorithmically find where human speech occurs in the audio and adjust the subtitle timings to match.

3 answers
  •  梦毁少年i
    2021-02-01 09:38

    The technical term for what you are trying to do is Voice Activity Detection (VAD). There is a Python library called SPEAR that does it (among other things).
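To give a rough idea of what VAD involves, here is a minimal energy-based sketch in plain Python. It is not SPEAR's API, and the function names, the 20 ms frame size, and the 0.1 threshold are all assumptions picked for illustration. It also flags any loud frame, not specifically speech, so a real detector would need more than this.

```python
import math


def frame_energies(samples, frame_len):
    """Return the RMS energy of each full, non-overlapping frame."""
    energies = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        energies.append(math.sqrt(sum(x * x for x in frame) / frame_len))
    return energies


def detect_active_segments(samples, sample_rate, frame_ms=20, threshold=0.1):
    """Return (start_sec, end_sec) spans whose frame RMS exceeds threshold.

    frame_ms and threshold are illustrative defaults (assumptions),
    not values taken from any particular library.
    """
    frame_len = max(1, int(sample_rate * frame_ms / 1000))
    energies = frame_energies(samples, frame_len)
    segments = []
    seg_start = None  # index of the first frame of the current active run
    for i, energy in enumerate(energies):
        active = energy > threshold
        if active and seg_start is None:
            seg_start = i
        elif not active and seg_start is not None:
            segments.append((seg_start * frame_len / sample_rate,
                             i * frame_len / sample_rate))
            seg_start = None
    if seg_start is not None:
        # The signal ended while still active; close the last segment.
        segments.append((seg_start * frame_len / sample_rate,
                         len(energies) * frame_len / sample_rate))
    return segments


# Synthetic demo: 1 s of silence, 1 s of a 100 Hz tone, 1 s of silence.
sample_rate = 1000
silence = [0.0] * sample_rate
tone = [0.5 * math.sin(2 * math.pi * 100 * n / sample_rate)
        for n in range(sample_rate)]
signal = silence + tone + silence
segments = detect_active_segments(signal, sample_rate)
print(segments)
```

For subtitle syncing, the detected start times could then be compared with the subtitle start times to estimate an offset. The samples here are assumed to be floats in the range -1.0 to 1.0, so decoded audio would need to be normalized first.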
