Open source code for voice detection and discrimination

囚心锁ツ 2021-01-31 17:49

I have 15 audio tapes, one of which I believe contains an old recording of my grandmother and myself talking. A quick attempt to find the right place didn't turn it up. I don

8 answers
  •  执笔经年
    2021-01-31 18:53

    You could also try pyAudioAnalysis to:

    1. Silence removal:

        from pyAudioAnalysis import audioBasicIO as aIO
        from pyAudioAnalysis import audioSegmentation as aS
        [Fs, x] = aIO.readAudioFile("data/recording1.wav")
        segments = aS.silenceRemoval(x, Fs, 0.020, 0.020, smoothWindow = 1.0, Weight = 0.3, plot = True)

    segments contains the endpoints of the non-silence segments.

    2. Classification (speech vs. music discrimination): pyAudioAnalysis also includes pretrained classifiers, which can be used to classify unknown segments as either speech or music.
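Once you have the non-silence endpoints, a small helper can turn them into tape positions worth listening to. The following is only a minimal sketch in plain Python: it assumes `segments` is a list of `[start, end]` pairs in seconds, and the names `merge_segments`, `format_timestamp`, `long_segments`, as well as the `max_gap` and `min_duration` thresholds, are hypothetical and not part of pyAudioAnalysis.

```python
def merge_segments(segments, max_gap=0.5):
    """Merge [start, end] pairs (seconds) whose gap is at most max_gap seconds."""
    merged = []
    for start, end in sorted(segments):
        if merged and start - merged[-1][1] <= max_gap:
            # Close enough to the previous segment: extend it.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return merged


def format_timestamp(seconds):
    """Render a position in seconds as mm:ss (fractions are truncated)."""
    minutes, secs = divmod(int(seconds), 60)
    return f"{minutes:02d}:{secs:02d}"


def long_segments(segments, min_duration=5.0, max_gap=0.5):
    """Return (start, end) timestamps of merged segments lasting at least min_duration."""
    return [
        (format_timestamp(start), format_timestamp(end))
        for start, end in merge_segments(segments, max_gap)
        if end - start >= min_duration
    ]


if __name__ == "__main__":
    # Hypothetical endpoints, standing in for the output of the silence removal step.
    example = [[0.4, 1.2], [1.5, 9.0], [30.0, 31.0], [65.0, 80.5]]
    for start, end in long_segments(example):
        print(f"listen from {start} to {end}")
```

Short bursts of sound are dropped and nearby segments are joined, so only longer stretches of continuous audio are left to check by ear. The thresholds are guesses and would need tuning for real tapes.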
