Gender detection of the speaker from wave data of the audio

前端 未结 2 564
星月不相逢
星月不相逢 2020-12-19 17:17

I would like to add a gender detection capability to a news video translator app I\'m working on, so that the app can switch between male and female voice according to the v

相关标签:
2条回答
  • 2020-12-19 17:29

    Accurate gender identification can be implemented with training a GMM classifier on MFCC features of male and female. Here is how one can go about it.

    1. One needs to collect training set for each of the gender.
    2. Extract MFCCs features from all the audios of respective gender(One can find python implementation like scikit-talkbox etc).
    3. Train GMM models for both the gender on the extracted features from their training set audios.

    For details, Here is an open source python implementation of the same. The following tutorials evaluates the code on subset extracted from Google's AudioSet which is released this year (2017)

    https://appliedmachinelearning.wordpress.com/2017/06/14/voice-gender-detection-using-gmms-a-python-primer/

    0 讨论(0)
  • 2020-12-19 17:51

    Accurate gender identification can be implemented with GMM classifier of MFCC features. You can read about it here:

    AGE AND GENDER RECOGNITION FOR TELEPHONE APPLICATIONS BASED ON GMM SUPERVECTORS AND SUPPORT VECTOR MACHINES

    To the date I am not aware of open source implementation of this, though many components are available in open source speech recognition toolkits like CMUSphinx.

    0 讨论(0)
提交回复
热议问题