Gender detection of the speaker from wave data of the audio

前端未结

关注

 2  564

I would like to add a gender detection capability to a news video translator app I\'m working on, so that the app can switch between male and female voice according to the v

相关标签:

2条回答

抹茶落季

2020-12-19 17:29
Accurate gender identification can be implemented with training a GMM classifier on MFCC features of male and female. Here is how one can go about it.
1. One needs to collect training set for each of the gender.
2. Extract MFCCs features from all the audios of respective gender(One can find python implementation like scikit-talkbox etc).
3. Train GMM models for both the gender on the extracted features from their training set audios.
For details, Here is an open source python implementation of the same. The following tutorials evaluates the code on subset extracted from Google's AudioSet which is released this year (2017)

https://appliedmachinelearning.wordpress.com/2017/06/14/voice-gender-detection-using-gmms-a-python-primer/
0 讨论(0)
发布评论:

提交评论
- 加载中...
谎友^

2020-12-19 17:51

Accurate gender identification can be implemented with GMM classifier of MFCC features. You can read about it here:

AGE AND GENDER RECOGNITION FOR TELEPHONE APPLICATIONS BASED ON GMM SUPERVECTORS AND SUPPORT VECTOR MACHINES

To the date I am not aware of open source implementation of this, though many components are available in open source speech recognition toolkits like CMUSphinx.

0 讨论(0)
发布评论:

提交评论
- 加载中...