Audio Matching (Audio Fingerprinting)

萝らか妹 提交于 2019-12-01 17:44:34

If you are aiming to compare an old recording of a user with a new call as it comes in, audio fingerprinting solutions like Dejavu in Python on a server or Echoprint in C++ won't help you. They are for doing recognition and retrieval on recorded audio segments plus noise. They cannot deal with the variabilites in human voice. See an explanation here.

If that's the case, what you are referring to is speaker recognition, which is much harder and involves quite a bit of machine learning. It would be tough to do this for a large corpus of users (especially offline on a phone), but for determining between a couple users, it might be doable.

Below is a good Library. Which is Easy to use. But you need to convert your Audio Files to Wave Format prior to this.

https://code.google.com/p/musicg/

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!