I need to know when a person started and stopped speaking, based on the microphone data. I already have this fully working in Python with audio from my laptop using Python sou