When the code below is executed, the prompt text is spoken twice. From the output audio, it seems the first is generated by the on-device TTS and the second b