I use Azure\'s Microsoft.CognitiveServices.Speech service to caption live stream video.
For what I\'ve experienced, it can take up to 40 seconds from the moment the a