Job description
We care about two modifications:
- Changing the speech recognition engine from google STT to others - asynchronous communication - redirecting the audio stream via websocket. I have sample scripts
- Adding an action after an event: if the right phrase is recognized then start streaming the audio file, all the while having the speech recognition engine active in the background