Thank you for the product feedback! We’ve been exploring adding features like Streamed responses back from the Whisper API, as it’s transcribing large files. We don’t have an ETA on when it’ll be launched though.
this would be absolutely killer for realtime audio processing agents. If i could upload an audio stream and then begin downloading the output of whisper at the same time. You could build a voip agent that starts tool calling the second a specific keyword leaves a persons’ mouth. please make this!