Chunked Audio Upload for Speech-to-Text Processing

Question

Hello, I have a feature request:Chunked Audio Upload for Speech-to-Text ProcessingCurrent Limitation:The API currently requires the full audio file to be uploaded before processing, leading to increased latency.Proposed Feature:Add an API endpoint to support chunked audio uploads during recording, allowing processing to begin as audio is being sent.Benefit: Reduces upload latency, improving user experience. This feature would significantly improve real-time processing capabilities and user satisfaction. Please provide feedback on this feature request.Thanks!

yawnxyz · Answer

Hi there,Thank you for the product feedback! We’ve been exploring adding features like Streamed responses back from the Whisper API, as it’s transcribing large files. We don’t have an ETA on when it’ll be launched though.Best,Jan

Reply

Sign up

Login to the community

Scanning file for viruses.

This file cannot be downloaded