I was checking out the pricing page and noticed ASR (Automatic Speech Recognition) and TTS (Text-to-Speech) listed, but I couldn’t find any information on the real-time service that '@phoney ai' is using through Groq. Has anyone here come across this service or knows more about what Groq offers in terms of real-time capabilities? Any help or insights would be greatly appreciated!
We have Whisper transcription models, and a TTS model as separate model endpoints. Phonely builds on top of Whisper and adds their own coordination/infrastructure on top of it. Unfortunately some of that is their secret sauce, and we don’t know how they’ve implemented it.