We’d love to hear what kinds of models you’d like to see on Groq — from coding models to text-to-speech and speech-to-text, to embeddings, diffusion, and other model types.
While we can’t accommodate everyone’s wishes, we’d like to keep the conversation going here with new model drops, benchmarks, and performance updates in this thread!
Considering cost-benefit, I’d say for growing applications: (excellent pricing and good quality) or https://murf.ai/ (good price, better quality). There are others, but for heavy usage I don’t think anyone beats those two on cost-benefit.
We are on the lookout for an elastic, scalable environment for our speech-to-text inference workloads. We use the NB-Whisper models from NbAiLab ( NB-Whisper - a NbAiLab Collection ) — based on OpenAI’s Whisper and further trained on Norwegian speech data.
We are currently serving these from a dedicated H100 environment. Due to client consumption growth we are looking for a more compute-efficient solution, and we would love to try out Groq. But we are entirely dependent on these Norwegian-specific models, and would like to host them from the Nordics (read: Helsinki).
That’s really interesting — I didn’t know about a Norwegian-specific Whisper! This is probably too niche for us to host in the immediate future, but as a fellow Nordic (Swede!) it’s really cool to see a Norwegian-tuned Whisper. Probably every language (and dialect) will have its own speech-to-text model in the future!
I brought it up before in another thread but for the sake of visibility, I’ll post it again here.
I’m desperately missing EU-compliant VLMs (and non-reasoning LLMs), and I’m also desperately missing capable SLMs with very low latency and high throughput. I’m currently still on Llama3.1-8B, but it is apparently slower than the Llama4 series, and it also does not produce JSON-structured output reliably.
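To illustrate the JSON reliability issue: when a small model won’t consistently emit clean JSON, a common client-side workaround is to post-process the reply before parsing. This is a minimal sketch of such a helper — `extract_json` is my own illustrative function, not part of any Groq SDK — that handles replies wrapped in markdown fences or surrounding prose.

```python
import json

def extract_json(reply: str):
    """Best-effort extraction of a JSON object from a model reply.

    Models that don't reliably honour JSON mode often wrap the object in
    markdown fences or prose, so on a parse failure we fall back to
    slicing between the first '{' and the last '}'.
    """
    try:
        return json.loads(reply)
    except json.JSONDecodeError:
        start, end = reply.find("{"), reply.rfind("}")
        if start != -1 and end > start:
            return json.loads(reply[start:end + 1])
        raise

# Works on a clean reply and on a fenced one:
print(extract_json('{"ok": true}'))                  # → {'ok': True}
print(extract_json('```json\n{"ok": true}\n```'))    # → {'ok': True}
```

Of course, this only papers over the problem — a model with dependable structured-output support removes the need for such fallbacks entirely.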
I therefore suggest the Mistral and/or Gemma3 family of models. A combination of Mistral Small 3 and Mistral Medium 3 should be rock solid.