What models do you want to see on Groq?

An LLM, text-to-text only, with high rate limits for that model.

Please add GLM 4.7, it would be great on your fast hardware.

1 Like

We’re all really desperate for a new model on Groq. Top choices right now:
- GLM 4.7
- Kimi K2.5

Pleeeeeeeease :slight_smile:

8 Likes

I would love to see Kimi k2.5!!

6 Likes

Definitely looking forward to Kimi K2.5 to test agent swarms.

6 Likes

As some others have indicated, additional/advanced ASR models would be really fantastic—especially those which can handle diarization (i.e. properly splitting out multiple speakers) and timestamps.

Models which seem most promising:

Kimi K2.5 please. Would love to see this model running.

2 Likes

Kimi K2.5 is absolutely needed, or any DeepSeek chat model.

1 Like

Allow people to host and run their own models, then charge per token as usual. Why wait for people to tell you what they want when they can do it themselves?

Kimi K2.5 please! No new models have been added in six months! Please add the latest Kimi models.

Really all of the big ones, like Kimi, GLM, DeepSeek, Qwen, plus the best OCR, TTS, and STT models.

Open-source AI is flourishing and you are lagging badly behind, after a very good start.

Mistral's newer models, Codestral and Devstral, are pretty good and a great value.
DeepSeek 3.2, and 4 when it comes out. GLM 4.5, 4.7, and 5 are all really good for the price, and with a proper harness they're truly impressive.
MiniMax 2.5.

I would appreciate new models from Hugging Face like Kimi K2.5, MiniMax M2.5, etc.

Kimi K2.5 is good, though.

2 Likes

Title: Add Moshi / J-Moshi speech-to-speech model support

What problem are you trying to solve?
Real-time full-duplex voice conversation with low latency (~200ms)

What would you like instead?
Support for Kyutai’s Moshi (kyutai/moshika-pytorch-bf16)
and J-Moshi (nu-dialogue/j-moshi-ext) for Japanese voice dialogue

Any workarounds you’re using now?
Running locally on 24GB GPU or Colab L4, which is expensive

Anything else we should know?
Moshi is Apache-2.0 licensed, has 7B parameters, and fits Groq's LPUs well

Qwen TTS, Kimi 2.5 would be amazing.

2 Likes

Kimi k2.5 please and soon :rocket: :rocket: :rocket:

2 Likes

I would like a coding-capable model. For instance, I can't use Groq inside my Kilocode VS Code extension, because neither GPT-OSS, Kimi, nor Qwen works well with it.
There are probably open-source code-oriented models on Hugging Face.

Among the models, there must be one with vision! It would be a huge oversight if there weren’t.

1 Like

It would be really nice if you could add the models from the latest Qwen3.5 model family, like Qwen3.5-35B-A3B. They are rather small and MoE-based, which guarantees low latency, but they are reported to perform better than gpt-oss-120b and even larger models. They have built-in reasoning, which can be disabled. So these models could outperform and potentially even replace the current gpt-oss-120b and gpt-oss-20b models.

Also, desperately waiting for the Qwen-Embedding-8B model to be deployed, since as far as I know there are no decent production-grade low-latency deployments of this or similar models from other cloud providers. And such a model is crucial for low-latency RAG systems such as voice assistants and others.
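To illustrate the retrieval step such an embedding model would enable: a minimal, self-contained sketch of embedding-based ranking. Note that `embed()` here is a hypothetical placeholder returning toy vectors so the logic runs as-is; a real deployment would call whatever embeddings endpoint eventually serves Qwen-Embedding-8B, and the latency of that call is exactly what makes it viable (or not) for voice assistants.

```python
import math

def embed(text: str) -> list[float]:
    # Placeholder for a real embeddings API call (hypothetical endpoint).
    # Returns a deterministic toy 8-dim vector derived from the text.
    seed = sum(ord(c) for c in text)
    return [((seed * k) % 97) / 97.0 for k in range(1, 9)]

def cosine(a: list[float], b: list[float]) -> float:
    # Standard cosine similarity between two equal-length vectors.
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

# Documents are embedded once, offline; only the query is embedded per request,
# which is why per-call embedding latency dominates RAG response time.
docs = ["reset your password", "billing and invoices", "voice assistant setup"]
doc_vecs = [embed(d) for d in docs]

query_vec = embed("how do I set up the voice assistant")
best = max(range(len(docs)), key=lambda i: cosine(query_vec, doc_vecs[i]))
print(docs[best])
```

With toy vectors the ranking itself is meaningless; the point is the shape of the pipeline: one embedding call per query on the hot path, so a sub-100ms embedding model is what keeps the whole voice-assistant loop responsive.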

1 Like