What models do you want to see on Groq?

latest kimi or glm or qwen coder or minimax? i understand that k2.5 is probably the most challenging one due to its sheer size

2 Likes

A model with speaker diarization could be one of the greatest additions.

I would like to see Qwen 2.5 72B

1 Like

I have sadly had to stop using Groq due to the lack of progress on the available models. Inference speed can’t make up for the lack of a model’s basic abilities. And as of today, almost every model Groq offers is 2 to 3 generations behind what is actually usable and worth paying for: Qwen 3.5, GLM 5, Kimi K2.5, or any other models that rank near or above Opus 4.5.

2 Likes

I just want you to upgrade the Whisper model; it quite often misidentifies timestamps and speakers, which I find confusing, while other providers do a pretty good job.

Gemma-4 Please!!! :heart:

5 Likes

@yawnxyz Gemma 4 is something we are seriously looking into!

3 Likes

Gemma 4 + kimi 2.5 plsssssssss

1 Like

Would love to see Google’s new Gemma 4 models!

Gemma 4, Kimi K2.5, Minimax 2.7 (if they are open-sourced)

Would love to see Gemma 4 31B on Groq! Its extended context window is a game-changer for managing the load of complex, multi-tool pipelines and deep skill chaining.

1 Like

Please consider any Mercury model from Inception Labs. Based on an approximate calculation, I estimated it could achieve about 3,200 tokens per second. I just want to clarify the meaning of the term “ridiculously fast.”

Gemma 4. It’s about time. I don’t want to use Together AI or Cerebras. Groq is the best and fastest.

I don’t feel like they’re actually interested anymore: Kimi K2 removed, Minimax 2.5 locked behind the enterprise plan, Qwen3 32B somehow also locked behind the enterprise plan.

I understand the challenge that the behemoth Kimi K2/K2.5 poses on Groq hardware, but the Minimax M2.5 and Qwen 3.5/3.6 models (aside from the 397B one, maybe) should very much be doable?