Prompt Caching

Caching is currently enabled for Kimi-K2 and gpt-oss-20b according to Prompt Caching - GroqDocs

Can you roll out support for all models please? We’re using qwen3-32b a lot, and caching would be a huge improvement

Yep, we’re rolling it out for more and more models!

Do you have a timeline for the rollout to more models? We’d love to get it on gpt-oss-120b ASAP too

We’re doing rolling testing and releasing to more models currently; it should be soon, but I don’t have an exact rollout timeline.

Caching isn’t currently working for gpt-oss-20b; it works fine on the Moonshot model:

import time

from groq import Groq          # official Groq Python SDK; reads GROQ_API_KEY from the environment
from lorem_text import lorem   # assuming the lorem-text package for lorem.words(); swap in your own filler text if needed

client = Groq()

def request():
    # Long system prompt (the cacheable prefix), rebuilt on every call
    system_prompt = f"""
    You are a legal expert AI assistant. Analyze the following legal document and provide detailed insights.\n\nLEGAL DOCUMENT:\n{lorem.words(1000)}
    """

    first_analysis = client.chat.completions.create(
        messages=[
            {"role": "system", "content": system_prompt},
            {
                "role": "user",
                "content": "What are the key provisions regarding user account termination in this agreement?",
            },
        ],
        model="openai/gpt-oss-20b",
        max_tokens=1,
    )

    print("Usage:", first_analysis.usage)
    time.sleep(5)

if __name__ == "__main__":
    for _ in range(2):
        request()

This prints:

Usage: CompletionUsage(completion_tokens=1, prompt_tokens=1548, total_tokens=1549, completion_time=0.001701566, prompt_time=0.181845881, queue_time=0.046640008, total_time=0.183547447)
Usage: CompletionUsage(completion_tokens=1, prompt_tokens=1554, total_tokens=1555, completion_time=0.001011545, prompt_time=0.090511759, queue_time=0.044814801, total_time=0.091523304)

i.e. no cached tokens
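
For what it’s worth, this is roughly how I’m checking for a cache hit, assuming Groq mirrors the OpenAI-style usage.prompt_tokens_details.cached_tokens field (if hits are reported somewhere else, happy to be corrected):

def cached_tokens(usage) -> int:
    # Best-effort read of the cached-token count from a CompletionUsage object.
    # Assumes an OpenAI-style prompt_tokens_details.cached_tokens field; returns 0
    # if the field is missing or empty.
    details = getattr(usage, "prompt_tokens_details", None)
    if details is None:
        return 0
    return getattr(details, "cached_tokens", None) or 0

# e.g. inside request(): print("Cached tokens:", cached_tokens(first_analysis.usage))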


Hi!

We try our best to maximize cache hits, but caching isn’t guaranteed on subsequent requests because of our internal routing (which minimizes latency). This is especially true for smaller models: they run on more instances, caching isn’t shared between instances, so it’s likely you’ll hit a different instance between requests.

We’re constantly working on improving the cache hit rate, and we appreciate your feedback!
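
One thing that gives each request the best chance of a hit is building the long static prefix once and sending it byte-for-byte identical on every request, varying only the short user message at the end. A minimal sketch along those lines (reusing the same Groq client and lorem-text helper as in your repro; adjust for your setup):

import time

from groq import Groq
from lorem_text import lorem  # same filler-text helper as in the repro above

client = Groq()

# Generated once, outside the request loop, so the cacheable prefix never changes
SYSTEM_PROMPT = (
    "You are a legal expert AI assistant. Analyze the following legal document "
    "and provide detailed insights.\n\nLEGAL DOCUMENT:\n" + lorem.words(1000)
)

questions = [
    "What are the key provisions regarding user account termination?",
    "Which clauses cover limitation of liability?",
]

for question in questions:
    response = client.chat.completions.create(
        messages=[
            {"role": "system", "content": SYSTEM_PROMPT},
            {"role": "user", "content": question},
        ],
        model="openai/gpt-oss-20b",
        max_tokens=1,
    )
    print("Usage:", response.usage)
    time.sleep(5)

Even with an identical prefix a hit still isn’t guaranteed (routing may land you on a different instance), but it removes prompt drift as a variable.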