How do I use the 10M context window? Rate limit issue

Unfortunately, Llama Scout currently runs with only a 131k-token context window; as far as I know, no provider serves it with the full 10M-token window. If you'd still like to fit more full-context-window requests in per minute, consider setting "service_tier": "flex" or requesting a rate limit increase through Chat With Us.
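As a minimal sketch of what setting the tier looks like, here is a hypothetical OpenAI-compatible chat-completions payload with `"service_tier": "flex"`. The model name and token limits are placeholders, and whether your provider supports a flex tier (and what it trades off) is an assumption, so check their docs first:

```python
import json

# Hypothetical request body for an OpenAI-compatible /chat/completions endpoint.
# "service_tier": "flex" is assumed to trade some latency for more generous
# rate limits; verify the exact tier names your provider accepts.
payload = {
    "model": "llama-4-scout",      # placeholder model identifier
    "service_tier": "flex",        # opt in to the flex tier instead of the default
    "messages": [
        {"role": "user", "content": "Summarize this document."},
    ],
    # Stay within the provider's advertised window (131k tokens here).
    "max_tokens": 1024,
}

print(json.dumps(payload, indent=2))
```

You would POST this body to the provider's chat-completions URL with your usual auth headers; everything else about the request stays the same.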