Rate limits apply at the organization level and are enforced as both requests per minute (RPM) and tokens per minute (TPM); exceeding either one returns a 429 error. Handle 429 responses by retrying with exponential backoff.
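As a rough illustration of the backoff advice, here is a minimal Python sketch using only the standard library. Everything named here is hypothetical and not part of any SDK: `RateLimitError` stands in for whatever exception your HTTP client raises on a 429, and `call_with_backoff` and `backoff_delay` are helper names invented for this example. It also assumes the server may send a `Retry-After` value; if it does not, the code falls back to a randomized ("full jitter") exponential delay.

```python
import random
import time


class RateLimitError(Exception):
    """Hypothetical stand-in for the error your client raises on HTTP 429."""

    def __init__(self, retry_after=None):
        super().__init__("429 Too Many Requests")
        # Seconds to wait, if the response carried a Retry-After value (assumption).
        self.retry_after = retry_after


def backoff_delay(attempt, base=1.0, cap=60.0):
    """Upper bound on the wait before retry `attempt` (0-based): base * 2**attempt, capped."""
    return min(cap, base * (2 ** attempt))


def call_with_backoff(fn, max_retries=5, base=1.0, cap=60.0, sleep=time.sleep):
    """Call fn(); on RateLimitError, wait and retry up to max_retries times."""
    for attempt in range(max_retries + 1):
        try:
            return fn()
        except RateLimitError as err:
            if attempt == max_retries:
                raise  # out of retries, surface the 429 to the caller
            if err.retry_after is not None:
                wait = err.retry_after  # prefer the server's hint when present
            else:
                # Full jitter: pick a random wait in [0, exponential bound]
                wait = random.uniform(0, backoff_delay(attempt, base, cap))
            sleep(wait)
```

To use it, wrap the request in a zero-argument callable, for example `call_with_backoff(lambda: send_request(payload))`, where `send_request` is your own function that raises `RateLimitError` on a 429. The jitter spreads retries out so that several workers in the same organization do not all retry at the same instant and hit the shared limit again.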