Skip to main content

Rate limits act as control measures to regulate how frequently users and applications can access our API within specified timeframes. These limits help ensure service stability, fair access, and protection against misuse so that we can serve reliable and fast inference for all. We offer a generous Free Tier, a Developer Tier for 10X higher token consumption, and Enterprise plans for custom capacity needs. Rate limits apply at the organization level, not at the individual user level. Rate limits are also independent by model. More information on rate limits can be found here: https://console.groq.com/docs/rate-limits You can view your current rate limits in your account settings here: https://console.groq.com/dashboard/limits

Be the first to reply!

Reply