Rate limits act as control measures to regulate how frequently users and applications can access our API within specified timeframes. These limits help ensure service stability, fair access, and protection against misuse so that we can serve reliable and fast inference for all. We offer a generous Free Tier, a Developer Tier for 10X higher token consumption, and Enterprise plans for custom capacity needs. Rate limits apply at the organization level, not at the individual user level. Rate limits are also independent by model. More information on rate limits can be found here: https://console.groq.com/docs/rate-limits You can view your current rate limits in your account settings here: https://console.groq.com/dashboard/limits
Be the first to reply!
Reply
Login to the community
No account yet? Create an account
Enter your E-mail address. We'll send you an e-mail with instructions to reset your password.