How do I handle rate limits?

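Rate limits are enforced at the organization level, with separate caps on requests per minute (RPM) and tokens per minute (TPM). A request that exceeds a limit fails with a 429 error; retry it with exponential backoff, waiting longer after each failed attempt, rather than retrying immediately.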
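
As a rough illustration of retrying with exponential backoff, here is a minimal Python sketch. It is not tied to any particular client library: `RateLimitError`, `call_with_backoff`, and all parameter names and defaults are hypothetical, and `request_fn` stands in for whatever function sends your request and raises on a 429 response. Random jitter is added as a common refinement so that many clients do not retry at the same instant.

```python
import random
import time


class RateLimitError(Exception):
    """Hypothetical exception standing in for an HTTP 429 response."""


def call_with_backoff(request_fn, max_retries=5, base_delay=1.0,
                      max_delay=60.0, sleep=time.sleep):
    """Call request_fn, retrying on RateLimitError with exponential backoff.

    All names and defaults here are illustrative assumptions, not part of
    any real API. `sleep` is injectable so the function can be tested
    without actually waiting.
    """
    for attempt in range(max_retries + 1):
        try:
            return request_fn()
        except RateLimitError:
            if attempt == max_retries:
                raise  # out of retries; surface the error to the caller
            # Upper bound doubles each attempt (1s, 2s, 4s, ...), capped.
            delay = min(max_delay, base_delay * 2 ** attempt)
            # Full jitter: wait a random amount between 0 and that bound.
            sleep(random.uniform(0, delay))
```

To use it, wrap the call that sends the request, for example `call_with_backoff(lambda: send_request(payload))`, where `send_request` is your own function that raises `RateLimitError` when it sees a 429. If the response includes a `Retry-After` header, honoring that value instead of the computed delay is usually preferable.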