I’ll dig into oss-120b perf degradation, sorry about that. Our usage can sometimes be spiky which causes requests to be queued up, but I’m checking if there was any performance degradation in the model.
For a quick check, could you pull up Metrics - Dashboard - GroqCloud to see if you have a larger # of request failures?