[Issue] Tool Calling Failures on Groq LLMs via Pydantic-AI (OpenAI baseline vs Groq)

kinyugo · September 25, 2025, 6:52pm

As a follow-up to my earlier post, I wanted to share one more trace and a reproducible notebook.

Here’s another example from Logfire:

To make this easier to verify, I’ve also prepared a notebook that reproduces the runs I described.

Curious if anyone else has tested this model specifically, or if there are known workarounds to improve tool call reliability on Groq.

Topic		Replies	Views
Tool calling errors on both gpt oss models Forum	12	2149	December 27, 2025
Gpt-oss:20b calling tools unasked Forum	12	572	December 1, 2025
Gpt-oss-120b ignoring tools Forum	53	3390	April 24, 2026
Parallel Tool Use with Groq API Tutorials	3	762	March 2, 2026
Allow no output parsing of message content (return raw response only) Feature Requests	6	210	December 1, 2025