LLM emits function calls as inline markup in natural language response

We’re integrating a Groq-hosted LLM into a real-time voice agent pipeline (LiveKit Agents). We’ve observed that the model sometimes emits tool calls as inline XML-style markup embedded in normal conversational text, for example:

… Let me check that for you. <function=get_service>{…}

This creates issues for streaming / TTS-first systems, because the surrounding text is spoken before the tool executes, and the tool call cannot be cleanly separated without custom parsing and suppression logic.
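For context, our interim workaround is roughly the following kind of parsing/suppression sketch (the `<function=name>{...}` tag shape is assumed from the observed output above; function and variable names are illustrative, not part of any Groq or LiveKit API):

```python
import re

# Matches inline tool-call markup of the observed shape:
#   <function=get_service>{"query": "..."}
# Non-greedy brace match; assumes the JSON args contain no nested objects.
TOOL_CALL_RE = re.compile(r"<function=(?P<name>\w+)>(?P<args>\{.*?\})", re.DOTALL)

def split_text_and_tool_calls(chunk: str) -> tuple[str, list[tuple[str, str]]]:
    """Separate speakable text from embedded tool-call markup.

    Returns (clean_text, [(tool_name, raw_json_args), ...]) so the text
    can go to TTS while the calls are routed to tool execution.
    """
    calls = [(m.group("name"), m.group("args")) for m in TOOL_CALL_RE.finditer(chunk)]
    clean = TOOL_CALL_RE.sub("", chunk).strip()
    return clean, calls

text, calls = split_text_and_tool_calls(
    'Let me check that for you. <function=get_service>{"query": "status"}'
)
# text  -> 'Let me check that for you.'
# calls -> [('get_service', '{"query": "status"}')]
```

This only works per accumulated chunk; in a true token stream the markup can be split across deltas, so we would additionally have to buffer text whenever a partial `<function=` prefix appears, which is exactly the custom logic we’d like to avoid.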

Could you clarify:

  • Whether emitting inline function markup inside natural language responses is expected behavior

  • If there is a supported way to force tool calls to be returned only via the structured tool call channel (no mixed text)

  • Or if there is a recommended prompt or parameter configuration to prevent mixed text + function markup output

This is specifically impacting real-time voice use cases where speech must be serialized correctly around tool execution.

Thanks for your help.

Hi Patrick, I’m going to reproduce this — what model did you use?

Hi, I am currently using llama-3.3-70b-versatile.

Oh interesting, could you please DM me a reproducible cURL?

Hi, you’ll have to excuse me, but I don’t seem to be able to DM you. I do have a cURL which reproduces the issue.

You should have gotten a DM from me!