Which tokeniser is used by Groq?

Hello
What tokeniser is used by Groq? Is there a Python library we can add to our code that tells us the token count BEFORE the request is sent to Groq servers? This would be very useful for context engineering, and optimising tokens will benefit the entire ecosystem.

Hi! We use whatever tokenizer was used to train the model.

You can do this:

from tokenizers import Tokenizer

# Load the same tokenizer the model was trained with
t = Tokenizer.from_pretrained("meta-llama/Meta-Llama-3-70B-Instruct")
t.encode("hello").ids  # list of token ids for the string

Docs: tokenizers · PyPI
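The count you'd check before sending is just the length of that `ids` list. A minimal sketch wrapping this in a helper (`count_tokens` is a hypothetical name; it works with any tokenizer object exposing `.encode(...).ids`, such as the one loaded above):

```python
def count_tokens(text, tokenizer):
    # Number of token ids the model will see for this text.
    return len(tokenizer.encode(text).ids)

# Usage with the tokenizer loaded above:
#   count_tokens("hello", t)
# e.g. to trim an oversized prompt before calling the Groq API.
```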


Thanks for sharing this!

With some trial and error, I arrived at the following for getting an accurate prompt token count that matches the value in the API response:

from tokenizers import Tokenizer

# Load the tokenizer
t = Tokenizer.from_pretrained("openai/gpt-oss-20b")

# messages is the usual chat-completions list of dicts, e.g.:
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"},
]

# Flatten the messages into a single string
text_content = ""
for msg in messages:
    text_content += f"{msg['role']}: {msg['content']}\n"

encoding = t.encode(text_content)
token_count = len(encoding.ids)

# Add the per-message overhead (17 tokens per message, plus 1),
# found by trial and error to match the API's prompt_tokens
accurate_token_count = (len(messages) * 17 + 1) + token_count
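If you call this in several places, the logic above can be wrapped in a helper. A minimal sketch (`estimate_prompt_tokens` and the injected `encode` callable are hypothetical names; the 17-per-message-plus-1 overhead is the empirical constant found above and may differ for other models or chat templates):

```python
def estimate_prompt_tokens(messages, encode):
    """Estimate prompt tokens for a chat completion request.

    `encode` is any callable mapping a string to a list of token ids,
    e.g. `lambda s: t.encode(s).ids` with the tokenizer loaded above.
    The per-message overhead (17 tokens each, plus 1) is the empirical
    constant from the snippet above; it is not guaranteed to hold for
    other models or chat templates.
    """
    text = "".join(f"{m['role']}: {m['content']}\n" for m in messages)
    return len(messages) * 17 + 1 + len(encode(text))
```

With the tokenizer loaded above: `estimate_prompt_tokens(messages, lambda s: t.encode(s).ids)`.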

Wow, I didn't realize there was more work involved to do that! If you have more tips and tricks on how you reverse engineered and built that out, I'd love it if you cross-posted that on the forum as a technical post; it might help a lot of people!