Hi,
When will Groq add embedding models?
Thank you!
Hi @enzo,
We’re considering adding a couple of models but we haven’t settled on which ones to add yet.
Are there any open source ones in particular that you find useful or you’d like us to add?
Hi,
Interested in multilingual embedding models with high retrieval scores (average >60 on the MTEB multilingual retrieval benchmark) for multilingual RAG and similar use cases, e.g.:
BGE-M3 (BAAI/bge-m3 = 65.2 compared to Gemini-Embedding-001 = 68.32, pretty good for an open source model)
Others:
intfloat/multilingual-e5-large (64.8)
nomic-ai/nomic-embed-text-v2 (63.5)
Alibaba-NLP/gte-multilingual-base (62.9)
Qwen/Qwen3-Embedding-0.6B (62.1)
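For anyone wondering what "retrieval" means concretely for these models: at query time you embed the query and rank documents by cosine similarity against their embeddings. A minimal sketch with NumPy (the vectors below are tiny placeholders standing in for real model output, e.g. the 1024-dim vectors bge-m3 produces):

```python
import numpy as np

def top_k(query_vec, doc_vecs, k=2):
    """Rank documents by cosine similarity to the query embedding."""
    q = query_vec / np.linalg.norm(query_vec)
    d = doc_vecs / np.linalg.norm(doc_vecs, axis=1, keepdims=True)
    scores = d @ q
    order = np.argsort(-scores)[:k]
    return order, scores[order]

# Placeholder vectors; a real pipeline would get these from the embedding model.
query = np.array([0.1, 0.9, 0.2])
docs = np.array([
    [0.1, 0.8, 0.3],   # semantically close to the query
    [0.9, 0.1, 0.0],   # unrelated
])
idx, scores = top_k(query, docs, k=2)
# the first document ranks above the second
```

The same ranking loop works with any of the models listed above; only the embedding call changes.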
Regards
Enzo
Thank you, that list is very helpful!
Hi @yawnxyz
Please include mxbai-embed-large-v1 - it's very widely used and it's our default.
Regards
Hi @yawnxyz
We use Qwen/Qwen3-Embedding-8B, which is currently the top open source model on the MTEB leaderboard
Good to know, thank you! I’m curious, do you run these locally or on something like HF Inference endpoints?
We currently use DeepInfra for this specific task; here is their offering:
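For context on the hosted route: most providers in this space (DeepInfra included) expose an OpenAI-compatible embeddings endpoint, so the request body is the standard `/v1/embeddings` shape. A sketch of just the payload - the endpoint URL, auth header, and default model name here are illustrative, not anything Groq has announced:

```python
import json

def build_embedding_request(texts, model="Qwen/Qwen3-Embedding-8B"):
    """Standard OpenAI-style /v1/embeddings payload: a model id plus a list
    of input strings. Accepted by most OpenAI-compatible providers."""
    return {"model": model, "input": texts}

payload = build_embedding_request(["hello world", "bonjour"])
body = json.dumps(payload)
# POST `body` to the provider's embeddings endpoint with an Authorization
# header; the response carries one vector per input under data[i]["embedding"].
```

Because the shape is standardized, swapping providers (or a future Groq endpoint) is usually just a base-URL change.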
I like nomic-embed-text model
I’m curious — what are people liking about something like nomic vs. mixedbread vs. qwen and other ones?
I’ve liked using mixedbread but mostly because it’s lightweight and works fairly well; I haven’t really benchmarked or even vibe checked / cross-compared them… what makes one of these embedding models much better than another?
Are some of these better for some use cases vs. other uses cases?
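Partly answering my own question: the MTEB retrieval averages quoted upthread are mostly nDCG@10, which rewards putting the most relevant documents at the top of the ranking. A minimal sketch of one common formulation (linear gain, log2 discount - exact gain functions vary between evaluation toolkits):

```python
import math

def ndcg_at_k(ranked_rels, k=10):
    """nDCG@k: `ranked_rels` are graded relevance labels in the order the
    model ranked the documents. 1.0 means a perfect ranking."""
    dcg = sum(rel / math.log2(i + 2) for i, rel in enumerate(ranked_rels[:k]))
    ideal = sorted(ranked_rels, reverse=True)
    idcg = sum(rel / math.log2(i + 2) for i, rel in enumerate(ideal[:k]))
    return dcg / idcg if idcg > 0 else 0.0

score = ndcg_at_k([3, 2, 0, 1])
# an almost-perfect ranking scores just below 1.0
```

So a model "much better than another" on MTEB is one that consistently ranks relevant passages higher across that benchmark's languages and domains; which one wins for *your* use case still depends on language mix, domain, and vector size.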
Any roadmap/timeplan for deploying embedding models in groq production?
Not yet, sorry. Embedding models are a lower priority right now.
multilingual-e5-large would be nice, even though it is an older model