New Model Family: Devstral 2

The family includes two models: Devstral 2 (123B) under a modified MIT license, and Devstral Small (24B) under Apache 2.0.

These models score very well for their size, and they are currently outpacing GPT-OSS-120B on coding tasks. I'd like to see them adopted more widely.

I've been playing with Devstral on my local machine and it seems adequate, but not really mind-blowing — has anyone seen it outperform in any way? It's great for a model of its size (and I think it vibes a bit better than GPT-OSS-120B), but I think it still falls short of larger models like Kimi K2.