Mistral

Mistral Small 3.1

Unknown Size

By Mistral • Released 2025-03-17

Capability Radar

Avg Score
27

Across all benchmarks

Participated
13
Benchmarks

Benchmark Performance

Benchmark Category Score
MATH-500
Reasoning
70.7
MMLU-Pro
Knowledge
65.9
GPQA Diamond
Knowledge
45.4
IFBench
Agent
29.9
SciCode
Reasoning Knowledge
26.5
𝜏²-Bench Telecom
Reasoning Knowledge
25.1
LiveCodeBench
Coding
21.2
LCR
Long-Context Reasoning
19.7
Artificial Analysis Intelligence Index
Knowledge
14
Artificial Analysis Coding Index
Coding
13.9
Terminal-Bench Hard
Agent Coding
7.6
HLE
Knowledge Multi-Modal
4.8
AIME 2025
Reasoning
3.7