Mistral

Mistral Medium 3.1

Unknown Size

By Mistral • Released 2025-08-12

Capability Radar

Avg Score
33

Across all benchmarks

Participated
12
Benchmarks

Benchmark Performance

Benchmark Category Score
MMLU-Pro
Knowledge
68.3
GPQA Diamond
Knowledge
58.8
LiveCodeBench
Coding
40.6
𝜏²-Bench Telecom
Reasoning Knowledge
40.6
IFBench
Agent
39.8
AIME 2025
Reasoning
38.3
SciCode
Reasoning Knowledge
33.8
Artificial Analysis Intelligence Index
Knowledge
21.1
LCR
Long-Context Reasoning
19.7
Artificial Analysis Coding Index
Coding
18.3
Terminal-Bench Hard
Agent Coding
10.6
HLE
Knowledge Multi-Modal
4.4