Mistral Large 2 (Nov '24)

Unknown Size

By Mistral • Released 2024-11-18

[Figure: Capability Radar]

Avg Score: 29 (across all benchmarks)
Participated: 13 benchmarks

Benchmark Performance

Benchmark                                 Category                  Score
MATH-500                                  Reasoning                 73.6
MMLU-Pro                                  Knowledge                 69.7
GPQA Diamond                              Knowledge                 48.6
IFBench                                   Agent                     31.2
𝜏²-Bench Telecom                          Reasoning, Knowledge      30.7
LiveCodeBench                             Coding                    29.3
SciCode                                   Reasoning, Knowledge      29.2
Artificial Analysis Intelligence Index    Knowledge                 15.1
AIME 2025                                 Reasoning                 14
Artificial Analysis Coding Index          Coding                    13.8
Terminal-Bench Hard                       Agent, Coding             6.1
LCR                                       Long-Context Reasoning    5.3
HLE                                       Knowledge, Multi-Modal    4
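
The Avg Score figure can be cross-checked against the table. The page does not state how the average is computed, so this is a minimal sketch that assumes an unweighted mean over all 13 listed scores (including the two composite Artificial Analysis indices), rounded to the nearest integer. The variable names are illustrative only.

```python
# Scores copied from the Benchmark Performance table, in listed order.
scores = {
    "MATH-500": 73.6,
    "MMLU-Pro": 69.7,
    "GPQA Diamond": 48.6,
    "IFBench": 31.2,
    "𝜏²-Bench Telecom": 30.7,
    "LiveCodeBench": 29.3,
    "SciCode": 29.2,
    "Artificial Analysis Intelligence Index": 15.1,
    "AIME 2025": 14.0,
    "Artificial Analysis Coding Index": 13.8,
    "Terminal-Bench Hard": 6.1,
    "LCR": 5.3,
    "HLE": 4.0,
}

# Assumption: the page's average is a plain unweighted mean of every row.
mean_score = sum(scores.values()) / len(scores)

print(len(scores))           # 13
print(round(mean_score, 1))  # 28.5
print(round(mean_score))     # 29
```

Under that assumption the result agrees with the reported values: 13 benchmarks and an average that rounds to 29.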