MiniMax

MiniMax-M2

Unknown Size

By MiniMax • Released 2025-10-26

Capability Radar

Avg Score
57

Across all benchmarks

Participated
13
Benchmarks

Benchmark Performance

Benchmark Category Score
𝜏²-Bench Telecom
Reasoning Knowledge
86.8
LiveCodeBench
Coding
82.6
MMLU-Pro
Knowledge
82
AIME 2025
Reasoning
78.3
GPQA Diamond
Knowledge
77.7
IFBench
Agent
72.3
LCR
Long-Context Reasoning
61
SWE-bench (Bash Only)
Coding Agent
61
SciCode
Reasoning Knowledge
36.1
Artificial Analysis Intelligence Index
Knowledge
36
Artificial Analysis Coding Index
Coding
29.2
Terminal-Bench Hard
Agent Coding
25.8
HLE
Knowledge Multi-Modal
12.5