OpenAI

o3-mini

Unknown Size

By OpenAI • Released 2025-01-31

Avg Score: 45 (across all benchmarks)

Participated: 10 benchmarks

Benchmark Performance

Benchmark                                 Category                Score
MATH-500                                  Reasoning                97.3
MMLU-Pro                                  Knowledge                79.1
GPQA Diamond                              Knowledge                74.8
LiveCodeBench                             Coding                   71.7
SciCode                                   Reasoning, Knowledge     39.9
𝜏²-Bench Telecom                          Reasoning, Knowledge     28.7
Artificial Analysis Intelligence Index    Knowledge                25.9
Artificial Analysis Coding Index          Coding                   17.9
HLE                                       Knowledge, Multi-Modal    8.7
Terminal-Bench Hard                       Agent, Coding             6.8
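
The page does not say how the Avg Score of 45 is derived. A minimal sketch, assuming it is the unweighted arithmetic mean of the ten scores listed above (the `scores` dictionary and the `average_score` helper are illustrative names, not part of the source), reproduces the figure:

```python
# Scores copied from the Benchmark Performance list above.
scores = {
    "MATH-500": 97.3,
    "MMLU-Pro": 79.1,
    "GPQA Diamond": 74.8,
    "LiveCodeBench": 71.7,
    "SciCode": 39.9,
    "𝜏²-Bench Telecom": 28.7,
    "Artificial Analysis Intelligence Index": 25.9,
    "Artificial Analysis Coding Index": 17.9,
    "HLE": 8.7,
    "Terminal-Bench Hard": 6.8,
}


def average_score(values):
    """Unweighted arithmetic mean (an assumption about how the page aggregates)."""
    values = list(values)
    return sum(values) / len(values)


mean = average_score(scores.values())
print(f"{len(scores)} benchmarks, mean = {mean:.2f}")  # 10 benchmarks, mean = 45.08
```

Under that assumption the mean is 45.08, which rounds to the displayed 45. Note that this average mixes individual benchmark scores with the two composite Artificial Analysis indices, so it is a rough summary rather than a like-for-like comparison.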