OpenAI

GPT-5 mini (medium)

Unknown Size

By OpenAI • Released 2025-08-07

Capability Radar

Avg Score
57

Across all benchmarks

Participated
13
Benchmarks

Benchmark Performance

Benchmark Category Score
AIME 2025
Reasoning
85
MMLU-Pro
Knowledge
82.8
GPQA Diamond
Knowledge
80.3
IFBench
Agent
71.2
𝜏²-Bench Telecom
Reasoning Knowledge
71.1
LiveCodeBench
Coding
69.2
LCR
Long-Context Reasoning
66
SWE-bench (Bash Only)
Coding Agent
59.8
SciCode
Reasoning Knowledge
41
Artificial Analysis Intelligence Index
Knowledge
38.8
Artificial Analysis Coding Index
Coding
32.9
Terminal-Bench Hard
Agent Coding
28.8
HLE
Knowledge Multi-Modal
14.6