OpenAI

GPT-5 Codex (high)

Unknown Size

By OpenAI • Released 2025-09-23

Capability Radar

Avg Score: 64 (across all benchmarks)

Participated: 12 benchmarks

Benchmark Performance

Benchmark                              | Category               | Score
AIME 2025                              | Reasoning              | 98.7
𝜏²-Bench Telecom                       | Reasoning Knowledge    | 86.8
MMLU-Pro                               | Knowledge              | 86.5
LiveCodeBench                          | Coding                 | 84
GPQA Diamond                           | Knowledge              | 83.7
IFBench                                | Agent                  | 74.1
LCR                                    | Long-Context Reasoning | 69
Artificial Analysis Intelligence Index | Knowledge              | 44.5
SciCode                                | Reasoning Knowledge    | 40.9
Artificial Analysis Coding Index       | Coding                 | 38.9
Terminal-Bench Hard                    | Agent Coding           | 37.9
HLE                                    | Knowledge Multi-Modal  | 25.6
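
The reported average can be cross-checked against the individual scores. The sketch below assumes the Avg Score is an unweighted arithmetic mean of the 12 listed benchmark scores, rounded to the nearest integer; the page does not state how it is computed, so treat that as an assumption. The names `scores` and `average_score` are illustrative only.

```python
# Scores copied from the Benchmark Performance table.
scores = {
    "AIME 2025": 98.7,
    "𝜏²-Bench Telecom": 86.8,
    "MMLU-Pro": 86.5,
    "LiveCodeBench": 84.0,
    "GPQA Diamond": 83.7,
    "IFBench": 74.1,
    "LCR": 69.0,
    "Artificial Analysis Intelligence Index": 44.5,
    "SciCode": 40.9,
    "Artificial Analysis Coding Index": 38.9,
    "Terminal-Bench Hard": 37.9,
    "HLE": 25.6,
}


def average_score(results: dict[str, float]) -> float:
    """Unweighted mean of all benchmark scores (assumed aggregation)."""
    return sum(results.values()) / len(results)


mean = average_score(scores)
print(f"{len(scores)} benchmarks, mean = {mean:.1f}")  # 12 benchmarks, mean = 64.2
```

Under that assumption the mean is about 64.2, which rounds to the displayed 64. Note that the benchmarks use different scales and difficulty levels, so a simple mean is only a rough summary.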