OpenAI

GPT-5 (ChatGPT)

Unknown Size

By OpenAI • Released 2025-08-07

Capability Radar

Avg Score
38

Across all benchmarks

Participated
12
Benchmarks

Benchmark Performance

Benchmark Category Score
MMLU-Pro
Knowledge
82
GPQA Diamond
Knowledge
68.6
LCR
Long-Context Reasoning
63.7
LiveCodeBench
Coding
54.3
AIME 2025
Reasoning
48.3
IFBench
Agent
45
SciCode
Reasoning Knowledge
37.8
Artificial Analysis Intelligence Index
Knowledge
21.8
Artificial Analysis Coding Index
Coding
21.2
Terminal-Bench Hard
Agent Coding
12.9
HLE
Knowledge Multi-Modal
5.8
𝜏²-Bench Telecom
Reasoning Knowledge
0