OpenAI

GPT-5.1 (high)

Unknown Size

By OpenAI • Released 2025-11-13

Avg Score: 66 (across all benchmarks)

Participated: 13 benchmarks

Benchmark Performance

Benchmark | Category | Score
AIME 2025 | Reasoning | 94
GPQA Diamond | Knowledge | 87.3
MMLU-Pro | Knowledge | 87
LiveCodeBench | Coding | 86.8
𝜏²-Bench Telecom | Reasoning Knowledge | 81.9
LCR | Long-Context Reasoning | 75
IFBench | Agent | 72.9
SWE-bench (Bash Only) | Coding Agent | 66
Artificial Analysis Intelligence Index | Knowledge | 47.6
Terminal-Bench Hard | Agent Coding | 45.5
Artificial Analysis Coding Index | Coding | 44.7
SciCode | Reasoning Knowledge | 43.3
HLE | Knowledge Multi-Modal | 26.5
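
The reported Avg Score of 66 can be checked against the 13 listed scores. The sketch below assumes the average is an unweighted arithmetic mean rounded to the nearest integer; the page does not state how it is computed, so treat that as an assumption. The variable names are illustrative only.

```python
# Scores copied from the benchmark list on this page.
scores = {
    "AIME 2025": 94,
    "GPQA Diamond": 87.3,
    "MMLU-Pro": 87,
    "LiveCodeBench": 86.8,
    "𝜏²-Bench Telecom": 81.9,
    "LCR": 75,
    "IFBench": 72.9,
    "SWE-bench (Bash Only)": 66,
    "Artificial Analysis Intelligence Index": 47.6,
    "Terminal-Bench Hard": 45.5,
    "Artificial Analysis Coding Index": 44.7,
    "SciCode": 43.3,
    "HLE": 26.5,
}

# Assumption: "Avg Score" is a plain (unweighted) mean over all benchmarks.
benchmark_count = len(scores)
average = sum(scores.values()) / benchmark_count

print(f"Benchmarks: {benchmark_count}")
print(f"Average score: {average:.2f} (rounded: {round(average)})")
```

Under that assumption the mean comes out just above 66, which matches the figure shown on the page.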