OpenAI

GPT-5.1 Codex mini (high)

Unknown Size

By OpenAI • Released 2025-11-13

Capability Radar

Avg Score
58

Across all benchmarks

Participated
12
Benchmarks

Benchmark Performance

Benchmark Category Score
AIME 2025
Reasoning
91.7
LiveCodeBench
Coding
83.6
MMLU-Pro
Knowledge
82
GPQA Diamond
Knowledge
81.3
IFBench
Agent
67.9
𝜏²-Bench Telecom
Reasoning Knowledge
62.9
LCR
Long-Context Reasoning
62.7
SciCode
Reasoning Knowledge
42.6
Artificial Analysis Intelligence Index
Knowledge
38.5
Artificial Analysis Coding Index
Coding
36.4
Terminal-Bench Hard
Agent Coding
33.3
HLE
Knowledge Multi-Modal
16.9