OpenAI

GPT-4o (Aug '24)

Unknown Size

By OpenAI • Released 2024-08-06

Capability Radar

Avg Score
31

Across all benchmarks

Participated
11
Benchmarks

Benchmark Performance

Benchmark Category Score
MATH-500
Reasoning
79.5
GPQA Diamond
Knowledge
52.1
IFBench
Agent
36
LCR
Long-Context Reasoning
35
SciCode
Reasoning Knowledge
33.1
LiveCodeBench
Coding
31.7
𝜏²-Bench Telecom
Reasoning Knowledge
28.9
Artificial Analysis Intelligence Index
Knowledge
18.8
Artificial Analysis Coding Index
Coding
16.6
Terminal-Bench Hard
Agent Coding
8.3
HLE
Knowledge Multi-Modal
2.9