OpenAI

gpt-oss-20B (high)

Unknown Size

By OpenAI • Released 2025-08-05

Capability Radar

Avg Score
47

Across all benchmarks

Participated
12
Benchmarks

Benchmark Performance

Benchmark Category Score
AIME 2025
Reasoning
89.3
LiveCodeBench
Coding
77.7
MMLU-Pro
Knowledge
74.8
GPQA Diamond
Knowledge
68.8
IFBench
Agent
65.1
𝜏²-Bench Telecom
Reasoning Knowledge
60.2
SciCode
Reasoning Knowledge
34.4
LCR
Long-Context Reasoning
30.7
Artificial Analysis Intelligence Index
Knowledge
24.5
Artificial Analysis Coding Index
Coding
18.5
Terminal-Bench Hard
Agent Coding
10.6
HLE
Knowledge Multi-Modal
9.8