Seed-OSS-36B-Instruct

Unknown Size

By ByteDance Seed • Released 2025-08-20

[Figure: Capability Radar]

Avg Score: 47 (across all benchmarks)
Benchmarks participated: 12

Benchmark Performance

Benchmark                               Category                Score
AIME 2025                               Reasoning               84.7
MMLU-Pro                                Knowledge               81.5
LiveCodeBench                           Coding                  76.5
GPQA Diamond                            Knowledge               72.6
LCR                                     Long-Context Reasoning  57.7
𝜏²-Bench Telecom                        Reasoning, Knowledge    49.4
IFBench                                 Agent                   41.9
SciCode                                 Reasoning, Knowledge    36.5
Artificial Analysis Intelligence Index  Knowledge               25
Artificial Analysis Coding Index        Coding                  16.7
HLE                                     Knowledge, Multi-Modal  9.1
Terminal-Bench Hard                     Agent, Coding           6.8
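
The Avg Score of 47 appears consistent with an unweighted mean of the 12 scores in the table above, rounded to the nearest integer. The page does not state how the figure is computed, so the sketch below is an assumption rather than the site's documented method; the names `scores` and `average_score` are illustrative only.

```python
# Scores transcribed from the Benchmark Performance table.
scores = {
    "AIME 2025": 84.7,
    "MMLU-Pro": 81.5,
    "LiveCodeBench": 76.5,
    "GPQA Diamond": 72.6,
    "LCR": 57.7,
    "tau2-Bench Telecom": 49.4,
    "IFBench": 41.9,
    "SciCode": 36.5,
    "Artificial Analysis Intelligence Index": 25.0,
    "Artificial Analysis Coding Index": 16.7,
    "HLE": 9.1,
    "Terminal-Bench Hard": 6.8,
}


def average_score(values):
    """Unweighted arithmetic mean (assumed aggregation, not confirmed by the source)."""
    values = list(values)
    return sum(values) / len(values)


mean = average_score(scores.values())
print(f"benchmarks: {len(scores)}")      # benchmarks: 12
print(f"mean score: {mean:.2f}")         # mean score: 46.53
print(f"rounded:    {round(mean)}")      # rounded:    47
```

Note that this mean mixes composite indices (the two Artificial Analysis indices) with individual benchmarks on different scales, so it is best read as a rough summary rather than a calibrated measure.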