Alibaba

Qwen3 Coder 480B A35B Instruct

Unknown Size

By Alibaba • Released 2025-07-22

Capability Radar

Avg Score
44

Across all benchmarks

Participated
14
Benchmarks

Benchmark Performance

Benchmark Category Score
MATH-500
Reasoning
94.2
MMLU-Pro
Knowledge
78.8
GPQA Diamond
Knowledge
61.8
LiveCodeBench
Coding
58.5
SWE-bench (Bash Only)
Coding Agent
55.4
𝜏²-Bench Telecom
Reasoning Knowledge
43.6
LCR
Long-Context Reasoning
42.3
IFBench
Agent
40.5
AIME 2025
Reasoning
39.3
SciCode
Reasoning Knowledge
35.9
Artificial Analysis Coding Index
Coding
24.6
Artificial Analysis Intelligence Index
Knowledge
24.6
Terminal-Bench Hard
Agent Coding
18.9
HLE
Knowledge Multi-Modal
4.4