Alibaba

Qwen3 30B A3B (Reasoning)

Unknown Size

By Alibaba • Released 2025-04-28

Capability Radar

Avg Score
38

Across all benchmarks

Participated
13
Benchmarks

Benchmark Performance

Benchmark Category Score
MATH-500
Reasoning
95.9
MMLU-Pro
Knowledge
77.7
AIME 2025
Reasoning
72.3
GPQA Diamond
Knowledge
61.6
LiveCodeBench
Coding
50.6
IFBench
Agent
41.5
SciCode
Reasoning Knowledge
28.5
𝜏²-Bench Telecom
Reasoning Knowledge
26
Artificial Analysis Intelligence Index
Knowledge
15.3
Artificial Analysis Coding Index
Coding
11
HLE
Knowledge Multi-Modal
6.6
Terminal-Bench Hard
Agent Coding
2.3
LCR
Long-Context Reasoning
0