Alibaba

Qwen3 30B A3B 2507 (Reasoning)

Unknown Size

By Alibaba • Released 2025-07-30

Capability Radar

Avg Score
46

Across all benchmarks

Participated
13
Benchmarks

Benchmark Performance

Benchmark Category Score
MATH-500
Reasoning
97.6
MMLU-Pro
Knowledge
80.5
GPQA Diamond
Knowledge
70.7
LiveCodeBench
Coding
70.7
LCR
Long-Context Reasoning
59
AIME 2025
Reasoning
56.3
IFBench
Agent
50.7
SciCode
Reasoning Knowledge
33.3
𝜏²-Bench Telecom
Reasoning Knowledge
28.1
Artificial Analysis Intelligence Index
Knowledge
22.4
Artificial Analysis Coding Index
Coding
14.7
HLE
Knowledge Multi-Modal
9.8
Terminal-Bench Hard
Agent Coding
5.3