Alibaba

Qwen3 30B A3B 2507 Instruct

Unknown Size

By Alibaba • Released 2025-07-29

Capability Radar

Avg Score
38

Across all benchmarks

Participated
13
Benchmarks

Benchmark Performance

Benchmark Category Score
MATH-500
Reasoning
97.5
MMLU-Pro
Knowledge
77.7
AIME 2025
Reasoning
66.3
GPQA Diamond
Knowledge
65.9
LiveCodeBench
Coding
51.5
IFBench
Agent
33.1
SciCode
Reasoning Knowledge
30.4
LCR
Long-Context Reasoning
22.7
Artificial Analysis Intelligence Index
Knowledge
15
Artificial Analysis Coding Index
Coding
14.2
𝜏²-Bench Telecom
Reasoning Knowledge
10.2
HLE
Knowledge Multi-Modal
6.8
Terminal-Bench Hard
Agent Coding
6.1