Alibaba

Qwen3 0.6B (Non-reasoning)

Unknown Size

By Alibaba • Released 2025-04-28

Avg Score: 13 (across all benchmarks)

Participated: 13 benchmarks

Benchmark Performance

Benchmark                                 Category                  Score
MATH-500                                  Reasoning                 52.1
GPQA Diamond                              Knowledge                 23.1
MMLU-Pro                                  Knowledge                 23.1
IFBench                                   Agent                     21.9
𝜏²-Bench Telecom                          Reasoning Knowledge       14.6
AIME 2025                                 Reasoning                 10.3
LiveCodeBench                             Coding                    7.3
Artificial Analysis Intelligence Index    Knowledge                 5.6
HLE                                       Knowledge Multi-Modal     5.2
SciCode                                   Reasoning Knowledge       4.1
Artificial Analysis Coding Index          Coding                    1.4
LCR                                       Long-Context Reasoning    0
Terminal-Bench Hard                       Agent Coding              0
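
The Avg Score of 13 can be checked against the scores listed above. The sketch below assumes the figure is a plain unweighted mean over all 13 benchmarks; the page does not say how it is computed, so treat that as an assumption. The variable names are illustrative only.

```python
# Scores copied from the benchmark list above.
scores = {
    "MATH-500": 52.1,
    "GPQA Diamond": 23.1,
    "MMLU-Pro": 23.1,
    "IFBench": 21.9,
    "τ²-Bench Telecom": 14.6,
    "AIME 2025": 10.3,
    "LiveCodeBench": 7.3,
    "Artificial Analysis Intelligence Index": 5.6,
    "HLE": 5.2,
    "SciCode": 4.1,
    "Artificial Analysis Coding Index": 1.4,
    "LCR": 0.0,
    "Terminal-Bench Hard": 0.0,
}

# Assumption: the page's "Avg Score" is an unweighted mean of every listed score.
average = sum(scores.values()) / len(scores)

print(f"{len(scores)} benchmarks, mean score {average:.1f}")
```

Under that assumption the mean comes to roughly 12.98, which rounds to the reported 13. Note that the two 0 scores and the two composite indices are included in the mean here; excluding any of them would give a different figure.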