Alibaba

Qwen3 0.6B (Reasoning)

Unknown Size

By Alibaba • Released 2025-04-28

Capability Radar

Avg Score
17

Across all benchmarks

Participated
13
Benchmarks

Benchmark Performance

Benchmark Category Score
MATH-500
Reasoning
75
MMLU-Pro
Knowledge
34.7
GPQA Diamond
Knowledge
23.9
IFBench
Agent
23.3
𝜏²-Bench Telecom
Reasoning Knowledge
21.1
AIME 2025
Reasoning
18
LiveCodeBench
Coding
12.1
Artificial Analysis Intelligence Index
Knowledge
6.4
HLE
Knowledge Multi-Modal
5.7
SciCode
Reasoning Knowledge
2.8
Artificial Analysis Coding Index
Coding
0.9
LCR
Long-Context Reasoning
0
Terminal-Bench Hard
Agent Coding
0