Alibaba

Qwen3 1.7B (Reasoning)

Unknown Size

By Alibaba • Released 2025-04-28

Capability Radar

Avg Score
25

Across all benchmarks

Participated
13
Benchmarks

Benchmark Performance

Benchmark Category Score
MATH-500
Reasoning
89.4
MMLU-Pro
Knowledge
57
AIME 2025
Reasoning
38.7
GPQA Diamond
Knowledge
35.6
LiveCodeBench
Coding
30.8
IFBench
Agent
26.9
𝜏²-Bench Telecom
Reasoning Knowledge
26
Artificial Analysis Intelligence Index
Knowledge
7.9
HLE
Knowledge Multi-Modal
4.8
SciCode
Reasoning Knowledge
4.3
Artificial Analysis Coding Index
Coding
1.4
LCR
Long-Context Reasoning
0
Terminal-Bench Hard
Agent Coding
0