Qwen3 14B (Non-reasoning)

Unknown Size

By Alibaba • Released 2025-04-28

Avg Score: 31 (across all benchmarks)
Participated: 13 benchmarks

Benchmark Performance

Benchmark                               Category                Score
MATH-500                                Reasoning               87.1
MMLU-Pro                                Knowledge               67.5
AIME 2025                               Reasoning               58
GPQA Diamond                            Knowledge               47
𝜏²-Bench Telecom                        Reasoning, Knowledge    32.2
LiveCodeBench                           Coding                  28
SciCode                                 Reasoning, Knowledge    26.5
IFBench                                 Agent                   23.9
Artificial Analysis Intelligence Index  Knowledge               12.7
Artificial Analysis Coding Index        Coding                  12.4
Terminal-Bench Hard                     Agent, Coding           5.3
HLE                                     Knowledge, Multi-Modal  4.2
LCR                                     Long-Context Reasoning  0
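
The page does not say how the Avg Score is derived. As a rough check, and assuming it is a plain unweighted mean of the 13 listed scores (an assumption, not something the page confirms), the figure can be reproduced with a short Python sketch. The dictionary and the function name `average_score` are illustrative only.

```python
# Scores copied from the Benchmark Performance listing.
scores = {
    "MATH-500": 87.1,
    "MMLU-Pro": 67.5,
    "AIME 2025": 58,
    "GPQA Diamond": 47,
    "𝜏²-Bench Telecom": 32.2,
    "LiveCodeBench": 28,
    "SciCode": 26.5,
    "IFBench": 23.9,
    "Artificial Analysis Intelligence Index": 12.7,
    "Artificial Analysis Coding Index": 12.4,
    "Terminal-Bench Hard": 5.3,
    "HLE": 4.2,
    "LCR": 0,
}


def average_score(table):
    """Unweighted mean of all scores (assumed aggregation, see lead-in)."""
    return sum(table.values()) / len(table)


avg = average_score(scores)
print(f"{len(scores)} benchmarks, mean = {avg:.1f}")
# prints "13 benchmarks, mean = 31.1", which rounds to the listed 31
```

Under this assumption the two composite indices and the 0 on LCR count the same as any other benchmark, so the average is pulled well below the scores on MATH-500 and MMLU-Pro.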