Alibaba

Qwen3 4B (Non-reasoning)

Unknown Size

By Alibaba • Released 2025-04-28

Capability Radar

Avg Score
34

Across all benchmarks

Participated
7
Benchmarks

Benchmark Performance

Benchmark Category Score
MATH-500
Reasoning
84.3
MMLU-Pro
Knowledge
58.6
GPQA Diamond
Knowledge
39.8
LiveCodeBench
Coding
23.3
SciCode
Reasoning Knowledge
16.7
Artificial Analysis Intelligence Index
Knowledge
12.5
HLE
Knowledge Multi-Modal
3.7