DeepSeek

DeepSeek R1 Distill Qwen 32B

Unknown Size

By DeepSeek • Released 2025-01-20

Capability Radar

Avg Score
41

Across all benchmarks

Participated
10
Benchmarks

Benchmark Performance

Benchmark Category Score
MATH-500
Reasoning
94.1
MMLU-Pro
Knowledge
73.9
AIME 2025
Reasoning
63
GPQA Diamond
Knowledge
61.5
SciCode
Reasoning Knowledge
37.6
LiveCodeBench
Coding
27
IFBench
Agent
22.9
Artificial Analysis Intelligence Index
Knowledge
17.2
LCR
Long-Context Reasoning
9.7
HLE
Knowledge Multi-Modal
5.5