DeepSeek R1 Distill Qwen 1.5B

Unknown Size

By DeepSeek • Released 2025-01-20

[Figure: Capability Radar]

Avg Score: 17 (across all benchmarks)

Participated: 10 benchmarks

Benchmark Performance

Benchmark                                 Category                  Score
MATH-500                                  Reasoning                 68.7
MMLU-Pro                                  Knowledge                 26.9
AIME 2025                                 Reasoning                 22
IFBench                                   Agent                     13.2
GPQA Diamond                              Knowledge                 9.8
Artificial Analysis Intelligence Index    Knowledge                 9.1
LiveCodeBench                             Coding                    7
SciCode                                   Reasoning, Knowledge      6.6
HLE                                       Knowledge, Multi-Modal    3.3
LCR                                       Long-Context Reasoning    0.3
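
The Avg Score of 17 is consistent with an unweighted mean of the ten benchmark scores listed, rounded to the nearest integer. The page does not state how the average is computed, so the sketch below is only an assumption that happens to reproduce the figure; the `average_score` helper is hypothetical.

```python
# Scores copied from the Benchmark Performance list.
scores = {
    "MATH-500": 68.7,
    "MMLU-Pro": 26.9,
    "AIME 2025": 22,
    "IFBench": 13.2,
    "GPQA Diamond": 9.8,
    "Artificial Analysis Intelligence Index": 9.1,
    "LiveCodeBench": 7,
    "SciCode": 6.6,
    "HLE": 3.3,
    "LCR": 0.3,
}


def average_score(scores: dict[str, float]) -> float:
    """Unweighted mean of all benchmark scores (assumed aggregation)."""
    return sum(scores.values()) / len(scores)


mean = average_score(scores)
# Prints: 10 benchmarks, mean = 16.69, rounded = 17
print(f"{len(scores)} benchmarks, mean = {mean:.2f}, rounded = {round(mean)}")
```

Because the mean is unweighted, the MATH-500 score of 68.7 pulls it up considerably; the other nine scores average about 10.9.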