DeepSeek

DeepSeek R1 Distill Qwen 14B

Unknown Size

By DeepSeek • Released 2025-01-20

Capability Radar

Avg Score
38

Across all benchmarks

Participated
10
Benchmarks

Benchmark Performance

Benchmark Category Score
MATH-500
Reasoning
94.9
MMLU-Pro
Knowledge
74
AIME 2025
Reasoning
55.7
GPQA Diamond
Knowledge
48.4
LiveCodeBench
Coding
37.6
SciCode
Reasoning Knowledge
23.9
IFBench
Agent
22.1
Artificial Analysis Intelligence Index
Knowledge
15.8
LCR
Long-Context Reasoning
7
HLE
Knowledge Multi-Modal
4.4