DeepSeek R1 Distill Llama 8B

Unknown Size

By DeepSeek • Released 2025-01-20

Avg Score: 28 (across all benchmarks)
Benchmarks participated: 10

Benchmark Performance

Benchmark                                 Category                  Score
MATH-500                                  Reasoning                 85.3
MMLU-Pro                                  Knowledge                 54.3
AIME 2025                                 Reasoning                 41.3
GPQA Diamond                              Knowledge                 30.2
LiveCodeBench                             Coding                    23.3
IFBench                                   Agent                     17.6
Artificial Analysis Intelligence Index    Knowledge                 12.1
SciCode                                   Reasoning, Knowledge      11.9
HLE                                       Knowledge, Multi-Modal    4.2
LCR                                       Long-Context Reasoning    0
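
The reported Avg Score of 28 is consistent with a plain unweighted mean of the ten scores listed above. The page does not say how the figure is computed, so the sketch below assumes an unweighted mean; the names `scores` and `average_score` are illustrative and not taken from the site.

```python
# Scores copied from the Benchmark Performance table.
scores = {
    "MATH-500": 85.3,
    "MMLU-Pro": 54.3,
    "AIME 2025": 41.3,
    "GPQA Diamond": 30.2,
    "LiveCodeBench": 23.3,
    "IFBench": 17.6,
    "Artificial Analysis Intelligence Index": 12.1,
    "SciCode": 11.9,
    "HLE": 4.2,
    "LCR": 0.0,
}


def average_score(values):
    """Unweighted arithmetic mean (assumed, not confirmed by the page)."""
    values = list(values)
    return sum(values) / len(values)


avg = average_score(scores.values())
print(f"{len(scores)} benchmarks, mean = {avg:.2f}, rounded = {round(avg)}")
```

Under this assumption the LCR score of 0 counts toward the mean like any other entry. Excluding it would give a noticeably higher average over nine benchmarks.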