Microsoft Azure

Phi-4

Unknown Size

By Microsoft Azure • Released 2024-12-12

Capability Radar

Avg Score
26

Across all benchmarks

Participated
13
Benchmarks

Benchmark Performance

Benchmark Category Score
MATH-500
Reasoning
81
MMLU-Pro
Knowledge
71.4
GPQA Diamond
Knowledge
57.5
SciCode
Reasoning Knowledge
26
IFBench
Agent
23.5
LiveCodeBench
Coding
23.1
AIME 2025
Reasoning
18
Artificial Analysis Intelligence Index
Knowledge
13.2
Artificial Analysis Coding Index
Coding
11.2
HLE
Knowledge Multi-Modal
4.1
Terminal-Bench Hard
Agent Coding
3.8
LCR
Long-Context Reasoning
0
𝜏²-Bench Telecom
Reasoning Knowledge
0