MBZUAI Institute of Foundation Models

K2-V2 (medium)

Unknown Size

By MBZUAI Institute of Foundation Models • Released 2025-12-05

Capability Radar

Avg Score
36

Across all benchmarks

Participated
12
Benchmarks

Benchmark Performance

Benchmark Category Score
MMLU-Pro
Knowledge
76.1
AIME 2025
Reasoning
64.7
GPQA Diamond
Knowledge
59.8
IFBench
Agent
55.1
LiveCodeBench
Coding
54.1
LCR
Long-Context Reasoning
28
SciCode
Reasoning Knowledge
25.2
𝜏²-Bench Telecom
Reasoning Knowledge
24.9
Artificial Analysis Intelligence Index
Knowledge
18.7
Artificial Analysis Coding Index
Coding
14
Terminal-Bench Hard
Agent Coding
8.3
HLE
Knowledge Multi-Modal
4.4