MBZUAI Institute of Foundation Models

K2-V2 (high)

Unknown Size

By MBZUAI Institute of Foundation Models • Released 2025-12-05

Capability Radar

Avg Score: 42 (across all benchmarks)

Benchmarks participated: 12

Benchmark Performance

Benchmark                                 Category                  Score
MMLU-Pro                                  Knowledge                 78.6
AIME 2025                                 Reasoning                 78.3
LiveCodeBench                             Coding                    69.4
GPQA Diamond                              Knowledge                 68.1
IFBench                                   Agent                     60.1
LCR                                       Long-Context Reasoning    33.3
SciCode                                   Reasoning, Knowledge      28.6
𝜏²-Bench Telecom                          Reasoning, Knowledge      27.8
Artificial Analysis Intelligence Index    Knowledge                 20.7
Artificial Analysis Coding Index          Coding                    16.1
HLE                                       Knowledge, Multi-Modal    9.8
Terminal-Bench Hard                       Agent, Coding             9.8
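
The page does not say how the Avg Score of 42 is derived from the individual results. The sketch below checks one plausible reading: it assumes (this is not confirmed by the source) that the headline figure is the unweighted mean of the 12 listed scores, rounded to the nearest integer. The `scores` dictionary is simply the table above retyped by hand.

```python
from statistics import mean

# Scores copied from the Benchmark Performance table.
scores = {
    "MMLU-Pro": 78.6,
    "AIME 2025": 78.3,
    "LiveCodeBench": 69.4,
    "GPQA Diamond": 68.1,
    "IFBench": 60.1,
    "LCR": 33.3,
    "SciCode": 28.6,
    "τ²-Bench Telecom": 27.8,
    "Artificial Analysis Intelligence Index": 20.7,
    "Artificial Analysis Coding Index": 16.1,
    "HLE": 9.8,
    "Terminal-Bench Hard": 9.8,
}

# Assumption: the headline number is an unweighted mean over all rows.
avg = mean(scores.values())

print(f"benchmarks: {len(scores)}")
print(f"mean score: {avg:.1f} (rounded: {round(avg)})")
```

Under that assumption the mean comes out at about 41.7, which rounds to the reported 42, and the row count matches the 12 benchmarks listed as participated. A weighted or per-category average could give a different figure, so this is a consistency check rather than a statement of the site's method.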