Google

Gemini 2.5 Flash (Reasoning)

Model size: unknown

By Google • Released 2025-05-20

Capability Radar

Avg Score: 51 (across all benchmarks)

Participated: 13 benchmarks

Benchmark Performance

Benchmark                                 Category                 Score
MATH-500                                  Reasoning                98.1
MMLU-Pro                                  Knowledge                83.2
GPQA Diamond                              Knowledge                79
AIME 2025                                 Reasoning                73.3
LiveCodeBench                             Coding                   69.5
LCR                                       Long-Context Reasoning   61.7
IFBench                                   Agent                    50.3
SciCode                                   Reasoning, Knowledge     39.4
𝜏²-Bench Telecom                          Reasoning, Knowledge     31.6
Artificial Analysis Intelligence Index    Knowledge                26.8
Artificial Analysis Coding Index          Coding                   22.2
Terminal-Bench Hard                       Agent, Coding            13.6
HLE                                       Knowledge, Multi-Modal   11.1
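
The page does not say how the Avg Score is derived. A minimal sketch, assuming it is the unweighted arithmetic mean of the 13 listed scores rounded to the nearest integer, reproduces the reported value of 51. The names `SCORES` and `average_score` are illustrative and not part of any published tooling.

```python
# Scores copied from the Benchmark Performance table above.
SCORES = {
    "MATH-500": 98.1,
    "MMLU-Pro": 83.2,
    "GPQA Diamond": 79.0,
    "AIME 2025": 73.3,
    "LiveCodeBench": 69.5,
    "LCR": 61.7,
    "IFBench": 50.3,
    "SciCode": 39.4,
    "𝜏²-Bench Telecom": 31.6,
    "Artificial Analysis Intelligence Index": 26.8,
    "Artificial Analysis Coding Index": 22.2,
    "Terminal-Bench Hard": 13.6,
    "HLE": 11.1,
}


def average_score(scores: dict[str, float]) -> float:
    """Unweighted mean of all benchmark scores (an assumption, see above)."""
    return sum(scores.values()) / len(scores)


if __name__ == "__main__":
    avg = average_score(SCORES)
    # Benchmark count, mean to one decimal place, mean rounded to an integer.
    print(len(SCORES), round(avg, 1), round(avg))  # 13 50.8 51
```

Note that this mean mixes individual benchmarks with two composite indices (the Artificial Analysis Intelligence and Coding indices), so it is best read as a rough summary rather than a calibrated metric.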