Google

Gemini 2.5 Flash (Non-reasoning)

Unknown Size

By Google • Released 2025-05-20

Avg Score: 40 (across all benchmarks)
Benchmarks participated: 14

Benchmark Performance

Benchmark                               Category                Score
MATH-500                                Reasoning               93.2
MMLU-Pro                                Knowledge               80.9
GPQA Diamond                            Knowledge               68.3
AIME 2025                               Reasoning               60.3
LiveCodeBench                           Coding                  49.5
LCR                                     Long-Context Reasoning  45.9
IFBench                                 Agent                   39
SciCode                                 Reasoning Knowledge     29.1
SWE-bench (Bash Only)                   Coding Agent            28.73
Artificial Analysis Intelligence Index  Knowledge               20.5
Artificial Analysis Coding Index        Coding                  17.8
𝜏²-Bench Telecom                        Reasoning Knowledge     14.9
Terminal-Bench Hard                     Agent Coding            12.1
HLE                                     Knowledge Multi-Modal   5.1
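
The page does not say how Avg Score is derived. As a rough check, the sketch below assumes it is an unweighted arithmetic mean of the 14 scores listed above, rounded to the nearest integer; that assumption is not confirmed by the source, and the variable names are illustrative.

```python
from statistics import mean

# Scores copied from the Benchmark Performance listing above.
scores = {
    "MATH-500": 93.2,
    "MMLU-Pro": 80.9,
    "GPQA Diamond": 68.3,
    "AIME 2025": 60.3,
    "LiveCodeBench": 49.5,
    "LCR": 45.9,
    "IFBench": 39,
    "SciCode": 29.1,
    "SWE-bench (Bash Only)": 28.73,
    "Artificial Analysis Intelligence Index": 20.5,
    "Artificial Analysis Coding Index": 17.8,
    "𝜏²-Bench Telecom": 14.9,
    "Terminal-Bench Hard": 12.1,
    "HLE": 5.1,
}

# Assumption: Avg Score is a plain unweighted mean over all benchmarks.
avg = mean(list(scores.values()))
print(f"{len(scores)} benchmarks, unweighted mean = {avg:.2f}")
```

Under that assumption the mean comes to roughly 40.4, which matches the displayed Avg Score of 40 after rounding. Note that the scores mix different scales and composite indices, so the average is only a coarse summary.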