Google

Gemini 3 Flash Preview (Reasoning)

Unknown Size

By Google • Released 2025-12-17

Capability Radar

Avg Score
67

Across all benchmarks

Participated
12
Benchmarks

Benchmark Performance

Benchmark Category Score
AIME 2025
Reasoning
97
LiveCodeBench
Coding
90.8
GPQA Diamond
Knowledge
89.8
MMLU-Pro
Knowledge
89
𝜏²-Bench Telecom
Reasoning Knowledge
80.4
IFBench
Agent
78
LCR
Long-Context Reasoning
66.3
SciCode
Reasoning Knowledge
50.6
Artificial Analysis Intelligence Index
Knowledge
46.4
Artificial Analysis Coding Index
Coding
42.6
Terminal-Bench Hard
Agent Coding
38.6
HLE
Knowledge Multi-Modal
34.7