Gemini 3.0 promo art

Platform Core Capabilities & Examples

Highlights across authoritative benchmarks and real-world examples.

ARC-AGI-2 results

ARC-AGI-2 Results

On the ultra-difficult ARC-AGI-2 general intelligence test, Gemini 3.0 with thinking mode reaches ~35% accuracy, while others stay below 20%.

HLE benchmark score

Top HLE Benchmark Score

On the notoriously hard “Human Last Exam (HLE)” benchmark, Gemini 3.0 scores 32.4% — outperforming GPT-5 (high) and Grok 4.

SVG pelican output

SVG Pelican Test

Gemini 3.0 handles images including SVGs with ease. The famous cycling pelican SVG test wowed the community with Gemini 3.0 Pro output.

Gundam and controller render

Gundam & Controller Rendering

Previously hard Gundam robot and Switch controller renders now look visibly upgraded — getting very close to real product photos.

Coding arena performance

Leading in Coding Arena

Gemini 3.0 Pro leads the coding arena by a wide margin.

Series capability leap

Series Leap Forward

Gemini 3 has become truly super‑intelligent; the series makes a leap forward.

Prompt image example

Prompted Image Example

Gemini 3.0 Pro generated an image from the prompt “the Power Rangers standing in the scene with typical poses the power rangers do”.

Reference:“Gemini 3 internal tests praised as possibly the best frontend dev model”. Examples and numbers compiled from the article and community tests.