Gemini 2.5 Flash

by Google

Second-generation Flash. 1M context, excellent speed/cost ratio. Supports thinking budgets.

Context window 1M tokens

Max output 64K tokens

Pricing $0.30 in / $2.50 out per 1M tokens (API)
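
As a rough illustration of how the per-token rates above translate into a per-request cost, here is a minimal sketch. The helper name `estimate_cost_usd` is hypothetical, and the calculation assumes simple linear billing at the listed rates; it does not account for any other billing rules, such as how thinking tokens are counted.

```python
# Rates copied from the pricing line above (USD per 1M tokens, API).
INPUT_USD_PER_MTOK = 0.30
OUTPUT_USD_PER_MTOK = 2.50


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Hypothetical helper: linear cost estimate from token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


# Example: a 100K-token prompt with a 2K-token reply.
cost = estimate_cost_usd(100_000, 2_000)
print(f"${cost:.4f}")
```

Under these assumptions, input tokens dominate the cost of long-context requests even though the output rate is higher, because the prompt can be far larger than the 64K output cap.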

Released May 1, 2025

// specifications

Specs & capabilities

// notes

Workhorse model optimized for speed.

// more_from_google

Google's most capable model. 1M context, multimodal. Price doubles for context >200K.

Cost-efficient model in the Gemini 3.1 family. 1M context, lowest pricing tier.

Third-generation Flash model. 1M context, fast and affordable.

Smallest 2.5 model. 1M context, best budget option in the Gemini lineup.

Previous-gen flagship with thinking budgets. 1M context. Price doubles for context >200K.

Mid-size open-weight Gemma. 12B parameters, 128K context, vision.

// external_links