// model_detail
active balanced
Gemini 2.5 Flash
by Google
Second-generation Flash. 1M context, excellent speed/cost ratio. Supports thinking budgets.
Context window 1M 1M tokens
Max output 64K 64K tokens
Pricing /1M tok $0.30 in / $2.50 out per 1M tokens (API)
Released May 1, 2025
// specifications
Specs & capabilities
API details
- API model ID
gemini-2.5-flash- Internal ID
gemini-2-5-flash- Status
- active
- Type
- balanced
Capabilities
- Vision
- 👁️ Yes
- Function calling
- 🔧 Yes
- Knowledge cutoff
- 📅 2025-01
Pricing
- Input
- $0.30 / 1M tokens
- Output
- $2.50 / 1M tokens
- Source
- First-party API pricing
Lifecycle
- Predecessor
- Gemini 2.0 Flash
// notes
Additional notes
Workhorse speed model
// more_from_google
More Google models
Gemini 3.1 Pro preview frontier
Google's most capable model. 1M context, multimodal. Price doubles for context >200K.
Gemini 3.1 Flash-Lite preview fast
Cost-efficient model in the Gemini 3.1 family. 1M context, lowest pricing tier.
Gemini 3 Flash preview balanced
Third-generation Flash model. 1M context, fast and affordable.
Gemini 2.5 Flash-Lite active fast
Smallest 2.5 model. 1M context, best budget option in the Gemini lineup.
Gemini 2.5 Pro active frontier
Previous-gen flagship with thinking budgets. 1M context. Price doubles for context >200K.
Gemma 3 12B active open-weight
Mid-size open-weight Gemma. 12B parameters, 128K context, vision.
// external_links