Gemma 3 4B

by Google

Small open-weight Gemma. 4B parameters, 128K context.

Context window 128K 128K tokens

Max output 8K 8K tokens

Pricing /1M tok $0.02 in / $0.02 out per 1M tokens (OpenRouter)

Released Mar 12, 2025

// specifications

Specs & capabilities

// data_confidence

⚠️ Pricing shown via OpenRouter, not first-party

// notes

Open-weight, 4B params

// more_from_google

Google's most capable model. 1M context, multimodal. Price doubles for context >200K.

Cost-efficient model in the Gemini 3.1 family. 1M context, lowest pricing tier.

Third-generation Flash model. 1M context, fast and affordable.

Smallest 2.5 model. 1M context, best budget option in the Gemini lineup.

Second-generation Flash. 1M context, excellent speed/cost ratio. Supports thinking budgets.

Previous-gen flagship with thinking budgets. 1M context. Price doubles for context >200K.

// external_links