GLM-4 Flash

by ZAI

Fast, free-tier GLM model with a 128K-token context window. Intended for basic tasks at minimal cost.

Context window: 128K tokens

Max output: 4K tokens

Pricing: $0.01 in / $0.01 out per 1M tokens (API)

Released: Aug 1, 2024
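As a rough illustration of what the listed rates mean in practice, the sketch below estimates the cost of a single request. It assumes the prices shown above ($0.01 per 1M tokens for both input and output) and that billing is linear in token count. The function name `estimate_cost_usd` and the constants are hypothetical and not part of any ZAI SDK.

```python
# Minimal cost sketch. Rates are copied from the listing above;
# the helper name and linear-billing assumption are illustrative only.
PRICE_IN_PER_M = 0.01   # USD per 1M input tokens (from the listing)
PRICE_OUT_PER_M = 0.01  # USD per 1M output tokens (from the listing)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request at the listed rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * PRICE_IN_PER_M + output_tokens * PRICE_OUT_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 100,000-token prompt with a 4,000-token reply.
    print(f"{estimate_cost_usd(100_000, 4_000):.5f} USD")
```

At these rates even a request that nearly fills the context window costs a small fraction of a cent, so for most basic workloads the practical limits are the context and output sizes rather than price.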

// specifications

Specs & capabilities

// notes

Free-tier GLM model.

// more_from_zai

Zhipu's latest model. Optimized for coding and agent tasks. Available via z.ai and OpenRouter.

Fast variant of GLM-5. Lower cost, higher speed.

Zhipu's GLM-5 model. Strong reasoning and coding. Available via z.ai API.

Vision-capable GLM-4 variant. 128K context, image understanding.

Previous-generation GLM flagship. 128K context. Superseded by GLM-5.

// external_links