// model_detail
active fast
GLM-4 Flash
by ZAI
Free/fast GLM model. 128K context. Minimal cost for basic tasks.
Context window 128K 128K tokens
Max output 4K 4K tokens
Pricing /1M tok $0.01 in / $0.01 out per 1M tokens (API)
Released Aug 1, 2024
// specifications
Specs & capabilities
API details
- API model ID
glm-4-flash- Internal ID
glm-4-flash- Status
- active
- Type
- fast
Capabilities
- Vision
- No
- Function calling
- 🔧 Yes
- Knowledge cutoff
- 📅 2025-01
Pricing
- Input
- $0.01 / 1M tokens
- Output
- $0.01 / 1M tokens
- Source
- First-party API pricing
// notes
Additional notes
Free tier GLM model
// more_from_zai
More ZAI models
GLM-5.1 active frontier
Zhipu's latest model. Optimized for coding and agent tasks. Available via z.ai and OpenRouter.
GLM-5 Turbo active fast
Fast variant of GLM-5. Lower cost, higher speed.
GLM-5 active frontier
Zhipu's GLM-5 model. Strong reasoning and coding. Available via z.ai API.
GLM-4V legacy multimodal
Vision-capable GLM-4 variant. 128K context, image understanding.
GLM-4 legacy balanced
Previous-generation GLM flagship. 128K context. Superseded by GLM-5.
// external_links