changelogs.info
OpenClaw Claude Code Codex Gemini Kilo Code Hermes Models Dispatches

AI Model Directory

99 models across 11 providers · 57 live or preview · timeline →

99models
11providers
57live
99 shown
Provider
Type
Status

Anthropic

All Anthropic models →
frontier active

Claude Opus 4.6

claude-opus-4-6
Context 1M Output 128K Pricing /1M $$5.00 / $$25.00

Anthropic's flagship model for complex agents and coding. Extended + adaptive thinking, 1M context, 128K output.

Released Mar 24, 2026 Vision Functions
balanced active

Claude Sonnet 4.6

claude-sonnet-4-6
Context 1M Output 64K Pricing /1M $$3.00 / $$15.00

Fast balanced model with extended + adaptive thinking. 1M context, 64K output.

Released Mar 24, 2026 Vision Functions
fast active

Claude Haiku 4.5

claude-haiku-4-5-20251001
Context 200K Output 64K Pricing /1M $$1.00 / $$5.00

Fastest Claude model with near-frontier intelligence. Extended thinking, 200K context, 64K output.

Released Oct 1, 2025 Vision Functions
frontier legacy

Claude Opus 4.5

claude-opus-4-5-20251101
Context 200K Output 64K Pricing /1M $$5.00 / $$25.00

Previous-generation flagship. Extended thinking, 200K context.

Released Nov 1, 2025 Vision Functions
frontier legacy

Claude Opus 4.1

claude-opus-4-1-20250805
Context 200K Output 32K Pricing /1M $$15.00 / $$75.00

Premium reasoning model with extended thinking. 200K context, 32K output.

Released Aug 5, 2025 Vision Functions
frontier legacy

Claude Opus 4

claude-opus-4-20250514
Context 200K Output 32K Pricing /1M $$15.00 / $$75.00

First-generation Claude 4 flagship. Extended thinking, 200K context, 32K output.

Released May 14, 2025 Vision Functions
frontier legacy

Claude 3 Opus

claude-3-opus-20240229
Context 200K Output 4K Pricing /1M $$15.00 / $$75.00

Original Claude 3 flagship model. 200K context. Still callable but superseded.

Released Feb 29, 2024 Vision Functions
balanced legacy

Claude Sonnet 4.5

claude-sonnet-4-5-20250929
Context 200K Output 64K Pricing /1M $$3.00 / $$15.00

Previous-generation balanced model with extended thinking. 200K context.

Released Sep 29, 2025 Vision Functions
balanced legacy

Claude Sonnet 4

claude-sonnet-4-20250514
Context 200K Output 64K Pricing /1M $$3.00 / $$15.00

First-generation Claude 4 balanced model. Extended thinking, 200K context.

Released May 14, 2025 Vision Functions
balanced legacy

Claude 3.5 Sonnet

claude-3-5-sonnet-20241022
Context 200K Output 8K Pricing /1M $$3.00 / $$15.00

Claude 3.5 generation balanced model. 200K context, 8K output. Still callable.

Released Oct 22, 2024 Vision Functions
balanced legacy

Claude 3 Sonnet

claude-3-sonnet-20240229
Context 200K Output 4K Pricing /1M $$3.00 / $$15.00

Original Claude 3 balanced model. 200K context.

Released Feb 29, 2024 Vision Functions
fast deprecated

Claude 3 Haiku

claude-3-haiku-20240307
Context 200K Output 4K Pricing /1M $$0.25 / $$1.25

Deprecated fast model. Will be retired April 19, 2026. Migrate to Claude Haiku 4.5.

Released Mar 7, 2024 Vision Functions

DeepSeek

All DeepSeek models →
reasoning active

DeepSeek V3.2 Thinking

deepseek-reasoner
Context 128K Output 64K Pricing /1M $$0.28 / $$0.42

DeepSeek V3.2 thinking mode via deepseek-reasoner endpoint. Up to 64K output with reasoning chains.

Released Mar 1, 2026 Functions
balanced active

DeepSeek V3.2

deepseek-chat
Context 128K Output 8K Pricing /1M $$0.28 / $$0.42

DeepSeek's latest model via deepseek-chat endpoint. V3.2 non-thinking mode. 128K context.

Released Mar 1, 2026 Functions
reasoning legacy

DeepSeek R1

deepseek-reasoner
Context 128K Output 32K Pricing /1M $$0.55 / $$2.19

Open-weight reasoning model. Chain-of-thought with distilled variants. Superseded by V3.2 thinking.

Released Jan 20, 2025 Functions
open-weight legacy

DeepSeek V3

deepseek-chat
Context 128K Output 8K Pricing /1M $$0.27 / $$1.10

Open-weight MoE model. 671B total, 37B active params. Superseded by V3.2 on API.

Released Dec 26, 2024 Functions
open-weight legacy

DeepSeek V2

deepseek-v2
Context 128K Output 8K Pricing /1M $$0.14 / $$0.28

Previous-generation MoE model. 236B total params. Superseded by V3.

Released May 1, 2024 Functions

Google

All Google models →
frontier active

Gemini 2.5 Pro

gemini-2.5-pro
Context 1M Output 64K Pricing /1M $$1.25 / $$10.00

Previous-gen flagship with thinking budgets. 1M context. Price doubles for context >200K.

Released Mar 25, 2025 Vision Functions
balanced active

Gemini 2.5 Flash

gemini-2.5-flash
Context 1M Output 64K Pricing /1M $$0.30 / $$2.50

Second-generation Flash. 1M context, excellent speed/cost ratio. Supports thinking budgets.

Released May 1, 2025 Vision Functions
fast active

Gemini 2.5 Flash-Lite

gemini-2.5-flash-lite
Context 1M Output 64K Pricing /1M $$0.10 / $$0.40

Smallest 2.5 model. 1M context, best budget option in the Gemini lineup.

Released Sep 1, 2025 Vision Functions
open-weight active

Gemma 3 12B

google/gemma-3-12b-it
Context 128K Output 8K Pricing /1M $$0.04 / $$0.04

Mid-size open-weight Gemma. 12B parameters, 128K context, vision.

Released Mar 12, 2025 Vision
open-weight active

Gemma 3 27B

google/gemma-3-27b-it
Context 128K Output 8K Pricing /1M $$0.08 / $$0.08

Google's open-weight model. 27B parameters, 128K context, vision support.

Released Mar 12, 2025 Vision
open-weight active

Gemma 3 4B

google/gemma-3-4b-it
Context 128K Output 8K Pricing /1M $$0.02 / $$0.02

Small open-weight Gemma. 4B parameters, 128K context.

Released Mar 12, 2025 Vision
open-weight active

Gemma 3 1B

google/gemma-3-1b-it
Context 32K Output 8K Pricing /1M $$0.01 / $$0.01

Smallest Gemma 3 model. 1B parameters, 32K context. Text only.

Released Mar 12, 2025
frontier preview

Gemini 3.1 Pro

gemini-3.1-pro-preview
Context 1M Output 64K Pricing /1M $$2.00 / $$12.00

Google's most capable model. 1M context, multimodal. Price doubles for context >200K.

Released Mar 9, 2026 Vision Functions
balanced preview

Gemini 3 Flash

gemini-3-flash-preview
Context 1M Output 64K Pricing /1M $$0.50 / $$3.00

Third-generation Flash model. 1M context, fast and affordable.

Released Dec 1, 2025 Vision Functions
fast preview

Gemini 3.1 Flash-Lite

gemini-3.1-flash-lite-preview
Context 1M Output 64K Pricing /1M $$0.25 / $$1.50

Cost-efficient model in the Gemini 3.1 family. 1M context, lowest pricing tier.

Released Mar 9, 2026 Vision Functions
frontier legacy

Gemini 1.5 Pro

gemini-1.5-pro
Context 2M Output 8K Pricing /1M $$1.25 / $$5.00

First model with 2M context. Tiered pricing (doubles for >128K). Still available.

Released Feb 15, 2024 Vision Functions
balanced legacy

Gemini 1.5 Flash

gemini-1.5-flash
Context 1M Output 8K Pricing /1M $$0.07 / $$0.30

First-gen Flash model. 1M context. Superseded by 2.5 Flash.

Released May 14, 2024 Vision Functions
balanced deprecated

Gemini 2.0 Flash

gemini-2.0-flash
Context 1M Output 8K Pricing /1M $$0.10 / $$0.40

Deprecated. Will be shut down June 1, 2026. Migrate to Gemini 2.5 Flash.

Released Dec 11, 2024 Vision Functions

Meta

All Meta models →
open-weight active

Llama 4 Scout

meta-llama/llama-4-scout
Context 10M Output 64K Pricing /1M $$0.10 / $$0.40

Smaller MoE Llama 4 model. 109B active params, massive 10M context window.

Released Apr 5, 2025 Vision Functions
open-weight active

Llama 4 Maverick

meta-llama/llama-4-maverick
Context 1M Output 64K Pricing /1M $$0.15 / $$0.60

Meta's MoE model with 400B active parameters. 1M context, vision support. Strong performance at low cost.

Released Apr 5, 2025 Vision Functions
open-weight active

Llama 3.3 70B

meta-llama/llama-3.3-70b-instruct
Context 128K Output 8K Pricing /1M $$0.12 / $$0.30

Strong 70B dense model. 128K context. Best Llama 3.x text-only model.

Released Dec 6, 2024 Functions
open-weight active

Llama 3.2 11B Vision

meta-llama/llama-3.2-11b-vision-instruct
Context 128K Output 8K Pricing /1M $$0.05 / $$0.10

11B multimodal model. 128K context. Good efficiency for vision tasks.

Released Sep 25, 2024 Vision Functions
open-weight active

Llama 3.2 90B Vision

meta-llama/llama-3.2-90b-vision-instruct
Context 128K Output 8K Pricing /1M $$0.18 / $$0.45

90B multimodal model. 128K context. Best Llama 3.x vision model.

Released Sep 25, 2024 Vision Functions
open-weight legacy

Llama 3.1 405B

meta-llama/llama-3.1-405b-instruct
Context 128K Output 8K Pricing /1M $$0.55 / $$0.80

405B dense model. Was the largest open-weight model at release. Superseded by Llama 4.

Released Jul 23, 2024 Functions
open-weight legacy

Llama 3.1 70B

meta-llama/llama-3.1-70b-instruct
Context 128K Output 8K Pricing /1M $$0.12 / $$0.30

70B dense model. Standard workhorse before Llama 3.3.

Released Jul 23, 2024 Functions
open-weight legacy

Llama 3.1 8B

meta-llama/llama-3.1-8b-instruct
Context 128K Output 8K Pricing /1M $$0.02 / $$0.05

8B dense model. Smallest in the Llama 3.1 family. Good for local inference.

Released Jul 23, 2024 Functions
open-weight legacy

Llama 3 70B

meta-llama/llama-3-70b-instruct
Context 8K Output 4K Pricing /1M $$0.12 / $$0.30

Original Llama 3 70B. Only 8K context. Superseded by 3.1 with 128K.

Released Apr 18, 2024 Functions
open-weight legacy

Llama 3 8B

meta-llama/llama-3-8b-instruct
Context 8K Output 4K Pricing /1M $$0.02 / $$0.05

Original Llama 3 8B. Only 8K context. Superseded by 3.1 with 128K.

Released Apr 18, 2024 Functions

Minimax

All Minimax models →
balanced active

MiniMax M2.7

minimax-m2.7
Context 205K Output 65K Pricing /1M $$0.30 / $$1.20

Next-gen LLM designed for autonomous real-world productivity. 205K context, large output window.

Released Mar 18, 2026 Functions
balanced legacy

MiniMax-01

minimax-01
Context 1M Output 8K Pricing /1M $$0.10 / $$0.40

Previous-generation MiniMax model. 456B MoE with 1M context. Superseded by M2.7.

Released Jan 15, 2025 Functions

Mistral

All Mistral models →
coding active

Codestral 25.01

codestral-2501
Context 256K Output 8K Pricing /1M $$0.30 / $$0.90

Mistral's coding specialist. 256K context, strong code generation.

Released Jan 1, 2025 Functions
coding active

Devstral Small 2

labs-devstral-small-2512
Context 128K Output 8K Pricing /1M $$0.15 / $$0.45

Mistral's latest coding-focused model. Agentic coding capabilities.

Released Dec 1, 2025 Functions
coding active

Devstral Medium 1.0

devstral-medium-2507
Context 128K Output 8K Pricing /1M $$0.80 / $$2.40

Medium-size coding model. Balanced capability and cost.

Released Jul 1, 2025 Functions
frontier active

Mistral Large 2.1

mistral-large-2411
Context 128K Output 8K Pricing /1M $$2.00 / $$6.00

Mistral's largest non-reasoning model. 123B parameters, 128K context.

Released Nov 1, 2024 Functions
reasoning active

Magistral Medium 1.1

magistral-medium-2507
Context 128K Output 32K Pricing /1M $$2.00 / $$6.00

Mistral's flagship reasoning model with chain-of-thought capabilities. 128K context.

Released Jul 10, 2025 Functions
reasoning active

Magistral Small 1.1

magistral-small-2507
Context 128K Output 32K Pricing /1M $$0.50 / $$1.50

Smaller reasoning model. Open-source. 128K context.

Released Jul 10, 2025 Functions
multimodal active

Pixtral Large

pixtral-large-2411
Context 128K Output 8K Pricing /1M $$2.00 / $$6.00

Vision-capable variant of Mistral Large. 128K context, image understanding.

Released Nov 1, 2024 Vision Functions
open-weight active

Mistral Small 3.1

mistral-small-2503
Context 128K Output 8K Pricing /1M $$0.10 / $$0.30

Open-weight small model. 24B parameters, 128K context, vision support.

Released Mar 1, 2025 Vision Functions
open-weight active

Pixtral 12B

pixtral-12b-2409
Context 128K Output 8K Pricing /1M $$0.10 / $$0.30

Open-weight 12B multimodal model. 128K context, image understanding.

Released Sep 1, 2024 Vision Functions
open-weight legacy

Mistral Small 3.0

mistral-small-2501
Context 128K Output 8K Pricing /1M $$0.10 / $$0.30

First Mistral Small 3 model. 24B params, text-only. Superseded by 3.1 with vision.

Released Jan 1, 2025 Functions
open-weight legacy

Mixtral 8x7B

open-mixtral-8x7b
Context 32K Output 8K Pricing /1M $$0.10 / $$0.30

Pioneering open-weight MoE model. 8 experts, 7B each. 32K context.

Released Dec 11, 2023 Functions
open-weight legacy

Mistral 7B

open-mistral-7b
Context 32K Output 8K Pricing /1M $$0.02 / $$0.06

Mistral's first open-weight model. 7B params, 32K context. Still used for local inference.

Released Sep 27, 2023

Moonshot AI

All Moonshot AI models →
reasoning active

Kimi K2 Thinking

kimi-k2-thinking
Context 256K Output 32K Pricing /1M $$0.60 / $$3.00

Thinking model based on K2. General agentic and reasoning capabilities, deep reasoning tasks.

Released Jul 11, 2025 Functions
multimodal active

Kimi K2.5

kimi-k2.5
Context 256K Output 32K Pricing /1M $$0.60 / $$3.00

Kimi's most versatile model. Native multimodal architecture, vision + text, thinking and non-thinking modes. 256K context.

Released Mar 1, 2026 Vision Functions
balanced active

Kimi K2

kimi-k2
Context 256K Output 16K Pricing /1M $$0.60 / $$2.50

MoE model with 1T total / 32B active params. Exceptional coding and agent capabilities. 256K context.

Released Jul 11, 2025 Functions
balanced legacy

Moonshot v1 128K

moonshot-v1-128k
Context 128K Output 8K Pricing /1M $$0.80 / $$3.20

Original Moonshot model. 128K context. Superseded by K2 series.

Released Mar 1, 2024 Functions

OpenAI

All OpenAI models →
coding active

GPT-5.3 Codex

gpt-5.3-codex
Context 400K Output 64K Pricing /1M $$1.75 / $$14.00

OpenAI's latest coding-focused model. 91.5% LiveCodeBench. 400K context.

Released Feb 1, 2026 Vision Functions
frontier active

GPT-5.4

gpt-5.4
Context 1.1M Output 128K Pricing /1M $$2.50 / $$15.00

OpenAI's most capable model for complex reasoning and coding. 1.1M context, multimodal.

Released Mar 5, 2026 Vision Functions
reasoning active

GPT-5.4 Pro

gpt-5.4-pro
Context 1.1M Output 128K Pricing /1M $$30.00 / $$180.00

Highest-tier reasoning model with deep thinking. 1.1M context.

Released Mar 5, 2026 Vision Functions
reasoning active

o3

o3
Context 200K Output 100K Pricing /1M $$2.00 / $$8.00

OpenAI's reasoning model. Excels at math, science, and complex multi-step problems. 85.3% GPQA.

Released Apr 16, 2025 Vision Functions
reasoning active

o4 Mini

o4-mini
Context 200K Output 100K Pricing /1M $$1.10 / $$4.40

Latest mini reasoning model. 83.2% GPQA, 85.9% coding benchmarks.

Released Apr 16, 2025 Vision Functions
balanced active

GPT-5.4 Mini

gpt-5.4-mini
Context 1.1M Output 64K Pricing /1M $$0.75 / $$4.50

Smaller 5.4 variant for coding, computer use, and subagents. 1.1M context.

Released Mar 5, 2026 Vision Functions
balanced active

GPT-5.3 Chat

gpt-5.3-chat-latest
Context 128K Output 32K Pricing /1M $$1.75 / $$14.00

Chat-optimized variant of the GPT-5.3 series.

Released Mar 1, 2026 Vision Functions
fast active

GPT-5.4 Nano

gpt-5.4-nano
Context 1.1M Output 64K Pricing /1M $$0.20 / $$1.25

Cheapest model in the GPT-5.4 family for high-volume simple tasks.

Released Mar 5, 2026 Vision Functions
frontier legacy

GPT-5.2

gpt-5.2
Context 400K Output 64K Pricing /1M $$0.88 / $$7.00

Previous-generation GPT-5 flagship. 400K context.

Released Dec 1, 2025 Vision Functions
frontier legacy

GPT-5

gpt-5
Context 400K Output 32K Pricing /1M $$1.25 / $$10.00

Original GPT-5 frontier model. 400K context.

Released Aug 1, 2025 Vision Functions
frontier legacy

GPT-4

gpt-4
Context 8K Output 4K Pricing /1M $$30.00 / $$60.00

Original GPT-4 model. 8K context. Still available but expensive for its capability.

Released Mar 14, 2023 Functions
reasoning legacy

o3 Mini

o3-mini
Context 200K Output 100K Pricing /1M $$1.10 / $$4.40

Smaller reasoning model. Good balance of reasoning capability and cost. 79.1% GPQA.

Released Jan 31, 2025 Functions
reasoning legacy

o1

o1
Context 200K Output 100K Pricing /1M $$15.00 / $$60.00

OpenAI's first reasoning model. Still available but superseded by o3.

Released Dec 5, 2024 Vision Functions
balanced legacy

GPT-4.1

gpt-4.1-2025-04-14
Context 1M Output 32K Pricing /1M $$2.00 / $$8.00

GPT-4.1 model with 1M context window. Good instruction following.

Released Apr 14, 2025 Vision Functions
balanced legacy

GPT-4.1 Mini

gpt-4.1-mini-2025-04-14
Context 1M Output 32K Pricing /1M $$0.40 / $$1.60

Smaller GPT-4.1 model with 1M context. Cost-efficient.

Released Apr 14, 2025 Vision Functions
balanced legacy

GPT-4o

gpt-4o-2024-08-06
Context 128K Output 16K Pricing /1M $$2.50 / $$10.00

OpenAI's former flagship. Multimodal (text + image). 128K context.

Released May 13, 2024 Vision Functions
balanced legacy

GPT-4 Turbo

gpt-4-turbo-2024-04-09
Context 128K Output 4K Pricing /1M $$5.00 / $$15.00

GPT-4 Turbo with 128K context and vision. Superseded by GPT-4o.

Released Apr 9, 2024 Vision Functions
fast legacy

GPT-4o Mini

gpt-4o-mini-2024-07-18
Context 128K Output 16K Pricing /1M $$0.15 / $$0.60

Smaller GPT-4o variant. Very cost-effective for production use.

Released Jul 18, 2024 Vision Functions

Qwen

All Qwen models →
coding active

Qwen2.5 Coder 32B

qwen/qwen2.5-coder-32b-instruct
Context 128K Output 8K Pricing /1M $$0.08 / $$0.22

Open-weight coding specialist. 32B params, 128K context.

Released Nov 1, 2024 Functions
frontier active

Qwen3 Max

qwen3-max
Context 262K Output 32K Pricing /1M $$0.35 / $$1.40

Alibaba's flagship Qwen model. 262K context. Supports thinking mode with chain-of-thought. Tiered pricing by context length.

Released Sep 23, 2025 Functions
reasoning active

QwQ 32B

qwen/qwq-32b
Context 128K Output 16K Pricing /1M $$0.08 / $$0.22

Open-weight reasoning model. 32B params, chain-of-thought. Budget reasoning option.

Released Mar 5, 2025 Functions
balanced active

Qwen3.5 Plus

qwen3.5-plus
Context 1M Output 32K Pricing /1M $$0.11 / $$0.67

Qwen3.5 series balanced model. Text/image/video input. 1M context, faster and cheaper than Qwen3-Max.

Released Mar 1, 2026 Vision Functions
fast active

Qwen3.5 Flash

qwen3.5-flash
Context 1M Output 32K Pricing /1M $$0.03 / $$0.28

Qwen3.5 speed model. 1M context, lowest cost in Qwen lineup.

Released Mar 1, 2026 Functions
open-weight active

Qwen3 235B-A22B

qwen/qwen3-235b-a22b
Context 128K Output 8K Pricing /1M $$0.15 / $$0.60

Open-weight MoE Qwen3. 235B total, 22B active params. 128K context.

Released Apr 28, 2025 Functions
open-weight active

Qwen3 32B

qwen/qwen3-32b
Context 128K Output 8K Pricing /1M $$0.08 / $$0.22

Open-weight dense Qwen3. 32B params, 128K context.

Released Apr 28, 2025 Functions
open-weight active

Qwen2.5 VL 72B

qwen/qwen2.5-vl-72b-instruct
Context 128K Output 8K Pricing /1M $$0.15 / $$0.40

Open-weight multimodal model. 72B params, 128K context, strong vision capabilities.

Released Jan 28, 2025 Vision Functions
open-weight legacy

Qwen2.5 72B

qwen/qwen2.5-72b-instruct
Context 128K Output 8K Pricing /1M $$0.15 / $$0.40

Previous-gen open-weight model. 72B dense. Superseded by Qwen3.

Released Sep 19, 2024 Functions

xAI

All xAI models →
coding active

Grok Code Fast 1

grok-code-fast-1
Context 256K Output 16K Pricing /1M $$0.20 / $$1.50

xAI's coding-focused model. 256K context, optimized for code generation.

Released Aug 1, 2025 Functions
frontier active

Grok 4.20

grok-4.20
Context 2M Output 128K Pricing /1M $$2.00 / $$6.00

xAI's newest flagship with industry-leading speed and agentic tool calling. 2M context, lowest hallucination rate.

Released Mar 9, 2026 Vision Functions
frontier active

Grok 4.20 Multi-Agent

grok-4.20-multi-agent
Context 2M Output 128K Pricing /1M $$2.00 / $$6.00

Variant of Grok 4.20 for collaborative agent-based workflows. Multiple agents operate in parallel.

Released Mar 9, 2026 Vision Functions
fast active

Grok 4.1 Fast

grok-4.1-fast
Context 2M Output 32K Pricing /1M $$0.20 / $$0.50

Fast non-reasoning variant. 2M context. Cost leader among frontier-class providers.

Released Nov 1, 2025 Vision Functions
frontier legacy

Grok 4

grok-4
Context 256K Output 32K Pricing /1M $$3.00 / $$15.00

Previous-generation Grok flagship. 256K context. Superseded by Grok 4.20.

Released Jul 1, 2025 Vision Functions
frontier legacy

Grok 3

grok-3
Context 131K Output 32K Pricing /1M $$3.00 / $$15.00

First stable Grok 3 release. 131K context. Superseded by Grok 4.

Released Apr 1, 2025 Vision Functions
fast legacy

Grok 4 Fast

grok-4-fast
Context 2M Output 32K Pricing /1M $$0.20 / $$0.50

Previous-generation fast Grok. 2M context. Superseded by Grok 4.1 Fast.

Released Sep 1, 2025 Vision Functions
fast legacy

Grok 3 Mini

grok-3-mini
Context 131K Output 32K Pricing /1M $$0.25 / $$0.50

Smaller Grok 3 variant with reasoning. 131K context. Cost-efficient.

Released Jun 1, 2025 Functions

ZAI

All ZAI models →
frontier active

GLM-5.1

glm-5.1
Context 128K Output 16K Pricing /1M $$0.80 / $$3.20

Zhipu's latest model. Optimized for coding and agent tasks. Available via z.ai and OpenRouter.

Released Feb 1, 2026 Vision Functions
frontier active

GLM-5

glm-5
Context 128K Output 16K Pricing /1M $$0.80 / $$3.20

Zhipu's GLM-5 model. Strong reasoning and coding. Available via z.ai API.

Released Nov 1, 2025 Vision Functions
fast active

GLM-5 Turbo

glm-5-turbo
Context 128K Output 8K Pricing /1M $$0.20 / $$0.80

Fast variant of GLM-5. Lower cost, higher speed.

Released Dec 1, 2025 Vision Functions
fast active

GLM-4 Flash

glm-4-flash
Context 128K Output 4K Pricing /1M $$0.01 / $$0.01

Free/fast GLM model. 128K context. Minimal cost for basic tasks.

Released Aug 1, 2024 Functions
multimodal legacy

GLM-4V

glm-4v
Context 128K Output 8K Pricing /1M $$0.80 / $$3.20

Vision-capable GLM-4 variant. 128K context, image understanding.

Released Jun 1, 2024 Vision Functions
balanced legacy

GLM-4

glm-4
Context 128K Output 8K Pricing /1M $$0.80 / $$3.20

Previous-generation GLM flagship. 128K context. Superseded by GLM-5.

Released Jun 1, 2024 Functions