Qwen3.5 Flash

by Qwen

Qwen3.5 speed model. 1M context, lowest cost in Qwen lineup.

Context window 1M 1M tokens

Max output 32K 32K tokens

Pricing /1M tok $0.03 in / $0.28 out per 1M tokens (API)

Released Mar 1, 2026

// specifications

Specs & capabilities

// notes

Cheapest Qwen, 1M context

// more_from_qwen

Qwen3.5 series balanced model. Text/image/video input. 1M context, faster and cheaper than Qwen3-Max.

Alibaba's flagship Qwen model. 262K context. Supports thinking mode with chain-of-thought. Tiered pricing by context length.

Open-weight MoE Qwen3. 235B total, 22B active params. 128K context.

Open-weight dense Qwen3. 32B params, 128K context.

Open-weight reasoning model. 32B params, chain-of-thought. Budget reasoning option.

Open-weight multimodal model. 72B params, 128K context, strong vision capabilities.

// external_links