// model_detail
active fast
Qwen3.5 Flash
by Qwen
Qwen3.5 speed model. 1M context, lowest cost in Qwen lineup.
Context window 1M 1M tokens
Max output 32K 32K tokens
Pricing /1M tok $0.03 in / $0.28 out per 1M tokens (API)
Released Mar 1, 2026
// specifications
Specs & capabilities
API details
- API model ID
qwen3.5-flash- Internal ID
qwen3-5-flash- Status
- active
- Type
- fast
Capabilities
- Vision
- No
- Function calling
- 🔧 Yes
- Knowledge cutoff
- 📅 2025-06
Pricing
- Input
- $0.03 / 1M tokens
- Output
- $0.28 / 1M tokens
- Source
- First-party API pricing
// notes
Additional notes
Cheapest Qwen, 1M context
// more_from_qwen
More Qwen models
Qwen3.5 Plus active balanced
Qwen3.5 series balanced model. Text/image/video input. 1M context, faster and cheaper than Qwen3-Max.
Qwen3 Max active frontier
Alibaba's flagship Qwen model. 262K context. Supports thinking mode with chain-of-thought. Tiered pricing by context length.
Qwen3 235B-A22B active open-weight
Open-weight MoE Qwen3. 235B total, 22B active params. 128K context.
Qwen3 32B active open-weight
Open-weight dense Qwen3. 32B params, 128K context.
QwQ 32B active reasoning
Open-weight reasoning model. 32B params, chain-of-thought. Budget reasoning option.
Qwen2.5 VL 72B active open-weight
Open-weight multimodal model. 72B params, 128K context, strong vision capabilities.
// external_links