Qwen3 Coder Flash

Fast, cost-effective Qwen 3 model for code generation.

qwen3-coder-flash
STABLE
1,000,000 context
Starting at $0.24/M (20% off) input tokens
Starting at $1.20/M (20% off) output tokens
Streaming
Tools
JSON Output

Select Provider

All Providers for Qwen3 Coder Flash

LLM Gateway routes requests to the best providers that are able to handle your prompt size and parameters.

Alibaba Cloud

alibaba/qwen3-coder-flash
Context Size
1M
Stability
STABLE
Pricing
Input
$0.30$0.24
/M
Cached
$0.06$0.05
/M
Output
$1.50$1.20
/M
-20% off
Tiered Pricing
≤32K tokens$0.30 in / $0.06 cached / $1.50 out$0.24 in / $0.05 cached / $1.20 out
≤128K tokens$0.50 in / $0.10 cached / $2.50 out$0.40 in / $0.08 cached / $2.00 out
≤256K tokens$0.80 in / $0.16 cached / $4.00 out$0.64 in / $0.13 cached / $3.20 out
>256K tokens$1.60 in / $0.32 cached / $9.60 out$1.28 in / $0.26 cached / $7.68 out
Capabilities
Streaming
Tools
JSON Output
Try in Playground