Qwen3 VL Flash

Fast Qwen 3 vision-language model for quick image tasks.

qwen3-vl-flash
STABLE
262,144 context
Starting at $0.04/M (20% off) input tokens
Starting at $0.32/M (20% off) output tokens
Streaming
Vision
Tools
JSON Output

Select Provider

All Providers for Qwen3 VL Flash

LLM Gateway routes requests to the best providers that are able to handle your prompt size and parameters.

Alibaba Cloud

alibaba/qwen3-vl-flash
Context Size
262.1k
Stability
STABLE
Pricing
Input
$0.05$0.04
/M
Cached
$0.01$0.01
/M
Output
$0.40$0.32
/M
-20% off
Tiered Pricing
≤32K tokens$0.05 in / $0.01 cached / $0.40 out$0.04 in / $0.01 cached / $0.32 out
≤128K tokens$0.07 in / $0.01 cached / $0.60 out$0.06 in / $0.01 cached / $0.48 out
>128K tokens$0.12 in / $0.02 cached / $0.96 out$0.10 in / $0.02 cached / $0.77 out
Capabilities
Streaming
Vision
Tools
JSON Output
Try in Playground