20% Off All Alibaba Cloud Qwen Models on LLM Gateway
LLM Gateway partners with Alibaba Cloud to bring you 20% off 26 Qwen AI models — including Qwen3 Max, Qwen3 Coder, QwQ reasoning, vision-language, and image generation models.

We're excited to announce our partnership with Alibaba Cloud — one of the world's largest cloud providers and the company behind the Qwen family of AI models.
Starting today, every Alibaba Cloud model on LLM Gateway comes with a 20% discount applied automatically. No promo codes, no special configuration. Just lower prices on 26 Qwen models across chat, coding, reasoning, vision, and image generation.
Why Qwen Models
Alibaba Cloud's Qwen series has quickly become one of the most competitive model families in the AI space. With context windows up to 1 million tokens, built-in tool calling, vision capabilities, and dedicated coding models, Qwen offers a strong alternative to models from OpenAI, Anthropic, and Google — often at a fraction of the cost.
Combined with our 20% discount, Qwen models are now among the most affordable high-quality AI models available through any API gateway.
Discounted Models
All 26 Alibaba Cloud models below now include the 20% discount, applied automatically to every request.
Flagship Models
The core Qwen models for general-purpose AI tasks — from content generation to complex reasoning.
| Model | Context | Discounted Input | Discounted Output | |
|---|---|---|---|---|
| Qwen3 Max 2026-01-23 | 262K | $0.96/M | $4.80/M | Try it |
| Qwen3 Max | 256K | $2.40/M | $12.00/M | Try it |
| Qwen Max | 128K | $1.28/M | $5.12/M | Try it |
| Qwen Max Latest | 128K | $1.28/M | $5.12/M | Try it |
| Qwen Plus | 128K | $0.32/M | $0.96/M | Try it |
| Qwen Plus Latest | 1M | $0.32/M | $0.96/M | Try it |
Fast & Cost-Effective Models
High-speed models optimized for low-latency, high-volume workloads — ideal for real-time applications and batch processing.
| Model | Context | Discounted Input | Discounted Output | |
|---|---|---|---|---|
| Qwen Flash | 1M | $0.04/M | $0.32/M | Try it |
| Qwen Turbo | 1M | $0.04/M | $0.16/M | Try it |
| Qwen Omni Turbo | 32K | $0.16/M | $0.64/M | Try it |
Coding Models
Purpose-built for code generation, completion, and analysis. Qwen3 Coder Plus supports up to 1M tokens of context — enough for entire codebases.
| Model | Context | Discounted Input | Discounted Output | |
|---|---|---|---|---|
| Qwen3 Coder Plus | 1M | $4.80/M | $48.00/M | Try it |
| Qwen3 Coder Flash | 1M | $0.24/M | $1.20/M | Try it |
| Qwen Coder Plus | 128K | $0.80/M | $4.00/M | Try it |
Vision-Language Models
Multimodal models that understand both text and images — useful for document analysis, visual Q&A, and image-based reasoning.
| Model | Context | Discounted Input | Discounted Output | |
|---|---|---|---|---|
| Qwen3 VL 235B A22B Instruct | 128K | $0.40/M | $1.60/M | Try it |
| Qwen3 VL 235B A22B Thinking | 128K | $0.40/M | $1.60/M | Try it |
| Qwen3 VL Plus | 256K | $0.16/M | $1.28/M | Try it |
| Qwen3 VL Flash | 256K | $0.04/M | $0.32/M | Try it |
| Qwen VL Max | 128K | $0.64/M | $2.56/M | Try it |
| Qwen VL Plus | 128K | $0.17/M | $0.51/M | Try it |
| Qwen2.5 VL 32B Instruct | 128K | $1.12/M | $3.36/M | Try it |
Reasoning Models
Models with built-in chain-of-thought reasoning — designed for math, logic, and multi-step problem solving.
| Model | Context | Discounted Input | Discounted Output | |
|---|---|---|---|---|
| QwQ Plus | 128K | $0.64/M | $1.92/M | Try it |
| Qwen3 Next 80B A3B Thinking | 128K | $0.40/M | $4.80/M | Try it |
| Qwen3 Next 80B A3B Instruct | 128K | $0.40/M | $1.60/M | Try it |
Image Generation & Editing Models
Text-to-image and image editing models for visual content creation.
| Model | Price per Request | |
|---|---|---|
| Qwen Image | $0.028 | Try it |
| Qwen Image Plus | $0.024 | Try it |
| Qwen Image Edit Plus | $0.032 | Try it |
| Qwen Image Edit Max | $0.064 | Try it |
How It Works
The 20% discount is applied automatically to all Alibaba Cloud models on LLM Gateway. There's nothing to configure — just use any Qwen model through our OpenAI-compatible API and pay the discounted rate.
1curl https://api.llmgateway.io/v1/chat/completions \2 -H "Authorization: Bearer YOUR_API_KEY" \3 -H "Content-Type: application/json" \4 -d '{5 "model": "alibaba/qwen3-max-2026-01-23",6 "messages": [{"role": "user", "content": "Hello!"}]7 }'
1curl https://api.llmgateway.io/v1/chat/completions \2 -H "Authorization: Bearer YOUR_API_KEY" \3 -H "Content-Type: application/json" \4 -d '{5 "model": "alibaba/qwen3-max-2026-01-23",6 "messages": [{"role": "user", "content": "Hello!"}]7 }'
All Qwen models support our full feature set: smart routing, automatic fallback, request caching, cost tracking, and real-time analytics.
Get Started
- Sign up for a free LLM Gateway account
- Browse all Alibaba Cloud models to compare pricing and capabilities
- Try any model in the Playground before integrating
If you have questions, reach out on GitHub or Discord.
Browse all discounted models | Try Qwen3 Max in the Playground | Get started