Models
Comprehensive list of all supported models and their providers
Use Case
Capabilities
Provider
Input Price ($/M tokens)
Output Price ($/M tokens)
Context Size (tokens)
91/227
Models
24/34
Providers
58
Vision Models (filtered)
84
Tool-enabled (filtered)
2
Free Models (filtered)
Kimi K2.6
moonshot30% off
kimi-k2.6Streaming
Vision
Tools
Reasoning
JSON Output
CanopyWave
Context: 262.1k30% off
Input
$0.50$0.35
-30% off/M tokens
Cached
$0.10$0.07
-30% off/M tokens
Output
$2.80$1.96
-30% off/M tokens
Claude Opus 4.7
anthropic30% off
claude-opus-4-7Streaming
Vision
Tools
Reasoning
Structured JSON Output
Native Web Search
Anthropic
Context: 1M
Input
$5.00
/M tokens
Cached
$0.50
/M tokens
Output
$25.00
/M tokens
+ $0.010 per search
GLM-5.1
glm10% off
glm-5.1Streaming
Tools
Reasoning
JSON Output
Native Web Search
NovitaAI
Context: 204.8k
Input
$1.40
/M tokens
Cached
$0.26
/M tokens
Output
$4.40
/M tokens
MiniMax M2.5 Highspeed
minimax
minimax-m2.5-highspeedStreaming
Reasoning
MiniMax
Context: 204.8k
Input
$0.60
/M tokens
Cached
$0.03
/M tokens
Output
$2.40
/M tokens
MiniMax M2.7 Highspeed
minimax
minimax-m2.7-highspeedStreaming
Reasoning
MiniMax
Context: 204.8k
Input
$0.60
/M tokens
Cached
$0.06
/M tokens
Output
$2.40
/M tokens
MiniMax M2.7
minimax
minimax-m2.7Streaming
Reasoning
Tools
JSON Output
Structured JSON Output
MiniMax
Context: 204.8k
Input
$0.30
/M tokens
Cached
$0.06
/M tokens
Output
$1.20
/M tokens
Gemini Pro Latest
google
gemini-pro-latestStreaming
Vision
Tools
Reasoning
Reasoning Budget
JSON Output
Structured JSON Output
Native Web Search
Google AI Studio
Context: 1.0M
Input
$2.00
/M tokens
Cached
$0.20
/M tokens
Output
$12.00
/M tokens
+ $0.014 per search
GPT-5.4 Nano
openai30% off
gpt-5.4-nanoStreaming
Vision
Tools
Reasoning
JSON Output
Structured JSON Output
Native Web Search
Azure
Context: 400k30% off
Input
$0.20$0.14
-30% off/M tokens
Cached
$0.02$0.01
-30% off/M tokens
Output
$1.25$0.88
-30% off/M tokens
+ $0.010 per search
GPT-5.4 Mini
openai30% off
gpt-5.4-miniStreaming
Vision
Tools
Reasoning
JSON Output
Structured JSON Output
Native Web Search
Azure
Context: 400k30% off
Input
$0.75$0.52
-30% off/M tokens
Cached
$0.07$0.05
-30% off/M tokens
Output
$4.50$3.15
-30% off/M tokens
+ $0.010 per search
Grok 4.20 Beta Reasoning (0309)
xai
grok-4-20-beta-0309-reasoningStreaming
Vision
Tools
Reasoning
JSON Output
xAI
Context: 2M
Input
$2.00
/M tokens
Cached
$0.20
/M tokens
Output
$6.00
/M tokens
Grok 4.20 Multi-Agent Beta (0309)
xaiModel Deactivated
grok-4-20-multi-agent-beta-0309Streaming
Vision
Tools
Reasoning
JSON Output
xAI
Context: 2M
Deactivated since Mar 27, 2026
Input
$2.00
/M tokens
Cached
$0.20
/M tokens
Output
$6.00
/M tokens
GPT-5.3 Chat
openai
gpt-5.3-chat-latestStreaming
Vision
Tools
Reasoning
Native Web Search
Azure
Context: 128k
Input
$1.75
/M tokens
Cached
$0.17
/M tokens
Output
$14.00
/M tokens
GPT-5.3 Codex
openai
gpt-5.3-codexStreaming
Vision
Tools
Reasoning
JSON Output
Native Web Search
Azure
Context: 400k
Input
$1.75
/M tokens
Cached
$0.17
/M tokens
Output
$14.00
/M tokens
GPT-5.2 Codex
openai
gpt-5.2-codexStreaming
Vision
Tools
Reasoning
JSON Output
Native Web Search
Azure
Context: 400k
Input
$1.75
/M tokens
Cached
$0.17
/M tokens
Output
$14.00
/M tokens
o4 Mini
openai
o4-miniStreaming
Vision
Tools
Reasoning
JSON Output
Structured JSON Output
Azure
Context: 200k
Input
$1.10
/M tokens
Cached
$0.28
/M tokens
Output
$4.40
/M tokens
Grok 4.1 Fast
xai
grok-4-1-fastStreaming
Vision
Tools
Reasoning
JSON Output
xAI
Context: 2M
Input
$0.20
/M tokens
Cached
$0.05
/M tokens
Output
$0.50
/M tokens
Grok 4 Fast
xai
grok-4-fastStreaming
Vision
Tools
Reasoning
JSON Output
xAI
Context: 2M
Input
$0.20
/M tokens
Cached
$0.05
/M tokens
Output
$0.50
/M tokens
GPT-5.4 Pro
openai30% off
gpt-5.4-proStreaming
Vision
Tools
Reasoning
JSON Output
Native Web Search
Azure
Context: 1.1M
Input
$30.00
/M tokens
Cached
—
/M tokens
Output
$180.00
/M tokens
+ $0.010 per search
GPT-5.4
openai30% off
gpt-5.4Streaming
Vision
Tools
Reasoning
JSON Output
Structured JSON Output
Native Web Search
Azure
Context: 1.1M30% off
Input
$2.50$1.75
-30% off/M tokens
Cached
$0.25$0.17
-30% off/M tokens
Output
$15.00$10.50
-30% off/M tokens
+ $0.010 per search
Qwen3.5 397B A17B
alibaba20% off
qwen35-397b-a17bStreaming
Vision
Tools
Reasoning
JSON Output
Native Web Search
Alibaba Cloud
Context: 262.1k20% off
Input
$0.17$0.14
-20% off/M tokens
Cached
—
/M tokens
Output
$1.03$0.83
-20% off/M tokens
+ $0.010 per search
Gemini 3.1 Pro (Preview)
google20% off
gemini-3.1-pro-previewStreaming
Vision
Tools
Reasoning
Reasoning Budget
JSON Output
Structured JSON Output
Native Web Search
Google AI Studio
Context: 1.0M
Input
$2.00
/M tokens
Cached
$0.20
/M tokens
Output
$12.00
/M tokens
+ $0.014 per search
Claude Sonnet 4.6
anthropic30% off
claude-sonnet-4-6Streaming
Vision
Tools
Reasoning
Reasoning Budget
Structured JSON Output
Native Web Search
Anthropic
Context: 200k
Input
$3.00
/M tokens
Cached
$0.30
/M tokens
Output
$15.00
/M tokens
+ $0.010 per search
GLM-5
glm30% off
glm-5Streaming
Tools
Reasoning
JSON Output
Structured JSON Output
Native Web Search
Alibaba Cloud
Context: 202.8k
Input
$0.57
/M tokens
Cached
—
/M tokens
Output
$2.58
/M tokens
MiniMax M2.5
minimax30% off
minimax-m2.5Streaming
Reasoning
Tools
JSON Output
Structured JSON Output
CanopyWave
Context: 204.8k30% off
Input
$0.27$0.19
-30% off/M tokens
Cached
$0.03$0.02
-30% off/M tokens
Output
$1.08$0.76
-30% off/M tokens
Claude Opus 4.6
anthropic30% off
claude-opus-4-6Streaming
Vision
Tools
Reasoning
Reasoning Budget
Structured JSON Output
Native Web Search
Anthropic
Context: 1M
Input
$5.00
/M tokens
Cached
$0.50
/M tokens
Output
$25.00
/M tokens
+ $0.010 per search
Qwen3 VL 30B A3B Thinking
alibaba
qwen3-vl-30b-a3b-thinkingStreaming
Vision
Tools
Reasoning
JSON Output
NovitaAI
Context: 131.1k
Input
$0.20
/M tokens
Cached
—
/M tokens
Output
$1.00
/M tokens
Kimi K2.5
moonshot30% off
kimi-k2.5Streaming
Vision
Tools
Reasoning
JSON Output
Alibaba Cloud
Context: 262.1k
Input
$0.57
/M tokens
Cached
—
/M tokens
Output
$3.01
/M tokens
Qwen3 Max 2026-01-23
alibaba20% off
qwen3-max-2026-01-23Streaming
Vision
Tools
Reasoning
JSON Output
Alibaba Cloud
Context: 262.1k20% off
Input
$0.36$0.29
-20% off/M tokens
Cached
$0.07$0.06
-20% off/M tokens
Output
$1.43$1.15
-20% off/M tokens
Qwen3 VL 235B A22B Thinking
alibaba20% off
qwen3-vl-235b-a22b-thinkingStreaming
Vision
Reasoning
Alibaba Cloud
Context: 131.1k20% off
Input
$0.50$0.40
-20% off/M tokens
Cached
—
/M tokens
Output
$2.00$1.60
-20% off/M tokens
QwQ Plus
alibaba20% off
qwq-plusStreaming
Reasoning
Alibaba Cloud
Context: 131.1k20% off
Input
$0.23$0.18
-20% off/M tokens
Cached
—
/M tokens
Output
$0.57$0.46
-20% off/M tokens
MiniMax Text 01
minimax
minimax-text-01Streaming
Reasoning
MiniMax
Context: 1M
Input
$0.20
/M tokens
Cached
—
/M tokens
Output
$1.10
/M tokens
MiniMax M2.1 Lightning
minimax
minimax-m2.1-lightningStreaming
Reasoning
MiniMax
Context: 196.6k
Input
$0.12
/M tokens
Cached
—
/M tokens
Output
$0.48
/M tokens
GLM-4.7 Flash
glm
glm-4.7-flashStreaming
Tools
JSON Output
Reasoning
EmberCloud
Context: 200k
Input
$0.06
/M tokens
Cached
$0.01
/M tokens
Output
$0.40
/M tokens
GLM-4.7 FlashX
glm10% off
glm-4.7-flashxStreaming
Tools
Reasoning
JSON Output
Z AI
Context: 200k10% off
Input
$0.07$0.06
-10% off/M tokens
Cached
$0.01$0.01
-10% off/M tokens
Output
$0.40$0.36
-10% off/M tokens
Seed 1.8 (251228)
bytedance
seed-1-8-251228Streaming
Vision
Tools
Reasoning
JSON Output
ByteDance
Context: 256k
Input
$0.25
/M tokens
Cached
$0.05
/M tokens
Output
$2.00
/M tokens
Seed 1.6 Flash (250715)
bytedance
seed-1-6-flash-250715Streaming
Vision
Tools
Reasoning
JSON Output
ByteDance
Context: 256k
Input
$0.07
/M tokens
Cached
$0.02
/M tokens
Output
$0.30
/M tokens
Seed 1.6 (250915)
bytedance
seed-1-6-250915Streaming
Vision
Tools
Reasoning
JSON Output
ByteDance
Context: 256k
Input
$0.25
/M tokens
Cached
$0.05
/M tokens
Output
$2.00
/M tokens
Seed 1.6 (250615)
bytedance
seed-1-6-250615Streaming
Vision
Tools
Reasoning
JSON Output
ByteDance
Context: 256k
Input
$0.25
/M tokens
Cached
$0.05
/M tokens
Output
$2.00
/M tokens
GLM-4.6V FlashX
glm10% off
glm-4.6v-flashxStreaming
Vision
Tools
Reasoning
JSON Output
Z AI
Context: 128k10% off
Input
$0.04$0.04
-10% off/M tokens
Cached
$0.00$0.00
-10% off/M tokens
Output
$0.40$0.36
-10% off/M tokens
MiniMax M2.1
minimax30% off
minimax-m2.1Streaming
Tools
Reasoning
JSON Output
CanopyWave
Context: 204.8k30% off
Deactivated since Mar 31, 2026
Input
$0.27$0.19
-30% off/M tokens
Cached
$0.07$0.05
-30% off/M tokens
Output
$1.08$0.76
-30% off/M tokens
GLM-4.7
glm30% off
glm-4.7Streaming
Tools
Reasoning
JSON Output
Native Web Search
Alibaba Cloud
Context: 202.8k
Input
$0.43
/M tokens
Cached
—
/M tokens
Output
$2.01
/M tokens
Gemini 3 Flash (Preview)
google
gemini-3-flash-previewStreaming
Vision
Tools
Reasoning
Reasoning Budget
JSON Output
Structured JSON Output
Native Web Search
Google AI Studio
Context: 1.0M
Input
$0.50
/M tokens
Cached
$0.05
/M tokens
Output
$3.00
/M tokens
+ $0.014 per search
GPT-5.2 Chat
openai
gpt-5.2-chat-latestStreaming
Vision
Tools
Reasoning
Native Web Search
Azure
Context: 128k
Input
$1.75
/M tokens
Cached
$0.17
/M tokens
Output
$14.00
/M tokens
GPT-5.2 Pro
openai
gpt-5.2-proStreaming
Vision
Tools
Reasoning
JSON Output
Native Web Search
Azure
Context: 400k
Input
$21.00
/M tokens
Cached
—
/M tokens
Output
$168.00
/M tokens
GPT-5.2
openai30% off
gpt-5.2Streaming
Vision
Tools
Reasoning
JSON Output
Structured JSON Output
Native Web Search
Azure
Context: 400k30% off
Input
$1.75$1.22
-30% off/M tokens
Cached
$0.17$0.12
-30% off/M tokens
Output
$14.00$9.80
-30% off/M tokens
Claude Sonnet 4.5 (2025-09-29)
anthropic30% off
claude-sonnet-4-5-20250929Streaming
Tools
Reasoning
Reasoning Budget
Structured JSON Output
Native Web Search
Anthropic
Context: 200k
Input
$3.00
/M tokens
Cached
$0.30
/M tokens
Output
$15.00
/M tokens
+ $0.010 per search
GLM-4.6V Flash
glm
glm-4.6v-flashStreaming
Vision
Tools
Reasoning
JSON Output
Z AI
Context: 128k
Input
$0.00
/M tokens
Cached
$0.00
/M tokens
Output
$0.00
/M tokens
GLM-4.6V
glm10% off
glm-4.6vStreaming
Vision
Tools
Reasoning
JSON Output
NovitaAI
Context: 131.1k
Input
$0.30
/M tokens
Cached
$0.06
/M tokens
Output
$0.90
/M tokens
DeepSeek V3.2
deepseek30% off
deepseek-v3.2Streaming
Tools
JSON Output
Reasoning
Alibaba Cloud
Context: 131.1k20% off
Input
$0.29$0.23
-20% off/M tokens
Cached
$0.06$0.05
-20% off/M tokens
Output
$0.43$0.34
-20% off/M tokens
Kimi K2 Thinking Turbo
moonshot
kimi-k2-thinking-turboStreaming
Tools
Reasoning
JSON Output
Moonshot AI
Context: 262.1k
Input
$1.15
/M tokens
Cached
$0.15
/M tokens
Output
$8.00
/M tokens