Models
Comprehensive list of all supported models and their providers
Use Case
Capabilities
Provider
Input Price ($/M tokens)
Output Price ($/M tokens)
Context Size (tokens)
227/227
Models
30/34
Providers
107
Vision Models (filtered)
144
Tool-enabled (filtered)
3
Free Models (filtered)
Kimi K2.6
moonshot30% off
kimi-k2.6Streaming
Vision
Tools
Reasoning
JSON Output
CanopyWave
Context: 262.1k30% off
Input
$0.50$0.35
-30% off/M tokens
Cached
$0.10$0.07
-30% off/M tokens
Output
$2.80$1.96
-30% off/M tokens
Claude Opus 4.7
anthropic30% off
claude-opus-4-7Streaming
Vision
Tools
Reasoning
Structured JSON Output
Native Web Search
Anthropic
Context: 1M
Input
$5.00
/M tokens
Cached
$0.50
/M tokens
Output
$25.00
/M tokens
+ $0.010 per search
GLM-5.1
glm10% off
glm-5.1Streaming
Tools
Reasoning
JSON Output
Native Web Search
NovitaAI
Context: 204.8k
Input
$1.40
/M tokens
Cached
$0.26
/M tokens
Output
$4.40
/M tokens
Mimo V2 Flash
mimo
mimo-v2-flashStreaming
CanopyWave
Context: 256k
Input
$0.08
/M tokens
Cached
$0.04
/M tokens
Output
$0.24
/M tokens
Sora 2 Pro
openaiModel Deactivated
sora-2-proVideo Generation
Avalanche
Context: 32.8k
Deactivated since Mar 24, 2026
Per Second Pricing
720p$0.24/sec
hd$0.4/sec
Sora 2
openaiModel Deactivated
sora-2Video Generation
Avalanche
Context: 32.8k
Deactivated since Mar 24, 2026
Per Second Pricing
720p$0.08/sec
Qwen3 Coder Next
alibaba
qwen3-coder-nextStreaming
Tools
EmberCloud
Context: 262.1k
Input
$0.11
/M tokens
Cached
$0.06
/M tokens
Output
$0.68
/M tokens
Veo 3.1 Fast
google
veo-3.1-fast-generate-previewVideo Generation
Avalanche
Context: 32.8k
Per Second Pricing
Default$0.15/sec
Veo 3.1
google20% off
veo-3.1-generate-previewVideo Generation
Avalanche
Context: 32.8k20% off
Per Second Pricing
Video / Audio$0.2 – $0.4/sec
MiniMax M2.5 Highspeed
minimax
minimax-m2.5-highspeedStreaming
Reasoning
MiniMax
Context: 204.8k
Input
$0.60
/M tokens
Cached
$0.03
/M tokens
Output
$2.40
/M tokens
MiniMax M2.7 Highspeed
minimax
minimax-m2.7-highspeedStreaming
Reasoning
MiniMax
Context: 204.8k
Input
$0.60
/M tokens
Cached
$0.06
/M tokens
Output
$2.40
/M tokens
MiniMax M2.7
minimax
minimax-m2.7Streaming
Reasoning
Tools
JSON Output
Structured JSON Output
MiniMax
Context: 204.8k
Input
$0.30
/M tokens
Cached
$0.06
/M tokens
Output
$1.20
/M tokens
Gemini Pro Latest
google
gemini-pro-latestStreaming
Vision
Tools
Reasoning
Reasoning Budget
JSON Output
Structured JSON Output
Native Web Search
Google AI Studio
Context: 1.0M
Input
$2.00
/M tokens
Cached
$0.20
/M tokens
Output
$12.00
/M tokens
+ $0.014 per search
GPT-5.4 Nano
openai30% off
gpt-5.4-nanoStreaming
Vision
Tools
Reasoning
JSON Output
Structured JSON Output
Native Web Search
Azure
Context: 400k30% off
Input
$0.20$0.14
-30% off/M tokens
Cached
$0.02$0.01
-30% off/M tokens
Output
$1.25$0.88
-30% off/M tokens
+ $0.010 per search
GPT-5.4 Mini
openai30% off
gpt-5.4-miniStreaming
Vision
Tools
Reasoning
JSON Output
Structured JSON Output
Native Web Search
Azure
Context: 400k30% off
Input
$0.75$0.52
-30% off/M tokens
Cached
$0.07$0.05
-30% off/M tokens
Output
$4.50$3.15
-30% off/M tokens
+ $0.010 per search
Grok 4.20 Beta Non-Reasoning (0309)
xai
grok-4-20-beta-0309-non-reasoningStreaming
Vision
Tools
JSON Output
xAI
Context: 2M
Input
$2.00
/M tokens
Cached
$0.20
/M tokens
Output
$6.00
/M tokens
Grok 4.20 Beta Reasoning (0309)
xai
grok-4-20-beta-0309-reasoningStreaming
Vision
Tools
Reasoning
JSON Output
xAI
Context: 2M
Input
$2.00
/M tokens
Cached
$0.20
/M tokens
Output
$6.00
/M tokens
Grok 4.20 Multi-Agent Beta (0309)
xaiModel Deactivated
grok-4-20-multi-agent-beta-0309Streaming
Vision
Tools
Reasoning
JSON Output
xAI
Context: 2M
Deactivated since Mar 27, 2026
Input
$2.00
/M tokens
Cached
$0.20
/M tokens
Output
$6.00
/M tokens
GPT-5.3 Chat
openai
gpt-5.3-chat-latestStreaming
Vision
Tools
Reasoning
Native Web Search
Azure
Context: 128k
Input
$1.75
/M tokens
Cached
$0.17
/M tokens
Output
$14.00
/M tokens
GPT-5.3 Codex
openai
gpt-5.3-codexStreaming
Vision
Tools
Reasoning
JSON Output
Native Web Search
Azure
Context: 400k
Input
$1.75
/M tokens
Cached
$0.17
/M tokens
Output
$14.00
/M tokens
GPT-5.2 Codex
openai
gpt-5.2-codexStreaming
Vision
Tools
Reasoning
JSON Output
Native Web Search
Azure
Context: 400k
Input
$1.75
/M tokens
Cached
$0.17
/M tokens
Output
$14.00
/M tokens
o4 Mini
openai
o4-miniStreaming
Vision
Tools
Reasoning
JSON Output
Structured JSON Output
Azure
Context: 200k
Input
$1.10
/M tokens
Cached
$0.28
/M tokens
Output
$4.40
/M tokens
Grok 4.1 Fast
xai
grok-4-1-fastStreaming
Vision
Tools
Reasoning
JSON Output
xAI
Context: 2M
Input
$0.20
/M tokens
Cached
$0.05
/M tokens
Output
$0.50
/M tokens
Grok 4 Fast
xai
grok-4-fastStreaming
Vision
Tools
Reasoning
JSON Output
xAI
Context: 2M
Input
$0.20
/M tokens
Cached
$0.05
/M tokens
Output
$0.50
/M tokens
GPT-5.4 Pro
openai30% off
gpt-5.4-proStreaming
Vision
Tools
Reasoning
JSON Output
Native Web Search
Azure
Context: 1.1M
Input
$30.00
/M tokens
Cached
—
/M tokens
Output
$180.00
/M tokens
+ $0.010 per search
GPT-5.4
openai30% off
gpt-5.4Streaming
Vision
Tools
Reasoning
JSON Output
Structured JSON Output
Native Web Search
Azure
Context: 1.1M30% off
Input
$2.50$1.75
-30% off/M tokens
Cached
$0.25$0.17
-30% off/M tokens
Output
$15.00$10.50
-30% off/M tokens
+ $0.010 per search
Grok Imagine Image
xai
grok-imagine-imageVision
Image Generation
xAI
Context: 2k
Per image
$0.0200
Grok Imagine Image Pro
xai
grok-imagine-image-proVision
Image Generation
xAI
Context: 2k
Per image
$0.0700
Gemini 3.1 Flash Lite (Preview)
google
gemini-3.1-flash-lite-previewStreaming
Vision
Tools
JSON Output
Structured JSON Output
Google AI Studio
Context: 1.0M
Input
$0.25
/M tokens
Cached
$0.03
/M tokens
Output
$1.50
/M tokens
Gemini 3.1 Flash Image (Preview)
google20% off
gemini-3.1-flash-image-previewStreaming
Vision
JSON Output
Structured JSON Output
Image Generation
Glacier
Context: 65.5k20% off
Per image (0.5K)
$0.0448$0.0359
Image Pricing (est. per image)
Input
any size~$0.0001~$0.0001
Output
0.5K~$0.0448~$0.0359
1K~$0.0672~$0.0538
2K~$0.1008~$0.0806
4K~$0.1512~$0.1210
Qwen3.5 397B A17B
alibaba20% off
qwen35-397b-a17bStreaming
Vision
Tools
Reasoning
JSON Output
Native Web Search
Alibaba Cloud
Context: 262.1k20% off
Input
$0.17$0.14
-20% off/M tokens
Cached
—
/M tokens
Output
$1.03$0.83
-20% off/M tokens
+ $0.010 per search
Devstral Small 1.1
mistral
devstral-small-2507Streaming
JSON Output
Mistral AI
Context: 131.1k
Input
$0.10
/M tokens
Cached
—
/M tokens
Output
$0.30
/M tokens
Devstral 2
mistral
devstral-2512Streaming
JSON Output
Mistral AI
Context: 262.1k
Input
$0.40
/M tokens
Cached
—
/M tokens
Output
$2.00
/M tokens
Codestral
mistral
codestral-2508Streaming
JSON Output
Mistral AI
Context: 256k
Input
$0.30
/M tokens
Cached
—
/M tokens
Output
$0.90
/M tokens
Ministral 3B
mistral
ministral-3b-2512Streaming
Vision
JSON Output
Mistral AI
Context: 131.1k
Input
$0.10
/M tokens
Cached
—
/M tokens
Output
$0.10
/M tokens
Ministral 8B
mistral
ministral-8b-2512Streaming
Vision
JSON Output
Mistral AI
Context: 262.1k
Input
$0.15
/M tokens
Cached
—
/M tokens
Output
$0.15
/M tokens
Ministral 14B
mistral
ministral-14b-2512Streaming
Vision
JSON Output
Mistral AI
Context: 262.1k
Input
$0.20
/M tokens
Cached
—
/M tokens
Output
$0.20
/M tokens
Mistral Small 3.2
mistral
mistral-small-2506Streaming
Vision
JSON Output
Mistral AI
Context: 128k
Input
$0.10
/M tokens
Cached
—
/M tokens
Output
$0.30
/M tokens
Mistral Large 3
mistral
mistral-large-2512Streaming
Vision
JSON Output
Mistral AI
Context: 262.1k
Input
$0.50
/M tokens
Cached
—
/M tokens
Output
$1.50
/M tokens
Gemini 3.1 Pro (Preview)
google20% off
gemini-3.1-pro-previewStreaming
Vision
Tools
Reasoning
Reasoning Budget
JSON Output
Structured JSON Output
Native Web Search
Google AI Studio
Context: 1.0M
Input
$2.00
/M tokens
Cached
$0.20
/M tokens
Output
$12.00
/M tokens
+ $0.014 per search
Claude Sonnet 4.6
anthropic30% off
claude-sonnet-4-6Streaming
Vision
Tools
Reasoning
Reasoning Budget
Structured JSON Output
Native Web Search
Anthropic
Context: 200k
Input
$3.00
/M tokens
Cached
$0.30
/M tokens
Output
$15.00
/M tokens
+ $0.010 per search
GLM-5
glm30% off
glm-5Streaming
Tools
Reasoning
JSON Output
Structured JSON Output
Native Web Search
Alibaba Cloud
Context: 202.8k
Input
$0.57
/M tokens
Cached
—
/M tokens
Output
$2.58
/M tokens
MiniMax M2.5
minimax30% off
minimax-m2.5Streaming
Reasoning
Tools
JSON Output
Structured JSON Output
CanopyWave
Context: 204.8k30% off
Input
$0.27$0.19
-30% off/M tokens
Cached
$0.03$0.02
-30% off/M tokens
Output
$1.08$0.76
-30% off/M tokens
Claude Opus 4.6
anthropic30% off
claude-opus-4-6Streaming
Vision
Tools
Reasoning
Reasoning Budget
Structured JSON Output
Native Web Search
Anthropic
Context: 1M
Input
$5.00
/M tokens
Cached
$0.50
/M tokens
Output
$25.00
/M tokens
+ $0.010 per search
Hermes 2 Pro Llama 3 8B
nousresearch
hermes-2-pro-llama-3-8bStreaming
NovitaAI
Context: 8.2k
Input
$0.14
/M tokens
Cached
—
/M tokens
Output
$0.14
/M tokens
Qwen3 4B FP8
alibaba
qwen3-4b-fp8Streaming
NovitaAI
Context: 128k
Input
$0.03
/M tokens
Cached
—
/M tokens
Output
$0.03
/M tokens
Qwen3 30B A3B FP8
alibaba
qwen3-30b-a3b-fp8Streaming
NovitaAI
Context: 41.0k
Input
$0.09
/M tokens
Cached
—
/M tokens
Output
$0.45
/M tokens
Qwen3 32B FP8
alibaba
qwen3-32b-fp8Streaming
NovitaAI
Context: 41.0k
Input
$0.10
/M tokens
Cached
—
/M tokens
Output
$0.45
/M tokens
Qwen3 VL 30B A3B Thinking
alibaba
qwen3-vl-30b-a3b-thinkingStreaming
Vision
Tools
Reasoning
JSON Output
NovitaAI
Context: 131.1k
Input
$0.20
/M tokens
Cached
—
/M tokens
Output
$1.00
/M tokens
Qwen3 VL 30B A3B Instruct
alibaba
qwen3-vl-30b-a3b-instructStreaming
Vision
Tools
NovitaAI
Context: 131.1k
Input
$0.20
/M tokens
Cached
—
/M tokens
Output
$0.70
/M tokens