Models
Comprehensive list of all supported models and their providers
Use Case
Capabilities
Provider
Input Price ($/M tokens)
Output Price ($/M tokens)
Context Size (tokens)
Models: 204/204
Providers: 25/30
Vision Models (filtered): 92
Tool-enabled (filtered): 127
Free Models (filtered): 3
Gemini 3.1 Flash Lite (Preview)
google
gemini-3.1-flash-lite-preview
Providers
Google AI Studio
google-ai-studio/gemini-3.1-flash-lite-preview
Context Size
1.0M
Stability
stable
Pricing
Input
$0.25/M
Cached
$0.03/M
Output
$1.50/M
Capabilities
Streaming
Vision
Tools
JSON Output
Structured JSON Output
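Prices in this list are quoted in US dollars per million tokens, so the cost of a request is each token count multiplied by its rate and divided by one million. The following is a minimal sketch using the Gemini 3.1 Flash Lite (Preview) rates listed above. The function name, the token counts, and the assumption that cached input tokens are billed at the Cached rate instead of the Input rate are illustrative and are not taken from any provider's billing documentation.

```python
from decimal import Decimal

TOKENS_PER_MILLION = 1_000_000


def estimate_cost(uncached_input_tokens, cached_input_tokens, output_tokens,
                  input_price, cached_price, output_price):
    """Estimate the USD cost of one request.

    Prices are USD per million tokens, passed as strings such as "0.25"
    so that Decimal can represent them exactly.
    """
    total = (
        Decimal(input_price) * uncached_input_tokens
        + Decimal(cached_price) * cached_input_tokens
        + Decimal(output_price) * output_tokens
    )
    return total / TOKENS_PER_MILLION


# Hypothetical request: 120k fresh input tokens, 30k cached, 8k output,
# priced at $0.25/M input, $0.03/M cached, $1.50/M output.
cost = estimate_cost(120_000, 30_000, 8_000, "0.25", "0.03", "1.50")
print(f"Estimated cost: ${cost}")
```

With these illustrative numbers the request comes to a little over four cents, most of it from the uncached input tokens.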
Grok Imagine Image
xai
grok-imagine-image
Providers
xAI
xai/grok-imagine-image
Context Size
2k
Stability
stable
Pricing
Input
$0.00/M
Cached
—/M
Output
$0.00/M
Per Request
$0.020/req
Capabilities
Image Generation
Grok Imagine Image Pro
xai
grok-imagine-image-pro
Providers
xAI
xai/grok-imagine-image-pro
Context Size
2k
Stability
stable
Pricing
Input
$0.00/M
Cached
—/M
Output
$0.00/M
Per Request
$0.070/req
Capabilities
Image Generation
Gemini 3.1 Flash Image (Preview)
google
gemini-3.1-flash-image-preview
Providers
Google AI Studio
google-ai-studio/gemini-3.1-flash-image-preview
Context Size
65.5k
Stability
stable
Pricing
Input
$0.25/M
Cached
—/M
Output
$1.50/M
Capabilities
Streaming
Vision
JSON Output
Structured JSON Output
Image Generation
Gemini 3.1 Pro (Preview)
google
gemini-3.1-pro-preview
Providers
Google AI Studio
google-ai-studio/gemini-3.1-pro-preview
Context Size
1.0M
Stability
stable
Pricing
Input
$2.00/M
Cached
$0.20/M
Output
$12.00/M
Capabilities
Streaming
Vision
Tools
Reasoning
Reasoning Budget
JSON Output
Structured JSON Output
Native Web Search
Claude Sonnet 4.6
anthropic
claude-sonnet-4-6
Providers
Anthropic
anthropic/claude-sonnet-4-6
Context Size
200k
Stability
stable
Pricing
Input
$3.00/M
Cached
$0.30/M
Output
$15.00/M
Capabilities
Streaming
Tools
Reasoning
Reasoning Budget
Structured JSON Output
Native Web Search
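Each entry pairs a bare model ID (for example claude-sonnet-4-6) with a provider-qualified ID (anthropic/claude-sonnet-4-6). The sketch below splits the qualified form into its two parts, assuming the first slash always separates the provider from the model. `split_qualified_id` is a hypothetical helper written for illustration, not part of any API described in this list.

```python
def split_qualified_id(qualified_id):
    """Split "provider/model" into a (provider, model) tuple.

    Only the first slash is treated as the separator; this is an
    assumption about the ID format, not a documented rule.
    """
    provider, separator, model = qualified_id.partition("/")
    if not separator or not provider or not model:
        raise ValueError(f"expected 'provider/model', got {qualified_id!r}")
    return provider, model


provider, model = split_qualified_id("anthropic/claude-sonnet-4-6")
print(provider, model)
```

Provider prefixes that contain a dot, such as together.ai, pass through unchanged because only the slash is significant.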
Qwen3.5 397B A17B
alibaba
qwen35-397b-a17b
Providers
Alibaba Cloud
alibaba/qwen35-397b-a17b
Context Size
262.1k
Stability
stable
Pricing (20% off)
Input
$0.48/M (list $0.60/M)
Cached
—/M
Output
$2.88/M (list $3.60/M)
Capabilities
Streaming
Vision
Tools
Reasoning
JSON Output
Native Web Search
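Discounted entries show a list price next to a reduced price. A short sketch of that arithmetic, using the Qwen3.5 397B A17B figures above (20% off $0.60 and $3.60), follows. The function name is hypothetical, and rounding half up to whole cents is an assumption: the list does not state its rounding rule, so other entries may differ from this sketch by a cent.

```python
from decimal import Decimal, ROUND_HALF_UP

CENT = Decimal("0.01")


def apply_discount(list_price, percent_off):
    """Return the discounted price, rounded to whole cents.

    list_price is a string such as "0.60" (USD per million tokens);
    percent_off is a number such as 20.
    """
    factor = (Decimal(100) - Decimal(percent_off)) / Decimal(100)
    return (Decimal(list_price) * factor).quantize(CENT, rounding=ROUND_HALF_UP)


print(apply_discount("0.60", 20))  # discounted input price
print(apply_discount("3.60", 20))  # discounted output price
```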
GLM-5
glm
glm-5
Providers
CanopyWave
canopywave/glm-5
Context Size
200k
Stability
stable
Pricing (30% off)
Input
$0.63/M (list $0.90/M)
Cached
—/M
Output
$2.17/M (list $3.10/M)
Capabilities
Streaming
Tools
Reasoning
MiniMax M2.5
minimax
minimax-m2.5
Providers
CanopyWave
canopywave/minimax-m2.5
Context Size
204.8k
Stability
stable
Pricing (30% off)
Input
$0.19/M (list $0.27/M)
Cached
—/M
Output
$0.76/M (list $1.08/M)
Capabilities
Streaming
Reasoning
Claude Opus 4.6
anthropic
claude-opus-4-6
Providers
Anthropic
anthropic/claude-opus-4-6
Context Size
1M
Stability
stable
Pricing
Input
$5.00/M
Cached
$0.50/M
Output
$25.00/M
Capabilities
Streaming
Vision
Tools
Reasoning
Structured JSON Output
Native Web Search
Kimi K2.5
moonshot
kimi-k2.5
Providers
CanopyWave
canopywave/kimi-k2.5
Context Size
262.1k
Stability
stable
Pricing (30% off)
Input
$0.35/M (list $0.50/M)
Cached
—/M
Output
$1.96/M (list $2.80/M)
Capabilities
Streaming
Vision
Tools
JSON Output
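Context sizes are abbreviated with k and M suffixes (2k, 262.1k, 1.0M). The sketch below converts such a label to an approximate token count and checks whether a request fits. It assumes decimal multipliers (k = 1,000, M = 1,000,000) and that the window covers prompt and output tokens together; the labels are rounded, so the true limit may be slightly different (262.1k is plausibly 262,144). Both function names are hypothetical.

```python
from decimal import Decimal

# Assumed decimal multipliers for the suffixes used in this list.
SUFFIX_MULTIPLIERS = {"k": 1_000, "M": 1_000_000}


def parse_context_size(label):
    """Convert a label such as "262.1k" or "1.0M" to an approximate token count."""
    label = label.strip()
    multiplier = SUFFIX_MULTIPLIERS.get(label[-1])
    if multiplier is None:
        # No recognized suffix: treat the label as a plain integer.
        return int(label)
    return int(Decimal(label[:-1]) * multiplier)


def fits(prompt_tokens, max_output_tokens, context_label):
    """Return True if prompt plus output stays within the labeled window."""
    return prompt_tokens + max_output_tokens <= parse_context_size(context_label)


print(parse_context_size("262.1k"))
print(fits(250_000, 8_000, "262.1k"))
```

Because the label is only an approximation, leaving some headroom below the parsed value is safer than filling the window exactly.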
Qwen3 Max 2026-01-23
alibaba
qwen3-max-2026-01-23
Providers
Alibaba Cloud
alibaba/qwen3-max-2026-01-23
Context Size
262.1k
Stability
stable
Pricing (20% off)
Input
$0.96/M (list $1.20/M)
Cached
$0.19/M (list $0.24/M)
Output
$4.80/M (list $6.00/M)
Capabilities
Streaming
Vision
Tools
Reasoning
JSON Output
Qwen Image Edit Max
alibaba
qwen-image-edit-max
Providers
Alibaba Cloud
alibaba/qwen-image-edit-max
Context Size
2k
Stability
stable
Pricing (20% off)
Input
$0.00/M (list $0.00/M)
Cached
—/M
Output
$0.00/M (list $0.00/M)
Per Request
$0.080/req
Capabilities
Vision
Image Generation
Qwen Image Max 2025-12-30
alibaba
qwen-image-max-2025-12-30
Providers
Alibaba Cloud
alibaba/qwen-image-max-2025-12-30
Context Size
2k
Stability
stable
Pricing
Input
$0.00/M
Cached
—/M
Output
$0.00/M
Per Request
$0.075/req
Capabilities
Image Generation
MiniMax M2.1 Lightning
minimax
minimax-m2.1-lightning
Providers
MiniMax
minimax/minimax-m2.1-lightning
Context Size
196.6k
Stability
stable
Pricing
Input
$0.12/M
Cached
—/M
Output
$0.48/M
Capabilities
Streaming
Reasoning
MiniMax M2.1
minimax
minimax-m2.1
Providers
CanopyWave
canopywave/minimax-m2.1
Context Size
204.8k
Stability
stable
Pricing (30% off)
Input
$0.19/M (list $0.27/M)
Cached
$0.05/M (list $0.07/M)
Output
$0.76/M (list $1.08/M)
Capabilities
Streaming
Tools
Reasoning
JSON Output
GLM-4.7 Flash
glm
glm-4.7-flash
Providers
Z AI
zai/glm-4.7-flash
Context Size
200k
Stability
stable
Pricing
Input
$0.00/M
Cached
$0.00/M
Output
$0.00/M
Capabilities
Streaming
Tools
Reasoning
JSON Output
GLM-4.7 FlashX
glm
glm-4.7-flashx
Providers
Z AI
zai/glm-4.7-flashx
Context Size
200k
Stability
stable
Pricing (10% off)
Input
$0.06/M (list $0.07/M)
Cached
$0.01/M (list $0.01/M)
Output
$0.36/M (list $0.40/M)
Capabilities
Streaming
Tools
Reasoning
JSON Output
GLM-4.7
glm
glm-4.7
Providers
ByteDance
bytedance/glm-4.7
Context Size
200k
Stability
stable
Pricing
Input
$0.60/M
Cached
$0.11/M
Output
$2.20/M
Capabilities
Streaming
Tools
Reasoning
Seed 1.8 (251228)
bytedance
seed-1-8-251228
Providers
ByteDance
bytedance/seed-1-8-251228
Context Size
256k
Stability
stable
Pricing
Input
$0.25/M
Cached
$0.05/M
Output
$2.00/M
Capabilities
Streaming
Vision
Tools
Reasoning
JSON Output
Gemini 3 Flash (Preview)
google
gemini-3-flash-preview
Providers
Google AI Studio
google-ai-studio/gemini-3-flash-preview
Context Size
1.0M
Stability
stable
Pricing
Input
$0.50/M
Cached
$0.05/M
Output
$3.00/M
Capabilities
Streaming
Vision
Tools
Reasoning
Reasoning Budget
JSON Output
Structured JSON Output
Native Web Search
GPT-5.2 Chat
openai
gpt-5.2-chat-latest
Providers
Azure
azure/gpt-5.2-chat-latest
Context Size
128k
Stability
unstable
Pricing
Input
$1.75/M
Cached
$0.17/M
Output
$14.00/M
Capabilities
Streaming
Vision
Tools
Reasoning
GPT-5.2 Pro
openai
gpt-5.2-pro
Providers
Azure
azure/gpt-5.2-pro
Context Size
400k
Stability
unstable
Pricing
Input
$21.00/M
Cached
—/M
Output
$168.00/M
Capabilities
Streaming
Vision
Tools
Reasoning
JSON Output
GPT-5.2
openai
gpt-5.2
Providers
Azure
azure/gpt-5.2
Context Size
400k
Stability
stable
Pricing (30% off)
Input
$1.22/M (list $1.75/M)
Cached
$0.12/M (list $0.17/M)
Output
$9.80/M (list $14.00/M)
Capabilities
Streaming
Vision
Tools
Reasoning
JSON Output
Structured JSON Output
Devstral 2
mistral
devstral-2512
Providers
Mistral AI
mistral/devstral-2512
Context Size
262.1k
Stability
stable
Pricing
Input
$0.40/M
Cached
—/M
Output
$2.00/M
Capabilities
Streaming
JSON Output
GLM-4.6V FlashX
glm
glm-4.6v-flashx
Providers
Z AI
zai/glm-4.6v-flashx
Context Size
128k
Stability
stable
Pricing (10% off)
Input
$0.04/M (list $0.04/M)
Cached
$0.00/M (list $0.00/M)
Output
$0.36/M (list $0.40/M)
Capabilities
Streaming
Vision
Tools
Reasoning
JSON Output
GLM-4.6V Flash
glm
glm-4.6v-flash
Providers
Z AI
zai/glm-4.6v-flash
Context Size
128k
Stability
stable
Pricing
Input
$0.00/M
Cached
$0.00/M
Output
$0.00/M
Capabilities
Streaming
Vision
Tools
Reasoning
JSON Output
GLM-4.6V
glm
glm-4.6v
Providers
NovitaAI
novita/glm-4.6v
Context Size
131.1k
Stability
stable
Pricing
Input
$0.30/M
Cached
$0.06/M
Output
$0.90/M
Capabilities
Streaming
Vision
Tools
Reasoning
JSON Output
Seedream 4.5
bytedance
seedream-4-5
Providers
ByteDance
bytedance/seedream-4-5
Context Size
2k
Stability
stable
Pricing (10% off)
Input
$0.00/M (list $0.00/M)
Cached
—/M
Output
$0.00/M (list $0.00/M)
Per Request
$0.045/req
Capabilities
Image Generation
Ministral 3B
mistral
ministral-3b-2512
Providers
Mistral AI
mistral/ministral-3b-2512
Context Size
131.1k
Stability
stable
Pricing
Input
$0.10/M
Cached
—/M
Output
$0.10/M
Capabilities
Streaming
Vision
JSON Output
Ministral 8B
mistral
ministral-8b-2512
Providers
Mistral AI
mistral/ministral-8b-2512
Context Size
262.1k
Stability
stable
Pricing
Input
$0.15/M
Cached
—/M
Output
$0.15/M
Capabilities
Streaming
Vision
JSON Output
Ministral 14B
mistral
ministral-14b-2512
Providers
Mistral AI
mistral/ministral-14b-2512
Context Size
262.1k
Stability
stable
Pricing
Input
$0.20/M
Cached
—/M
Output
$0.20/M
Capabilities
Streaming
Vision
JSON Output
Mistral Large 3
mistral
mistral-large-2512
Providers
Mistral AI
mistral/mistral-large-2512
Context Size
262.1k
Stability
stable
Pricing
Input
$0.50/M
Cached
—/M
Output
$1.50/M
Capabilities
Streaming
Vision
JSON Output
Mistral Large Latest
mistral
mistral-large-latest
Providers
Mistral AI
mistral/mistral-large-latest
Context Size
128k
Stability
stable
Pricing
Input
$4.00/M
Cached
—/M
Output
$12.00/M
Capabilities
Streaming
Claude Opus 4.5
anthropic
claude-opus-4-5-20251101
Providers
Anthropic
anthropic/claude-opus-4-5-20251101
Context Size
200k
Stability
stable
Pricing
Input
$5.00/M
Cached
$0.50/M
Output
$25.00/M
Capabilities
Streaming
Vision
Tools
Reasoning
Reasoning Budget
Structured JSON Output
Native Web Search
Gemini 3 Pro Image (Preview)
google
gemini-3-pro-image-preview
Providers
Google AI Studio
google-ai-studio/gemini-3-pro-image-preview
Context Size
65.5k
Stability
stable
Pricing
Input
$2.00/M
Cached
$0.20/M
Output
$12.00/M
Capabilities
Streaming
Vision
JSON Output
Structured JSON Output
Image Generation
Grok 4.1 Fast Non-Reasoning
xai
grok-4-1-fast-non-reasoning
Providers
xAI
xai/grok-4-1-fast-non-reasoning
Context Size
2M
Stability
stable
Pricing
Input
$0.20/M
Cached
$0.05/M
Output
$0.50/M
Capabilities
Streaming
Vision
Tools
JSON Output
Grok 4.1 Fast Reasoning
xai
grok-4-1-fast-reasoning
Providers
xAI
xai/grok-4-1-fast-reasoning
Context Size
2M
Stability
stable
Pricing
Input
$0.20/M
Cached
$0.05/M
Output
$0.50/M
Capabilities
Streaming
Vision
Tools
JSON Output
Gemini 3 Pro (Preview)
google
gemini-3-pro-preview
Providers
Google AI Studio
google-ai-studio/gemini-3-pro-preview
Context Size
1.0M
Stability
stable
Pricing
Input
$2.00/M
Cached
$0.20/M
Output
$12.00/M
Capabilities
Streaming
Vision
Tools
Reasoning
Reasoning Budget
JSON Output
Structured JSON Output
Native Web Search
GPT-5.1 Codex
openai
gpt-5.1-codex
Providers
Azure
azure/gpt-5.1-codex
Context Size
400k
Stability
unstable
Pricing
Input
$1.25/M
Cached
—/M
Output
$10.00/M
Capabilities
Streaming
Vision
Tools
Reasoning
JSON Output
GPT-5.1 Codex mini
openai
gpt-5.1-codex-mini
Providers
Azure
azure/gpt-5.1-codex-mini
Context Size
400k
Stability
unstable
Pricing
Input
$0.25/M
Cached
$0.03/M
Output
$2.00/M
Capabilities
Streaming
Vision
Tools
Reasoning
JSON Output
Kimi K2 Thinking Turbo
moonshot
kimi-k2-thinking-turbo
Providers
Moonshot AI
moonshot/kimi-k2-thinking-turbo
Context Size
262.1k
Stability
stable
Pricing
Input
$1.15/M
Cached
$0.15/M
Output
$8.00/M
Capabilities
Streaming
Tools
Reasoning
JSON Output
Kimi K2 Thinking
moonshot
kimi-k2-thinking
Providers
ByteDance
bytedance/kimi-k2-thinking
Context Size
256k
Stability
stable
Pricing
Input
$0.60/M
Cached
$0.12/M
Output
$2.50/M
Capabilities
Streaming
Tools
Reasoning
GPT-5.1
openai
gpt-5.1
Providers
Azure
azure/gpt-5.1
Context Size
400k
Stability
stable
Pricing (30% off)
Input
$0.88/M (list $1.25/M)
Cached
$0.09/M (list $0.13/M)
Output
$7.00/M (list $10.00/M)
Capabilities
Streaming
Vision
Tools
Reasoning
JSON Output
Structured JSON Output
MiniMax M2
minimax
minimax-m2
Providers
CanopyWave
canopywave/minimax-m2
Context Size
196.6k
Stability
stable
Deactivated since Jan 1, 2026
Pricing (30% off)
Input
$0.17/M (list $0.25/M)
Cached
—/M
Output
$0.70/M (list $1.00/M)
Capabilities
Streaming
Tools
Reasoning
JSON Output
Qwen3 VL Flash
alibaba
qwen3-vl-flash
Providers
Alibaba Cloud
alibaba/qwen3-vl-flash
Context Size
262.1k
Stability
stable
Pricing (20% off)
Input
$0.04/M (list $0.05/M)
Cached
$0.01/M (list $0.01/M)
Output
$0.32/M (list $0.40/M)
Capabilities
Streaming
Vision
Tools
JSON Output
Claude Haiku 4.5 (2025-10-01)
anthropic
claude-haiku-4-5-20251001
Providers
Anthropic
anthropic/claude-haiku-4-5-20251001
Context Size
200k
Stability
stable
Pricing
Input
$1.00/M
Cached
$0.10/M
Output
$5.00/M
Capabilities
Streaming
Tools
Structured JSON Output
Native Web Search
Claude Haiku 4.5
anthropic
claude-haiku-4-5
Providers
Anthropic
anthropic/claude-haiku-4-5
Context Size
200k
Stability
stable
Pricing
Input
$1.00/M
Cached
$0.10/M
Output
$5.00/M
Capabilities
Streaming
Tools
Structured JSON Output
Native Web Search
Qwen3 VL 8B Instruct
alibaba
qwen3-vl-8b-instruct
Providers
NovitaAI
novita/qwen3-vl-8b-instruct
Context Size
131.1k
Stability
stable
Pricing
Input
$0.08/M
Cached
—/M
Output
$0.50/M
Capabilities
Streaming
Vision
JSON Output
Qwen3 VL 30B A3B Thinking
alibaba
qwen3-vl-30b-a3b-thinking
Providers
NovitaAI
novita/qwen3-vl-30b-a3b-thinking
Context Size
131.1k
Stability
stable
Pricing
Input
$0.20/M
Cached
—/M
Output
$1.00/M
Capabilities
Streaming
Vision
Tools
Reasoning
JSON Output
Grok 4 Fast Non-Reasoning
xai
grok-4-fast-non-reasoning
Providers
xAI
xai/grok-4-fast-non-reasoning
Context Size
2M
Stability
stable
Pricing
Input
$0.20/M
Cached
$0.05/M
Output
$0.50/M
Capabilities
Streaming
Vision
Tools
JSON Output
Qwen3 VL 30B A3B Instruct
alibaba
qwen3-vl-30b-a3b-instruct
Providers
NovitaAI
novita/qwen3-vl-30b-a3b-instruct
Context Size
131.1k
Stability
stable
Pricing
Input
$0.20/M
Cached
—/M
Output
$0.70/M
Capabilities
Streaming
Vision
Tools
Gemini 2.5 Flash Image
google
gemini-2.5-flash-image
Providers
Google AI Studio
google-ai-studio/gemini-2.5-flash-image
Context Size
32.8k
Stability
stable
Pricing
Input
$0.30/M
Cached
$0.03/M
Output
$30.00/M
Capabilities
Streaming
Vision
JSON Output
Structured JSON Output
Image Generation
Gemini 2.5 Flash Image (Preview)
google
gemini-2.5-flash-image-preview
Providers
Google Vertex AI
google-vertex/gemini-2.5-flash-image-preview
Context Size
32.8k
Stability
stable
Pricing
Input
$0.30/M
Cached
—/M
Output
$2.50/M
Capabilities
Streaming
Vision
JSON Output
Structured JSON Output
Image Generation
GLM-4.6
glm
glm-4.6
Providers
CanopyWave
canopywave/glm-4.6
Context Size
202.8k
Stability
stable
Deactivated since Jan 1, 2026
Pricing (30% off)
Input
$0.32/M (list $0.45/M)
Cached
—/M
Output
$1.05/M (list $1.50/M)
Capabilities
Streaming
Tools
Reasoning
JSON Output
Claude Sonnet 4.5 (2025-09-29)
anthropic
claude-sonnet-4-5-20250929
Providers
Anthropic
anthropic/claude-sonnet-4-5-20250929
Context Size
200k
Stability
stable
Pricing
Input
$3.00/M
Cached
$0.30/M
Output
$15.00/M
Capabilities
Streaming
Tools
Reasoning
Reasoning Budget
Structured JSON Output
Native Web Search
DeepSeek V3.2
deepseek
deepseek-v3.2
Providers
ByteDance
bytedance/deepseek-v3.2
Context Size
131.1k
Stability
stable
Pricing
Input
$0.28/M
Cached
$0.06/M
Output
$0.42/M
Capabilities
Streaming
Tools
Reasoning
Claude Sonnet 4.5
anthropic
claude-sonnet-4-5
Providers
Anthropic
anthropic/claude-sonnet-4-5
Context Size
200k
Stability
stable
Pricing
Input
$3.00/M
Cached
$0.30/M
Output
$15.00/M
Capabilities
Streaming
Tools
Reasoning
Reasoning Budget
Structured JSON Output
Native Web Search
Gemini 2.5 Flash Lite Preview (09-2025)
google
gemini-2.5-flash-lite-preview-09-2025
Providers
Google AI Studio
google-ai-studio/gemini-2.5-flash-lite-preview-09-2025
Context Size
1.0M
Stability
stable
Pricing
Input
$0.10/M
Cached
$0.01/M
Output
$0.40/M
Capabilities
Streaming
Vision
Tools
JSON Output
Structured JSON Output
Gemini 2.5 Flash Preview (09-2025)
google
Model Deactivated
gemini-2.5-flash-preview-09-2025
Providers
Google AI Studio
google-ai-studio/gemini-2.5-flash-preview-09-2025
Context Size
1.0M
Stability
stable
Deactivated since Jan 17, 2026
Pricing
Input
$0.30/M
Cached
$0.03/M
Output
$2.50/M
Capabilities
Streaming
Vision
Tools
Reasoning
Reasoning Budget
JSON Output
Structured JSON Output
Qwen3 Max
alibaba
qwen3-max
Providers
Alibaba Cloud
alibaba/qwen3-max
Context Size
256k
Stability
stable
Pricing (20% off)
Input
$2.40/M (list $3.00/M)
Cached
$0.48/M (list $0.60/M)
Output
$12.00/M (list $15.00/M)
Capabilities
Streaming
Vision
Tools
Reasoning
JSON Output
Qwen3 VL 235B A22B Thinking
alibaba
qwen3-vl-235b-a22b-thinking
Providers
Alibaba Cloud
alibaba/qwen3-vl-235b-a22b-thinking
Context Size
131.1k
Stability
stable
Pricing (20% off)
Input
$0.40/M (list $0.50/M)
Cached
—/M
Output
$1.60/M (list $2.00/M)
Capabilities
Streaming
Vision
Reasoning
Qwen3 VL 235B A22B Instruct
alibaba
qwen3-vl-235b-a22b-instruct
Providers
Alibaba Cloud
alibaba/qwen3-vl-235b-a22b-instruct
Context Size
131.1k
Stability
stable
Pricing (20% off)
Input
$0.40/M (list $0.50/M)
Cached
—/M
Output
$1.60/M (list $2.00/M)
Capabilities
Streaming
Vision
Tools
JSON Output
Qwen3 VL Plus
alibaba
qwen3-vl-plus
Providers
Alibaba Cloud
alibaba/qwen3-vl-plus
Context Size
262.1k
Stability
stable
Pricing (20% off)
Input
$0.16/M (list $0.20/M)
Cached
$0.03/M (list $0.04/M)
Output
$1.28/M (list $1.60/M)
Capabilities
Streaming
Vision
JSON Output
Qwen3 Coder Plus
alibaba
qwen3-coder-plus
Providers
Alibaba Cloud
alibaba/qwen3-coder-plus
Context Size
1M
Stability
stable
Pricing (20% off)
Input
$4.80/M (list $6.00/M)
Cached
—/M
Output
$48.00/M (list $60.00/M)
Capabilities
Streaming
Tools
JSON Output
Seedream 4.0
bytedance
seedream-4-0
Providers
ByteDance
bytedance/seedream-4-0
Context Size
2k
Stability
stable
Pricing (10% off)
Input
$0.00/M (list $0.00/M)
Cached
—/M
Output
$0.00/M (list $0.00/M)
Per Request
$0.035/req
Capabilities
Image Generation
Seed 1.6 (250915)
bytedance
seed-1-6-250915
Providers
ByteDance
bytedance/seed-1-6-250915
Context Size
256k
Stability
stable
Pricing
Input
$0.25/M
Cached
$0.05/M
Output
$2.00/M
Capabilities
Streaming
Vision
Tools
Reasoning
JSON Output
Qwen3 Next 80B A3B Instruct
alibaba
qwen3-next-80b-a3b-instruct
Providers
Alibaba Cloud
alibaba/qwen3-next-80b-a3b-instruct
Context Size
129.0k
Stability
stable
Pricing (20% off)
Input
$0.40/M (list $0.50/M)
Cached
—/M
Output
$1.60/M (list $2.00/M)
Capabilities
Streaming
Tools
JSON Output
Qwen3 Next 80B A3B Thinking
alibaba
qwen3-next-80b-a3b-thinking
Providers
Alibaba Cloud
alibaba/qwen3-next-80b-a3b-thinking
Context Size
131.1k
Stability
unstable
Pricing (20% off)
Input
$0.40/M (list $0.50/M)
Cached
—/M
Output
$4.80/M (list $6.00/M)
Capabilities
Streaming
Tools
Reasoning
Qwen Max
alibaba
qwen-max
Providers
Alibaba Cloud
alibaba/qwen-max
Context Size
131.1k
Stability
stable
Pricing (20% off)
Input
$1.28/M (list $1.60/M)
Cached
—/M
Output
$5.12/M (list $6.40/M)
Capabilities
Streaming
Vision
Tools
JSON Output
Grok Code Fast 1
xai
grok-code-fast-1
Providers
xAI
xai/grok-code-fast-1
Context Size
256k
Stability
stable
Pricing
Input
$0.20/M
Cached
—/M
Output
$1.50/M
Capabilities
Streaming
Tools
JSON Output
Gemini 2.5 Flash
google
gemini-2.5-flash
Providers
Google AI Studio
google-ai-studio/gemini-2.5-flash
Context Size
1.0M
Stability
stable
Pricing
Input
$0.30/M
Cached
$0.03/M
Output
$2.50/M
Capabilities
Streaming
Vision
Tools
Reasoning
Reasoning Budget
JSON Output
Structured JSON Output
Native Web Search
DeepSeek V3.1
deepseek
deepseek-v3.1
Providers
ByteDance
bytedance/deepseek-v3.1
Context Size
128k
Stability
stable
Pricing
Input
$0.56/M
Cached
$0.11/M
Output
$1.68/M
Capabilities
Streaming
Tools
Reasoning
Qwen Image Edit Plus
alibaba
qwen-image-edit-plus
Providers
Alibaba Cloud
alibaba/qwen-image-edit-plus
Context Size
2k
Stability
stable
Pricing (20% off)
Input
$0.00/M (list $0.00/M)
Cached
—/M
Output
$0.00/M (list $0.00/M)
Per Request
$0.040/req
Capabilities
Vision
Image Generation
GLM-4.5 Flash
glm
glm-4.5-flash
Providers
Z AI
zai/glm-4.5-flash
Context Size
128k
Stability
stable
Pricing
Input
$0.00/M
Cached
$0.00/M
Output
$0.00/M
Capabilities
Streaming
Tools
JSON Output
GLM-4.5V
glm
glm-4.5v
Providers
NovitaAI
novita/glm-4.5v
Context Size
65.5k
Stability
stable
Pricing
Input
$0.60/M
Cached
$0.11/M
Output
$1.80/M
Capabilities
Streaming
Vision
Tools
Reasoning
JSON Output
Claude Opus 4.1
anthropic
claude-opus-4-1-20250805
Providers
Anthropic
anthropic/claude-opus-4-1-20250805
Context Size
200k
Stability
stable
Pricing
Input
$15.00/M
Cached
$1.50/M
Output
$75.00/M
Capabilities
Streaming
Vision
Tools
Reasoning
Reasoning Budget
Structured JSON Output
Native Web Search
GPT OSS 20B
openai
gpt-oss-20b
Providers
Groq
groq/gpt-oss-20b
Context Size
131.1k
Stability
stable
Pricing
Input
$0.10/M
Cached
—/M
Output
$0.50/M
Capabilities
Streaming
Tools
Reasoning
JSON Output
GPT OSS 120B
openai
gpt-oss-120b
Providers
ByteDance
bytedance/gpt-oss-120b
Context Size
128k
Stability
stable
Pricing
Input
$0.10/M
Cached
$0.02/M
Output
$0.50/M
Capabilities
Streaming
Tools
Reasoning
Qwen Image
alibaba
qwen-image
Providers
Alibaba Cloud
alibaba/qwen-image
Context Size
2k
Stability
stable
Pricing (20% off)
Input
$0.00/M (list $0.00/M)
Cached
—/M
Output
$0.00/M (list $0.00/M)
Per Request
$0.035/req
Capabilities
Image Generation
Qwen Image Max
alibaba
qwen-image-max
Providers
Alibaba Cloud
alibaba/qwen-image-max
Context Size
2k
Stability
stable
Pricing
Input
$0.00/M
Cached
—/M
Output
$0.00/M
Per Request
$0.075/req
Capabilities
Image Generation
Qwen Image Plus
alibaba
qwen-image-plus
Providers
Alibaba Cloud
alibaba/qwen-image-plus
Context Size
2k
Stability
stable
Pricing (20% off)
Input
$0.00/M (list $0.00/M)
Cached
—/M
Output
$0.00/M (list $0.00/M)
Per Request
$0.030/req
Capabilities
Image Generation
GPT-5 Pro
openai
gpt-5-pro
Providers
OpenAI
openai/gpt-5-pro
Context Size
400k
Stability
stable
Pricing
Input
$15.00/M
Cached
—/M
Output
$120.00/M
Capabilities
Streaming
Vision
Tools
Reasoning
JSON Output
Structured JSON Output
Native Web Search
GPT-5 Chat Latest
openai
gpt-5-chat-latest
Providers
OpenAI
openai/gpt-5-chat-latest
Context Size
400k
Stability
stable
Pricing
Input
$1.25/M
Cached
$0.13/M
Output
$10.00/M
Capabilities
Streaming
Vision
JSON Output
Structured JSON Output
GPT-5 Nano
openai
gpt-5-nano
Providers
Azure
azure/gpt-5-nano
Context Size
400k
Stability
unstable
Pricing
Input
$0.05/M
Cached
$0.01/M
Output
$0.40/M
Capabilities
Streaming
Tools
Reasoning
JSON Output
Structured JSON Output
GPT-5 Mini
openai
gpt-5-mini
Providers
Azure
azure/gpt-5-mini
Context Size
400k
Stability
unstable
Pricing
Input
$0.25/M
Cached
$0.03/M
Output
$2.00/M
Capabilities
Streaming
Vision
Tools
Reasoning
JSON Output
Structured JSON Output
GPT-5
openai
gpt-5
Providers
Azure
azure/gpt-5
Context Size
400k
Stability
unstable
Pricing
Input
$1.25/M
Cached
$0.13/M
Output
$10.00/M
Capabilities
Streaming
Vision
Tools
Reasoning
JSON Output
Structured JSON Output
Qwen3 Coder 30B A3B Instruct
alibaba
qwen3-coder-30b-a3b-instruct
Providers
Nebius AI
nebius/qwen3-coder-30b-a3b-instruct
Context Size
262k
Stability
stable
Pricing
Input
$0.10/M
Cached
—/M
Output
$0.30/M
Capabilities
Streaming
Tools
JSON Output
Codestral
mistral
codestral-2508
Providers
Mistral AI
mistral/codestral-2508
Context Size
256k
Stability
stable
Pricing
Input
$0.30/M
Cached
—/M
Output
$0.90/M
Capabilities
Streaming
JSON Output
Qwen3 30B A3B Thinking 2507
alibaba
qwen3-30b-a3b-thinking-2507
Providers
Nebius AI
nebius/qwen3-30b-a3b-thinking-2507
Context Size
262k
Stability
stable
Pricing
Input
$0.10/M
Cached
—/M
Output
$0.30/M
Capabilities
Streaming
Tools
Reasoning
JSON Output
Qwen3 30B A3B Instruct 2507
alibaba
qwen3-30b-a3b-instruct-2507
Providers
Nebius AI
nebius/qwen3-30b-a3b-instruct-2507
Context Size
262k
Stability
stable
Pricing
Input
$0.10/M
Cached
—/M
Output
$0.30/M
Capabilities
Streaming
Tools
JSON Output
GLM-4.5 AirX
glm
glm-4.5-airx
Providers
Z AI
zai/glm-4.5-airx
Context Size
128k
Stability
stable
Pricing (10% off)
Input
$0.99/M (list $1.10/M)
Cached
$0.20/M (list $0.22/M)
Output
$4.05/M (list $4.50/M)
Capabilities
Streaming
Tools
JSON Output
GLM-4.5 X
glm
glm-4.5-x
Providers
Z AI
zai/glm-4.5-x
Context Size
128k
Stability
unstable
Pricing (10% off)
Input
$1.98/M (list $2.20/M)
Cached
$0.41/M (list $0.45/M)
Output
$8.01/M (list $8.90/M)
Capabilities
Streaming
Tools
Reasoning
JSON Output
GLM-4.5
glm
glm-4.5
Providers
Z AI
zai/glm-4.5
Context Size
128k
Stability
stable
Pricing (10% off)
Input
$0.54/M (list $0.60/M)
Cached
$0.10/M (list $0.11/M)
Output
$1.98/M (list $2.20/M)
Capabilities
Streaming
Tools
Reasoning
JSON Output
Native Web Search
Seed 1.6 Flash (250715)
bytedance
seed-1-6-flash-250715
Providers
ByteDance
bytedance/seed-1-6-flash-250715
Context Size
256k
Stability
stable
Pricing
Input
$0.07/M
Cached
$0.02/M
Output
$0.30/M
Capabilities
Streaming
Vision
Tools
Reasoning
JSON Output
GLM-4.5 Air
glm
glm-4.5-air
Providers
Z AI
zai/glm-4.5-air
Context Size
128k
Stability
stable
Pricing (10% off)
Input
$0.18/M (list $0.20/M)
Cached
$0.03/M (list $0.03/M)
Output
$0.99/M (list $1.10/M)
Capabilities
Streaming
Tools
JSON Output
Qwen3 235B A22B Thinking 2507
alibaba
qwen3-235b-a22b-thinking-2507
Providers
Nebius AI
nebius/qwen3-235b-a22b-thinking-2507
Context Size
262k
Stability
unstable
Pricing
Input
$0.20/M
Cached
—/M
Output
$0.60/M
Capabilities
Streaming
Tools
Reasoning
JSON Output
Qwen3 Coder
alibaba
Model Deactivated
qwen3-coder
Providers
CanopyWave
canopywave/qwen3-coder
Context Size
262k
Stability
stable
Deactivated since Feb 1, 2026
Pricing (30% off)
Input
$0.15/M (list $0.22/M)
Cached
—/M
Output
$0.66/M (list $0.95/M)
Capabilities
Streaming
Tools
JSON Output
Qwen3 Coder Flash
alibaba
qwen3-coder-flash
Providers
Alibaba Cloud
alibaba/qwen3-coder-flash
Context Size
1M
Stability
stable
Pricing (20% off)
Input
$0.24/M (list $0.30/M)
Cached
$0.05/M (list $0.06/M)
Output
$1.20/M (list $1.50/M)
Capabilities
Streaming
Tools
JSON Output
Gemini 2.5 Flash Lite
google
gemini-2.5-flash-lite
Providers
Google AI Studio
google-ai-studio/gemini-2.5-flash-lite
Context Size
1.0M
Stability
stable
Pricing
Input
$0.10/M
Cached
$0.01/M
Output
$0.40/M
Capabilities
Streaming
Vision
Tools
JSON Output
Structured JSON Output
Devstral Small 1.1
mistral
devstral-small-2507
Providers
Mistral AI
mistral/devstral-small-2507
Context Size
131.1k
Stability
stable
Pricing
Input
$0.10/M
Cached
—/M
Output
$0.30/M
Capabilities
Streaming
JSON Output
Qwen3 235B A22B Instruct 2507
alibaba
qwen3-235b-a22b-instruct-2507
Providers
Cerebras
cerebras/qwen3-235b-a22b-instruct-2507
Context Size
262k
Stability
stable
Pricing
Input
$0.60/M
Cached
—/M
Output
$1.20/M
Capabilities
Streaming
Tools
JSON Output
Kimi K2
moonshot
kimi-k2
Providers
ByteDance
bytedance/kimi-k2
Context Size
256k
Stability
stable
Pricing
Input
$0.60/M
Cached
$0.12/M
Output
$2.50/M
Capabilities
Streaming
Tools
Grok 4 Fast Reasoning
xai
grok-4-fast-reasoning
Providers
xAI
xai/grok-4-fast-reasoning
Context Size
2M
Stability
stable
Pricing
Input
$0.20/M
Cached
$0.05/M
Output
$0.50/M
Capabilities
Streaming
Vision
Tools
JSON Output
Grok 4
xai
grok-4
Providers
xAI
xai/grok-4
Context Size
256k
Stability
stable
Pricing
Input
$3.00/M
Cached
—/M
Output
$15.00/M
Capabilities
Streaming
Vision
Tools
JSON Output
Grok 4 (0709)
xai
grok-4-0709
Providers
xAI
xai/grok-4-0709
Context Size
256k
Stability
stable
Pricing
Input
$3.00/M
Cached
—/M
Output
$15.00/M
Capabilities
Streaming
Tools
JSON Output
Gemma 3n E4B IT
google
gemma-3n-e4b-it
Providers
Google AI Studio
google-ai-studio/gemma-3n-e4b-it
Context Size
1M
Stability
stable
Pricing
Input
$0.07/M
Cached
—/M
Output
$0.30/M
Capabilities
Streaming
Gemma 3n E2B IT
google
gemma-3n-e2b-it
Providers
Google AI Studio
google-ai-studio/gemma-3n-e2b-it
Context Size
1M
Stability
stable
Pricing
Input
$0.07/M
Cached
—/M
Output
$0.30/M
Capabilities
Streaming
Seed 1.6 (250615)
bytedance
seed-1-6-250615
Providers
ByteDance
bytedance/seed-1-6-250615
Context Size
256k
Stability
stable
Pricing
Input
$0.25/M
Cached
$0.05/M
Output
$2.00/M
Capabilities
Streaming
Vision
Tools
Reasoning
JSON Output
Mistral Small 3.2
mistral
mistral-small-2506
Providers
Mistral AI
mistral/mistral-small-2506
Context Size
128k
Stability
stable
Pricing
Input
$0.10/M
Cached
—/M
Output
$0.30/M
Capabilities
Streaming
Vision
JSON Output
Gemini 2.5 Pro Preview (06-05)
google
Model Deactivated
gemini-2.5-pro-preview-06-05
Providers
Google AI Studio
google-ai-studio/gemini-2.5-pro-preview-06-05
Context Size
1M
Stability
stable
Deactivated since Jul 15, 2025
Pricing
Input
$1.25/M
Cached
—/M
Output
$10.00/M
Capabilities
Streaming
Vision
Tools
Reasoning
Reasoning Budget
JSON Output
Structured JSON Output
o3 Mini
openai
o3-mini
Providers
Azure
azure/o3-mini
Context Size
200k
Stability
unstable
Pricing
Input
$1.10/M
Cached
—/M
Output
$4.40/M
Capabilities
Streaming
JSON Output
Structured JSON Output
o3
openai
o3
Providers
Azure
azure/o3
Context Size
200k
Stability
unstable
Pricing
Input
$2.00/M
Cached
—/M
Output
$8.00/M
Capabilities
Vision
JSON Output
Structured JSON Output
DeepSeek R1 (0528)
deepseek
deepseek-r1-0528
Providers
DeepSeek
deepseek/deepseek-r1-0528
Context Size
64k
Stability
stable
Pricing
Input
$0.55/M
Cached
—/M
Output
$2.19/M
Capabilities
Streaming
Claude Opus 4 (2025-05-14)
anthropic
claude-opus-4-20250514
Providers
Anthropic
anthropic/claude-opus-4-20250514
Context Size
200k
Stability
stable
Pricing
Input
$15.00/M
Cached
$1.50/M
Output
$75.00/M
Capabilities
Streaming
Tools
Reasoning
Reasoning Budget
Native Web Search
Gemini 2.5 Flash Preview (05-20)
google
Model Deactivated
gemini-2.5-flash-preview-05-20
Providers
Google AI Studio
google-ai-studio/gemini-2.5-flash-preview-05-20
Context Size
1M
Stability
stable
Deactivated since Jul 15, 2025
Pricing
Input
$0.15/M
Cached
—/M
Output
$0.60/M
Capabilities
Streaming
Vision
Tools
Reasoning
Reasoning Budget
JSON Output
Structured JSON Output
Claude Sonnet 4 (2025-05-14)
anthropic
claude-sonnet-4-20250514
Providers
Anthropic
anthropic/claude-sonnet-4-20250514
Context Size
200k
Stability
stable
Pricing
Input
$3.00/M
Cached
$0.30/M
Output
$15.00/M
Capabilities
Streaming
Tools
Reasoning
Reasoning Budget
Native Web Search
Gemini 2.5 Pro Preview (05-06)
google
Model Deactivated
gemini-2.5-pro-preview-05-06
Providers
Google AI Studio
google-ai-studio/gemini-2.5-pro-preview-05-06
Context Size
1M
Stability
stable
Deactivated since Jul 15, 2025
Pricing
Input
$1.25/M
Cached
—/M
Output
$10.00/M
Capabilities
Streaming
Vision
Tools
Reasoning
Reasoning Budget
JSON Output
Structured JSON Output
Llama Guard 4 12B
meta
llama-guard-4-12b
Providers
Groq
groq/llama-guard-4-12b
Context Size
131.1k
Stability
stable
Pricing
Input
$0.20/M
Cached
—/M
Output
$0.20/M
Capabilities
Streaming
Qwen3 4B FP8
alibaba
qwen3-4b-fp8
Providers
NovitaAI
novita/qwen3-4b-fp8
Context Size
128k
Stability
stable
Pricing
Input
$0.03/M
Cached
—/M
Output
$0.03/M
Capabilities
Streaming
Qwen3 30B A3B FP8
alibaba
qwen3-30b-a3b-fp8
Providers
NovitaAI
novita/qwen3-30b-a3b-fp8
Context Size
41.0k
Stability
stable
Pricing
Input
$0.09/M
Cached
—/M
Output
$0.45/M
Capabilities
Streaming
Qwen3 32B FP8
alibaba
qwen3-32b-fp8
Providers
NovitaAI
novita/qwen3-32b-fp8
Context Size
41.0k
Stability
stable
Pricing
Input
$0.10/M
Cached
—/M
Output
$0.45/M
Capabilities
Streaming
Qwen3 235B A22B FP8
alibaba
qwen3-235b-a22b-fp8
Providers
NovitaAI
novita/qwen3-235b-a22b-fp8
Context Size
41.0k
Stability
stable
Pricing
Input
$0.20/M
Cached
—/M
Output
$0.80/M
Capabilities
Streaming
JSON Output
Qwen3 30B A3B
alibaba
Model Deactivated
qwen3-30b-a3b
Providers
Nebius AI
nebius/qwen3-30b-a3b
Context Size
32.8k
Stability
stable
Deactivated since Nov 3, 2025
Pricing
Input
$0.10/M
Cached
—/M
Output
$0.30/M
Capabilities
Streaming
Tools
JSON Output
Qwen3 32B
alibaba
qwen3-32b
Providers
Cerebras
cerebras/qwen3-32b
Context Size
32.8k
Stability
stable
Deactivated since Feb 16, 2026
Pricing
Input
$0.40/M
Cached
—/M
Output
$0.80/M
Capabilities
Streaming
Tools
JSON Output
Qwen3 14B
alibaba
Model Deactivated
qwen3-14b
Providers
Nebius AI
nebius/qwen3-14b
Context Size
32.8k
Stability
stable
Deactivated since Nov 3, 2025
Pricing
Input
$0.08/M
Cached
—/M
Output
$0.24/M
Capabilities
Streaming
Tools
JSON Output
Gemini 2.5 Flash Preview Thinking (04-17)
google
Model Deactivated
gemini-2.5-flash-preview-04-17-thinking
Providers
Google AI Studio
google-ai-studio/gemini-2.5-flash-preview-04-17-thinking
Context Size
1M
Stability
stable
Deactivated since Jul 22, 2025
Pricing
Input
$0.15/M
Cached
—/M
Output
$0.60/M
Capabilities
Streaming
Vision
Tools
Reasoning
Reasoning Budget
JSON Output
Structured JSON Output
Gemini 2.5 Flash Preview (04-17)
google
Model Deactivated
gemini-2.5-flash-preview-04-17
Providers
Google AI Studio
google-ai-studio/gemini-2.5-flash-preview-04-17
Context Size
1M
Stability
stable
Deactivated since Jul 15, 2025
Pricing
Input
$0.15/M
Cached
—/M
Output
$0.60/M
Capabilities
Streaming
Vision
Tools
Reasoning
Reasoning Budget
JSON Output
Structured JSON Output
GLM-4 32B (0414-128k)
glm
glm-4-32b-0414-128k
Providers
Z AI
zai/glm-4-32b-0414-128k
Context Size
128k
Stability
stable
Pricing (10% off)
Input
$0.09/M (list $0.10/M)
Cached
$0.00/M (list $0.00/M)
Output
$0.09/M (list $0.10/M)
Capabilities
Streaming
Tools
JSON Output
GPT-4.1 Nano
openai
gpt-4.1-nano
Providers
Azure
azure/gpt-4.1-nano
Context Size
1M
Stability
unstable
Pricing
Input
$0.10/M
Cached
—/M
Output
$0.40/M
Capabilities
Streaming
Vision
Tools
JSON Output
Structured JSON Output
GPT-4.1 Mini
openai
gpt-4.1-mini
Providers
Azure
azure/gpt-4.1-mini
Context Size
1M
Stability
unstable
Pricing
Input
$0.40/M
Cached
—/M
Output
$1.60/M
Capabilities
Streaming
Vision
Tools
JSON Output
Structured JSON Output
GPT-4.1
openai
gpt-4.1
Providers
Azure
azure/gpt-4.1
Context Size
1M
Stability
unstable
Pricing
Input
$2.00/M
Cached
—/M
Output
$8.00/M
Capabilities
Streaming
Vision
Tools
JSON Output
Structured JSON Output
Llama 3.1 Nemotron Ultra 253B
meta
llama-3.1-nemotron-ultra-253b
Providers
Nebius AI
nebius/llama-3.1-nemotron-ultra-253b
Context Size
128k
Stability
stable
Pricing
Input
$0.60/M
Cached
—/M
Output
$1.80/M
Capabilities
Streaming
JSON Output
Llama 4 Maverick 17B Instruct
meta
llama-4-maverick-17b-instruct
Providers
AWS Bedrock
aws-bedrock/llama-4-maverick-17b-instruct
Context Size
8.2k
Stability
unstable
Pricing
Input
$0.24/M
Cached
—/M
Output
$0.97/M
Capabilities
Streaming
Vision
Llama 4 Scout 17B Instruct
meta
llama-4-scout-17b-instruct
Providers
AWS Bedrock
aws-bedrock/llama-4-scout-17b-instruct
Context Size
8.2k
Stability
unstable
Pricing
Input
$0.17/M
Cached
—/M
Output
$0.66/M
Capabilities
Streaming
Vision
Llama 4 Scout
meta
llama-4-scout
Providers
Together AI
together.ai/llama-4-scout
Context Size
32.8k
Stability
unstable
Pricing
Input
$0.18/M
Cached
—/M
Output
$0.59/M
Capabilities
Streaming
Tools
Llama 3 8B Instruct
meta
llama-3-8b-instruct
Providers
NovitaAI
novita/llama-3-8b-instruct
Context Size
8.2k
Stability
stable
Pricing
Input
$0.04/M
Cached
—/M
Output
$0.04/M
Capabilities
Streaming
JSON Output
Qwen Omni Turbo
alibaba
qwen-omni-turbo
Providers
Alibaba Cloud
alibaba/qwen-omni-turbo
Context Size
32.8k
Stability
stable
Pricing (20% off)
Input
$0.16/M (20% off list price $0.20/M)
Cached
—/M
Output
$0.64/M (20% off list price $0.80/M)
Capabilities
Streaming
Vision
JSON Output
Gemini 2.5 Pro
google
gemini-2.5-pro
Providers
Google AI Studio
google-ai-studio/gemini-2.5-pro
Context Size
1.0M
Stability
stable
Pricing
Input
$1.25/M
Cached
$0.13/M
Output
$10.00/M
Capabilities
Streaming
Vision
Tools
Reasoning
Reasoning Budget
JSON Output
Structured JSON Output
Native Web Search
Gemma 3 27B
google
gemma-3-27b
Providers
Nebius AI
nebius/gemma-3-27b
Context Size
128k
Stability
stable
Pricing
Input
$0.27/M
Cached
—/M
Output
$0.27/M
Capabilities
Streaming
Vision
Gemma 3 1B IT
google
gemma-3-1b-it
Providers
Google AI Studio
google-ai-studio/gemma-3-1b-it
Context Size
1M
Stability
stable
Pricing
Input
$0.07/M
Cached
—/M
Output
$0.30/M
Capabilities
Streaming
Gemma 3 12B IT
google
gemma-3-12b-it
Providers
Google AI Studio
google-ai-studio/gemma-3-12b-it
Context Size
1M
Stability
stable
Pricing
Input
$0.07/M
Cached
—/M
Output
$0.30/M
Capabilities
Streaming
Gemma 3 4B IT
google
gemma-3-4b-it
Providers
Google AI Studio
google-ai-studio/gemma-3-4b-it
Context Size
1M
Stability
stable
Pricing
Input
$0.07/M
Cached
—/M
Output
$0.30/M
Capabilities
Streaming
Sonar Pro
perplexity
sonar-pro
Providers
Perplexity
perplexity/sonar-pro
Context Size
200k
Stability
stable
Pricing
Input
$3.00/M
Cached
—/M
Output
$15.00/M
Per Request
$0.005/req
Capabilities
Streaming
Structured JSON Output
Sonar Reasoning Pro
perplexity
sonar-reasoning-pro
Providers
Perplexity
perplexity/sonar-reasoning-pro
Context Size
128k
Stability
stable
Pricing
Input
$2.00/M
Cached
—/M
Output
$8.00/M
Per Request
$0.005/req
Capabilities
Streaming
Structured JSON Output
QwQ Plus
alibaba
qwq-plus
Providers
Alibaba Cloud
alibaba/qwq-plus
Context Size
131.1k
Stability
stable
Pricing (20% off)
Input
$0.64/M (20% off list price $0.80/M)
Cached
—/M
Output
$1.92/M (20% off list price $2.40/M)
Capabilities
Streaming
Reasoning
CogView-4
zai
cogview-4
Providers
Z AI
zai/cogview-4
Context Size
2k
Stability
stable
Pricing (10% off)
Input
$0.00/M (10% off list price $0.00/M)
Cached
—/M
Output
$0.00/M (10% off list price $0.00/M)
Per Request
$0.010/req
Capabilities
Image Generation
Qwen QwQ 32B
alibaba
Model Deactivated
qwen-qwq-32b
Providers
Nebius AI
nebius/qwen-qwq-32b
Context Size
32.8k
Stability
stable
Deactivated since Nov 3, 2025
Pricing
Input
$0.15/M
Cached
—/M
Output
$0.45/M
Capabilities
Streaming
JSON Output
Claude 3.7 Sonnet
anthropic
claude-3-7-sonnet
Providers
Anthropic
anthropic/claude-3-7-sonnet
Context Size
200k
Stability
stable
Pricing
Input
$3.00/M
Cached
$0.30/M
Output
$15.00/M
Capabilities
Streaming
Tools
Reasoning
Reasoning Budget
Native Web Search
Qwen2.5 VL 32B Instruct
alibaba
qwen2-5-vl-32b-instruct
Providers
Alibaba Cloud
alibaba/qwen2-5-vl-32b-instruct
Context Size
131.1k
Stability
stable
Pricing (20% off)
Input
$1.12/M (20% off list price $1.40/M)
Cached
—/M
Output
$3.36/M (20% off list price $4.20/M)
Capabilities
Streaming
Vision
JSON Output
Claude 3.7 Sonnet (2025-02-19)
anthropic
claude-3-7-sonnet-20250219
Providers
Anthropic
anthropic/claude-3-7-sonnet-20250219
Context Size
200k
Stability
stable
Pricing
Input
$3.00/M
Cached
$0.30/M
Output
$15.00/M
Capabilities
Streaming
Tools
Reasoning
Reasoning Budget
Native Web Search
Grok-3
xai
grok-3
Providers
xAI
xai/grok-3
Context Size
131.1k
Stability
stable
Pricing
Input
$3.00/M
Cached
—/M
Output
$15.00/M
Capabilities
Streaming
Tools
JSON Output
Qwen VL Plus
alibaba
qwen-vl-plus
Providers
Alibaba Cloud
alibaba/qwen-vl-plus
Context Size
131.1k
Stability
stable
Pricing (20% off)
Input
$0.17/M (20% off list price $0.21/M)
Cached
—/M
Output
$0.51/M (20% off list price $0.64/M)
Capabilities
Streaming
Vision
JSON Output
Qwen VL Max
alibaba
qwen-vl-max
Providers
Alibaba Cloud
alibaba/qwen-vl-max
Context Size
131.1k
Stability
stable
Pricing (20% off)
Input
$0.64/M (20% off list price $0.80/M)
Cached
—/M
Output
$2.56/M (20% off list price $3.20/M)
Capabilities
Streaming
Vision
JSON Output
Qwen Turbo
alibaba
qwen-turbo
Providers
Alibaba Cloud
alibaba/qwen-turbo
Context Size
1M
Stability
stable
Pricing (20% off)
Input
$0.04/M (20% off list price $0.05/M)
Cached
—/M
Output
$0.16/M (20% off list price $0.20/M)
Capabilities
Streaming
JSON Output
Qwen3 Coder 480B A35B Instruct
alibaba
qwen3-coder-480b-a35b-instruct
Providers
CanopyWave
canopywave/qwen3-coder-480b-a35b-instruct
Context Size
262.1k
Stability
stable
Deactivated since Feb 1, 2026
Pricing (30% off)
Input
$0.21/M (30% off list price $0.30/M)
Cached
—/M
Output
$0.91/M (30% off list price $1.30/M)
Capabilities
Streaming
Tools
JSON Output
Qwen2.5 VL 72B Instruct
alibaba
qwen2-5-vl-72b-instruct
Providers
Nebius AI
nebius/qwen2-5-vl-72b-instruct
Context Size
32.8k
Stability
stable
Pricing
Input
$0.13/M
Cached
—/M
Output
$0.40/M
Capabilities
Streaming
Vision
JSON Output
Qwen Plus
alibaba
qwen-plus
Providers
Alibaba Cloud
alibaba/qwen-plus
Context Size
131.1k
Stability
stable
Pricing (20% off)
Input
$0.32/M (20% off list price $0.40/M)
Cached
$0.06/M (20% off list price $0.08/M)
Output
$0.96/M (20% off list price $1.20/M)
Capabilities
Streaming
Tools
JSON Output
Qwen Max Latest
alibaba
qwen-max-latest
Providers
Alibaba Cloud
alibaba/qwen-max-latest
Context Size
131.1k
Stability
stable
Pricing (20% off)
Input
$1.28/M (20% off list price $1.60/M)
Cached
—/M
Output
$5.12/M (20% off list price $6.40/M)
Capabilities
Streaming
Vision
Tools
JSON Output
DeepSeek R1 Distill Llama 70B
deepseek
Model Deactivated
deepseek-r1-distill-llama-70b
Providers
Groq
groq/deepseek-r1-distill-llama-70b
Context Size
131.1k
Stability
stable
Deactivated since Oct 9, 2025
Pricing
Input
$0.75/M
Cached
—/M
Output
$0.99/M
Capabilities
Streaming
Tools
JSON Output
MiniMax Text 01
minimax
minimax-text-01
Providers
MiniMax
minimax/minimax-text-01
Context Size
1M
Stability
stable
Pricing
Input
$0.20/M
Cached
—/M
Output
$1.10/M
Capabilities
Streaming
GLM-Image
glm
glm-image
Providers
Z AI
zai/glm-image
Context Size
2k
Stability
stable
Pricing (10% off)
Input
$0.00/M (10% off list price $0.00/M)
Cached
—/M
Output
$0.00/M (10% off list price $0.00/M)
Per Request
$0.015/req
Capabilities
Image Generation
Sonar
perplexity
sonar
Providers
Perplexity
perplexity/sonar
Context Size
130k
Stability
stable
Pricing
Input
$1.00/M
Cached
—/M
Output
$1.00/M
Per Request
$0.005/req
Capabilities
Streaming
Structured JSON Output
DeepSeek V3
deepseek
Model Deactivated
deepseek-v3
Providers
Nebius AI
nebius/deepseek-v3
Context Size
64k
Stability
unstable
Deactivated since Nov 3, 2025
Pricing
Input
$0.50/M
Cached
—/M
Output
$1.50/M
Capabilities
Streaming
Llama 3.3 70B Instruct
meta
llama-3.3-70b-instruct
Providers
Cerebras
cerebras/llama-3.3-70b-instruct
Context Size
128k
Stability
stable
Pricing
Input
$0.85/M
Cached
—/M
Output
$1.20/M
Capabilities
Streaming
Tools
JSON Output
Pixtral Large Latest
mistral
pixtral-large-latest
Providers
Mistral AI
mistral/pixtral-large-latest
Context Size
128k
Stability
stable
Pricing
Input
$4.00/M
Cached
—/M
Output
$12.00/M
Capabilities
Streaming
Vision
Claude 3.5 Sonnet (2024-10-22)
anthropic
Model Deactivated
claude-3-5-sonnet-20241022
Providers
Anthropic
anthropic/claude-3-5-sonnet-20241022
Context Size
200k
Stability
stable
Deactivated since Oct 22, 2025
Pricing
Input
$3.00/M
Cached
$0.30/M
Output
$15.00/M
Capabilities
Streaming
Tools
Native Web Search
Gemini 1.5 Flash 8B
google
Model Deactivated
gemini-1.5-flash-8b
Providers
Google AI Studio
google-ai-studio/gemini-1.5-flash-8b
Context Size
1M
Stability
stable
Deactivated since Sep 20, 2025
Pricing
Input
$0.04/M
Cached
—/M
Output
$0.15/M
Capabilities
Streaming
Tools
Reasoning
Reasoning Budget
JSON Output
Structured JSON Output
GPT-4o Mini Search Preview
openai
gpt-4o-mini-search-preview
Providers
OpenAI
openai/gpt-4o-mini-search-preview
Context Size
128k
Stability
stable
Pricing
Input
$0.15/M
Cached
—/M
Output
$0.60/M
Capabilities
Streaming
Vision
Native Web Search
GPT-4o Search Preview
openai
gpt-4o-search-preview
Providers
OpenAI
openai/gpt-4o-search-preview
Context Size
128k
Stability
stable
Pricing
Input
$2.50/M
Cached
—/M
Output
$10.00/M
Capabilities
Streaming
Vision
Native Web Search
Llama 3.2 11B Instruct
meta
llama-3.2-11b-instruct
Providers
Inference.net
inference.net/llama-3.2-11b-instruct
Context Size
128k
Stability
unstable
Pricing
Input
$0.07/M
Cached
—/M
Output
$0.33/M
Capabilities
Streaming
JSON Output
Qwen2 VL 72B Instruct
alibaba
Model Deactivated
qwen2-vl-72b-instruct
Providers
Nebius AI
nebius/qwen2-vl-72b-instruct
Context Size
32.8k
Stability
stable
Deactivated since Sep 10, 2025
Pricing
Input
$0.13/M
Cached
—/M
Output
$0.40/M
Capabilities
Streaming
Vision
JSON Output
Qwen2.5 72B Instruct
alibaba
Model Deactivated
qwen25-72b-instruct
Providers
Nebius AI
nebius/qwen25-72b-instruct
Context Size
32.8k
Stability
stable
Deactivated since Nov 3, 2025
Pricing
Input
$0.13/M
Cached
—/M
Output
$0.40/M
Capabilities
Streaming
Tools
JSON Output
Qwen2.5 32B Instruct
alibaba
Model Deactivated
qwen25-32b-instruct
Providers
Nebius AI
nebius/qwen25-32b-instruct
Context Size
32.8k
Stability
stable
Deactivated since Sep 10, 2025
Pricing
Input
$0.06/M
Cached
—/M
Output
$0.20/M
Capabilities
Streaming
Tools
JSON Output
Qwen2.5 Coder 7B
alibaba
qwen25-coder-7b
Providers
Nebius AI
nebius/qwen25-coder-7b
Context Size
32.8k
Stability
stable
Pricing
Input
$0.01/M
Cached
—/M
Output
$0.03/M
Capabilities
Streaming
JSON Output
Llama 3.2 3B Instruct
meta
llama-3.2-3b-instruct
Providers
NovitaAI
novita/llama-3.2-3b-instruct
Context Size
32.8k
Stability
unstable
Pricing
Input
$0.03/M
Cached
—/M
Output
$0.05/M
Capabilities
Streaming
JSON Output
Qwen Coder Plus
alibaba
qwen-coder-plus
Providers
Alibaba Cloud
alibaba/qwen-coder-plus
Context Size
131.1k
Stability
stable
Pricing (20% off)
Input
$0.80/M (20% off list price $1.00/M)
Cached
—/M
Output
$4.00/M (20% off list price $5.00/M)
Capabilities
Streaming
Tools
JSON Output
o1 Mini
openai
o1-mini
Providers
Azure
azure/o1-mini
Context Size
128k
Stability
unstable
Pricing
Input
$1.10/M
Cached
—/M
Output
$4.40/M
Capabilities
o1
openai
o1
Providers
Azure
azure/o1
Context Size
200k
Stability
unstable
Pricing
Input
$15.00/M
Cached
—/M
Output
$60.00/M
Capabilities
Streaming
Vision
Reasoning
JSON Output
Structured JSON Output
Qwen Flash
alibaba
qwen-flash
Providers
Alibaba Cloud
alibaba/qwen-flash
Context Size
1M
Stability
stable
Pricing (20% off)
Input
$0.04/M (20% off list price $0.05/M)
Cached
$0.01/M (20% off list price $0.01/M)
Output
$0.32/M (20% off list price $0.40/M)
Capabilities
Streaming
Tools
JSON Output
Qwen Plus Latest
alibaba
qwen-plus-latest
Providers
Alibaba Cloud
alibaba/qwen-plus-latest
Context Size
1M
Stability
stable
Pricing (20% off)
Input
$0.32/M (20% off list price $0.40/M)
Cached
$0.06/M (20% off list price $0.08/M)
Output
$0.96/M (20% off list price $1.20/M)
Capabilities
Streaming
Tools
JSON Output
Hermes 3 Llama 405B
nousresearch
Model Deactivated
hermes-3-llama-405b
Providers
Nebius AI
nebius/hermes-3-llama-405b
Context Size
131.1k
Stability
stable
Deactivated since Nov 3, 2025
Pricing
Input
$1.00/M
Cached
—/M
Output
$3.00/M
Capabilities
Streaming
JSON Output
Llama 3.1 70B Instruct
meta
llama-3.1-70b-instruct
Providers
AWS Bedrock
aws-bedrock/llama-3.1-70b-instruct
Context Size
128k
Stability
unstable
Pricing
Input
$0.72/M
Cached
—/M
Output
$0.72/M
Capabilities
Streaming
Llama 3.1 405B Instruct
meta
Model Deactivated
llama-3.1-405b-instruct
Providers
Nebius AI
nebius/llama-3.1-405b-instruct
Context Size
128k
Stability
stable
Deactivated since Nov 3, 2025
Pricing
Input
$1.00/M
Cached
—/M
Output
$3.00/M
Capabilities
Streaming
Tools
JSON Output
Llama 3.1 8B Instruct
meta
llama-3.1-8b-instruct
Providers
AWS Bedrock
aws-bedrock/llama-3.1-8b-instruct
Context Size
128k
Stability
unstable
Pricing
Input
$0.22/M
Cached
—/M
Output
$0.22/M
Capabilities
Streaming
GPT-4o Mini
openai
gpt-4o-mini
Providers
OpenAI
openai/gpt-4o-mini
Context Size
128k
Stability
stable
Pricing
Input
$0.15/M
Cached
$0.07/M
Output
$0.60/M
Capabilities
Streaming
Tools
JSON Output
Structured JSON Output
Gemma 2 27B IT
google
gemma-2-27b-it-together
Providers
Together AI
together.ai/gemma-2-27b-it-together
Context Size
8.2k
Stability
stable
Pricing
Input
$0.08/M
Cached
—/M
Output
$0.08/M
Capabilities
Streaming
Gemma2 9B IT
google
Model Deactivated
gemma2-9b-it
Providers
Groq
groq/gemma2-9b-it
Context Size
8.1k
Stability
unstable
Deactivated since Oct 8, 2025
Pricing
Input
$0.20/M
Cached
—/M
Output
$0.20/M
Capabilities
Streaming
Tools
Claude 3.5 Sonnet (Old)
anthropic
Model Deactivated
claude-3-5-sonnet-20240620
Providers
Anthropic
anthropic/claude-3-5-sonnet-20240620
Context Size
200k
Stability
stable
Deactivated since Feb 19, 2026
Pricing
Input
$3.00/M
Cached
$0.30/M
Output
$15.00/M
Capabilities
Streaming
Vision
Tools
Claude 3.5 Sonnet
anthropic
claude-3-5-sonnet
Providers
Anthropic
anthropic/claude-3-5-sonnet
Context Size
200k
Stability
stable
Pricing
Input
$3.00/M
Cached
$0.30/M
Output
$15.00/M
Capabilities
Streaming
Tools
Native Web Search
Hermes 2 Pro Llama 3 8B
nousresearch
hermes-2-pro-llama-3-8b
Providers
NovitaAI
novita/hermes-2-pro-llama-3-8b
Context Size
8.2k
Stability
unstable
Pricing
Input
$0.14/M
Cached
—/M
Output
$0.14/M
Capabilities
Streaming
Gemini 1.5 Pro
google
Model Deactivated
gemini-1.5-pro
Providers
Google AI Studio
google-ai-studio/gemini-1.5-pro
Context Size
1M
Stability
stable
Deactivated since Sep 20, 2025
Pricing
Input
$2.50/M
Cached
—/M
Output
$10.00/M
Capabilities
Streaming
Vision
Tools
Reasoning
Reasoning Budget
JSON Output
Structured JSON Output
GPT-4o
openai
gpt-4o
Providers
Azure
azure/gpt-4o
Context Size
128k
Stability
unstable
Pricing
Input
$2.50/M
Cached
$1.25/M
Output
$10.00/M
Capabilities
Streaming
Vision
Tools
JSON Output
Gemini 1.5 Flash
google
Model Deactivated
gemini-1.5-flash
Providers
Google AI Studio
google-ai-studio/gemini-1.5-flash
Context Size
1M
Stability
stable
Deactivated since Sep 20, 2025
Pricing
Input
$0.04/M
Cached
—/M
Output
$0.15/M
Capabilities
Streaming
Vision
Tools
Reasoning
Reasoning Budget
JSON Output
Structured JSON Output
Llama 3 70B Instruct
meta
llama-3-70b-instruct
Providers
NovitaAI
novita/llama-3-70b-instruct
Context Size
8.2k
Stability
stable
Pricing
Input
$0.51/M
Cached
—/M
Output
$0.74/M
Capabilities
Streaming
JSON Output
Claude 3 Haiku
anthropic
claude-3-haiku
Providers
Anthropic
anthropic/claude-3-haiku
Context Size
200k
Stability
stable
Pricing
Input
$0.25/M
Cached
$0.03/M
Output
$1.25/M
Capabilities
Streaming
Vision
Tools
Claude 3 Opus
anthropic
claude-3-opus
Providers
Anthropic
anthropic/claude-3-opus
Context Size
200k
Stability
stable
Pricing
Input
$15.00/M
Cached
$1.50/M
Output
$75.00/M
Capabilities
Streaming
Vision
Tools
Auto Route
llmgateway
auto
Providers
LLM Gateway
llmgateway/auto
Context Size
—
Stability
stable
Pricing
Input
—/M
Cached
—/M
Output
—/M
Capabilities
Streaming
Vision
Tools
JSON Output
Custom Model
llmgateway
custom
Providers
LLM Gateway
llmgateway/custom
Context Size
—
Stability
stable
Pricing
Input
—/M
Cached
—/M
Output
—/M
Capabilities
Streaming
Vision
Tools
JSON Output
Mixtral 8x7B Instruct
mistral
mixtral-8x7b-instruct-together
Providers
Together AI
together.ai/mixtral-8x7b-instruct-together
Context Size
32.8k
Stability
stable
Pricing
Input
$0.06/M
Cached
—/M
Output
$0.06/M
Capabilities
Streaming
JSON Output
GPT-4 Turbo
openai
gpt-4-turbo
Providers
Azure
azure/gpt-4-turbo
Context Size
128k
Stability
unstable
Pricing
Input
$10.00/M
Cached
—/M
Output
$30.00/M
Capabilities
Streaming
Vision
Tools
JSON Output
Mistral 7B Instruct
mistral
Model Deactivated
mistral-7b-instruct-together
Providers
Together AI
together.ai/mistral-7b-instruct-together
Context Size
8.2k
Stability
stable
Deactivated since Nov 13, 2025
Pricing
Input
$0.06/M
Cached
—/M
Output
$0.06/M
Capabilities
Streaming
JSON Output
GPT-4
openai
gpt-4
Providers
Azure
azure/gpt-4
Context Size
8.2k
Stability
unstable
Pricing
Input
$30.00/M
Cached
—/M
Output
$60.00/M
Capabilities
Streaming
Tools
GPT-3.5 Turbo
openai
gpt-3.5-turbo
Providers
Azure
azure/gpt-3.5-turbo
Context Size
16.4k
Stability
unstable
Pricing
Input
$0.50/M
Cached
—/M
Output
$1.50/M
Capabilities
Streaming
Tools
JSON Output
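
Estimating request cost from the listed prices

The prices above are quoted per million tokens, and some models also list a flat per-request fee. As a rough sketch, the cost of one request can be estimated as below. The function name `estimate_cost` is hypothetical, and the way the fee is combined with token charges (simple addition, ignoring cached-token pricing and discounts) is an assumption rather than documented billing behavior.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens, output_tokens, input_per_m, output_per_m, per_request="0"):
    """Estimate the USD cost of one request from listed $/M-token prices.

    Assumption: token charges and the per-request fee simply add up.
    Prices are passed as strings so Decimal keeps them exact.
    """
    token_cost = (
        Decimal(input_tokens) * Decimal(input_per_m)
        + Decimal(output_tokens) * Decimal(output_per_m)
    ) / TOKENS_PER_MILLION
    return token_cost + Decimal(per_request)


# Sonar Pro as listed: $3.00/M input, $15.00/M output, $0.005 per request.
# Example request: 2,000 input tokens and 500 output tokens.
print(estimate_cost(2_000, 500, "3.00", "15.00", "0.005"))  # 0.0185
```

For discounted entries, pass the discounted price rather than the list price if the discount applies to your account.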