AI Models Directory
Browse and compare 180+ AI models from OpenAI, Anthropic, Google, and 30+ providers — filter by capabilities, pricing, and context size.
Use Case
Capabilities
Provider
Input Price ($/M tokens)
Output Price ($/M tokens)
Context Size (tokens)
109/256
Models
28/38
Providers
69
Vision Models (filtered)
106
Tool-enabled (filtered)
2
Free Models (filtered)
Qwen3.7 Plus
alibaba
qwen3.7-plusStreaming
Vision
Tools
Reasoning
JSON Output
Alibaba Cloud
Context: 1M
Input
$0.40
/M tokens
Cached
$0.08
/M tokens
Output
$1.60
/M tokens
Tiered Pricing
IN
CACHED
OUT
≤256K tokens
$0.40
$0.08
$1.60
>256K tokens
$1.20
$0.24
$4.80
MiniMax M3
minimax
minimax-m3Streaming
Vision
Tools
Reasoning
JSON Output
MiniMax
Context: 1.0M
Input
$0.60
/M tokens
Cached
$0.12
/M tokens
Output
$2.40
/M tokens
Claude Opus 4.8
anthropic
claude-opus-4-8Streaming
Vision
Tools
Reasoning
Structured JSON Output
Native Web Search
Anthropic
Context: 1M
Input
$5.00
/M tokens
Cached
$0.50
/M tokens
Output
$25.00
/M tokens
+ $0.010 per search
Qwen3.7 Max
alibaba50% off
qwen3.7-maxStreaming
Tools
Reasoning
JSON Output
Native Web Search
Alibaba Cloud
Context: 1M50% off
Input
$1.72$0.86
-50% off/M tokens
Cached
$0.34$0.17
-50% off/M tokens
Output
$5.17$2.58
-50% off/M tokens
+ $0.010$0.005 per search
Grok Build 0.1
xai
grok-build-0-1Streaming
Vision
Tools
Reasoning
JSON Output
xAI
Context: 256k
Input
$1.00
/M tokens
Cached
$0.20
/M tokens
Output
$2.00
/M tokens
Tiered Pricing
IN
CACHED
OUT
≤200K tokens
$1.00
$0.20
$2.00
>200K tokens
$2.00
$0.40
$4.00
Gemini 3.5 Flash
google
gemini-3.5-flashStreaming
Vision
Tools
Reasoning
Reasoning Budget
JSON Output
Structured JSON Output
Native Web Search
Google AI Studio
Context: 1.0M
Input
$1.50
/M tokens
Cached
$0.15
/M tokens
Output
$9.00
/M tokens
+ $0.014 per search
Grok 4.20 Reasoning
xai
grok-4-20-reasoningStreaming
Vision
Tools
Reasoning
JSON Output
Vertex AI (OpenAI-compatible)
Context: 2M
Input
$2.00
/M tokens
Cached
$0.20
/M tokens
Output
$6.00
/M tokens
Tiered Pricing
IN
CACHED
OUT
≤200K tokens
$2.00
$0.20
$6.00
>200K tokens
$4.00
$0.40
$12.00
MiMo V2.5
xiaomi
mimo-v2.5Streaming
Vision
Tools
Reasoning
JSON Output
Xiaomi
Context: 1M
Input
$0.14
/M tokens
Cached
$0.03
/M tokens
Output
$0.28
/M tokens
MiMo V2 Pro
xiaomi
mimo-v2-proStreaming
Tools
Reasoning
JSON Output
Xiaomi
Context: 1M
Input
$1.00
/M tokens
Cached
$0.20
/M tokens
Output
$3.00
/M tokens
MiMo V2.5 Pro
xiaomi
mimo-v2.5-proStreaming
Tools
Reasoning
JSON Output
Xiaomi
Context: 1M
Input
$0.43
/M tokens
Cached
$0.09
/M tokens
Output
$0.87
/M tokens
Gemini 3.1 Flash Lite
google
gemini-3.1-flash-liteStreaming
Vision
Tools
Reasoning
Reasoning Budget
JSON Output
Structured JSON Output
Native Web Search
Google AI Studio
Context: 1.0M
Input
$0.25
/M tokens
Cached
$0.02
/M tokens
Output
$1.50
/M tokens
+ $0.014 per search
Grok 4.3
xai
grok-4-3Streaming
Vision
Tools
Reasoning
JSON Output
xAI
Context: 1M
Input
$1.25
/M tokens
Cached
$0.31
/M tokens
Output
$2.50
/M tokens
Tiered Pricing
IN
CACHED
OUT
≤200K tokens
$1.25
$0.31
$2.50
>200K tokens
$2.50
$0.00
$5.00
Qwen3.6 35B A3B
alibaba
qwen3.6-35b-a3bStreaming
Vision
Tools
Reasoning
JSON Output
Native Web Search
Alibaba Cloud
Context: 262.1k
Input
$0.25
/M tokens
Cached
—
/M tokens
Output
$1.48
/M tokens
+ $0.010 per search
Qwen3.6 Plus
alibaba
qwen3.6-plusStreaming
Vision
Tools
Reasoning
JSON Output
Native Web Search
Alibaba Cloud
Context: 262.1k
Input
$0.50
/M tokens
Cached
$0.05
/M tokens
Output
$3.00
/M tokens
+ $0.010 per search
Qwen3.6 Max Preview
alibaba
qwen3.6-max-previewStreaming
Tools
Reasoning
JSON Output
Alibaba Cloud
Context: 262.1k
Input
$1.30
/M tokens
Cached
$0.13
/M tokens
Output
$7.80
/M tokens
GPT-5.5 Pro
openai
gpt-5.5-proStreaming
Vision
Tools
Reasoning
JSON Output
Native Web Search
OpenAI
Context: 1.1M
Input
$30.00
/M tokens
Cached
—
/M tokens
Output
$180.00
/M tokens
+ $0.010 per search
GPT-5.5
openai
gpt-5.5Streaming
Vision
Tools
Reasoning
JSON Output
Structured JSON Output
Native Web Search
Azure
Context: 1.1M
Input
$5.00
/M tokens
Cached
$0.50
/M tokens
Output
$30.00
/M tokens
+ $0.010 per search
DeepSeek V4 Flash
deepseek
deepseek-v4-flashStreaming
Tools
Reasoning
JSON Output
Alibaba Cloud
Context: 1M
Input
$0.14
/M tokens
Cached
$0.03
/M tokens
Output
$0.28
/M tokens
DeepSeek V4 Pro
deepseek
deepseek-v4-proStreaming
Tools
Reasoning
JSON Output
Structured JSON Output
Alibaba Cloud
Context: 1M
Input
$1.65
/M tokens
Cached
$0.14
/M tokens
Output
$3.30
/M tokens
Kimi K2.6
moonshot
kimi-k2.6Streaming
Vision
Tools
Reasoning
JSON Output
CanopyWave
Context: 262.1k
Input
$0.50
/M tokens
Cached
$0.10
/M tokens
Output
$2.80
/M tokens
Claude Opus 4.7
anthropic
claude-opus-4-7Streaming
Vision
Tools
Reasoning
Structured JSON Output
Native Web Search
Anthropic
Context: 1M
Input
$5.00
/M tokens
Cached
$0.50
/M tokens
Output
$25.00
/M tokens
+ $0.010 per search
GLM-5.1
glm
glm-5.1Streaming
Tools
Reasoning
JSON Output
Structured JSON Output
Native Web Search
DeepInfra
Context: 198k
Input
$1.05
/M tokens
Cached
$0.20
/M tokens
Output
$3.50
/M tokens
MiMo V2 Flash
xiaomi
mimo-v2-flashStreaming
Tools
Reasoning
JSON Output
Xiaomi
Context: 256k
Input
$0.10
/M tokens
Cached
$0.02
/M tokens
Output
$0.30
/M tokens
MiniMax M2.5 Highspeed
minimax
minimax-m2.5-highspeedStreaming
Tools
Reasoning
MiniMax
Context: 204.8k
Input
$0.60
/M tokens
Cached
$0.03
/M tokens
Output
$2.40
/M tokens
MiniMax M2.7 Highspeed
minimax
minimax-m2.7-highspeedStreaming
Tools
Reasoning
MiniMax
Context: 204.8k
Input
$0.60
/M tokens
Cached
$0.06
/M tokens
Output
$2.40
/M tokens
MiniMax M2.7
minimax
minimax-m2.7Streaming
Tools
Reasoning
JSON Output
Structured JSON Output
MiniMax
Context: 204.8k
Input
$0.30
/M tokens
Cached
$0.06
/M tokens
Output
$1.20
/M tokens
Gemini Pro Latest
google
gemini-pro-latestStreaming
Vision
Tools
Reasoning
Reasoning Budget
JSON Output
Structured JSON Output
Native Web Search
Google AI Studio
Context: 1.0M
Input
$2.00
/M tokens
Cached
$0.20
/M tokens
Output
$12.00
/M tokens
Tiered Pricing
IN
CACHED
OUT
≤200K tokens
$2.00
$0.20
$12.00
>200K tokens
$4.00
$0.40
$18.00
+ $0.014 per search
GPT-5.4 Nano
openai
gpt-5.4-nanoStreaming
Vision
Tools
Reasoning
JSON Output
Structured JSON Output
Native Web Search
Azure
Context: 400k
Input
$0.20
/M tokens
Cached
$0.02
/M tokens
Output
$1.25
/M tokens
+ $0.010 per search
GPT-5.4 Mini
openai
gpt-5.4-miniStreaming
Vision
Tools
Reasoning
JSON Output
Structured JSON Output
Native Web Search
Azure
Context: 400k
Input
$0.75
/M tokens
Cached
$0.07
/M tokens
Output
$4.50
/M tokens
+ $0.010 per search
Grok 4.20 Beta Reasoning (0309)
xai
grok-4-20-beta-0309-reasoningStreaming
Vision
Tools
Reasoning
JSON Output
xAI
Context: 2M
Input
$2.00
/M tokens
Cached
$0.20
/M tokens
Output
$6.00
/M tokens
Tiered Pricing
IN
CACHED
OUT
≤200K tokens
$2.00
$0.20
$6.00
>200K tokens
$4.00
$0.40
$12.00
Grok 4.20 Multi-Agent Beta (0309)
xaiModel Deactivated
grok-4-20-multi-agent-beta-0309Streaming
Vision
Tools
Reasoning
JSON Output
xAI
Context: 2M
Deactivated since Mar 27, 2026
Input
$2.00
/M tokens
Cached
$0.20
/M tokens
Output
$6.00
/M tokens
Tiered Pricing
IN
CACHED
OUT
≤200K tokens
$2.00
$0.20
$6.00
>200K tokens
$4.00
$0.40
$12.00
GPT-5.3 Codex
openai
gpt-5.3-codexStreaming
Vision
Tools
Reasoning
JSON Output
Native Web Search
Azure
Context: 400k
Input
$1.75
/M tokens
Cached
$0.17
/M tokens
Output
$14.00
/M tokens
GPT-5.2 Codex
openai
gpt-5.2-codexStreaming
Vision
Tools
Reasoning
JSON Output
Native Web Search
Azure
Context: 400k
Input
$1.75
/M tokens
Cached
$0.17
/M tokens
Output
$14.00
/M tokens
o4 Mini
openai
o4-miniStreaming
Vision
Tools
Reasoning
JSON Output
Structured JSON Output
Azure
Context: 200k
Input
$1.10
/M tokens
Cached
$0.28
/M tokens
Output
$4.40
/M tokens
Grok 4.1 Fast
xai
grok-4-1-fastStreaming
Vision
Tools
Reasoning
JSON Output
Azure AI Foundry
Context: 2M
Input
$0.20
/M tokens
Cached
—
/M tokens
Output
$0.50
/M tokens
Grok 4 Fast
xaiModel Deactivated
grok-4-fastStreaming
Vision
Tools
Reasoning
JSON Output
xAI
Context: 2M
Deactivated since May 15, 2026
Input
$0.20
/M tokens
Cached
$0.05
/M tokens
Output
$0.50
/M tokens
GPT-5.4 Pro
openai
gpt-5.4-proStreaming
Vision
Tools
Reasoning
JSON Output
Native Web Search
Azure
Context: 1.1M
Input
$30.00
/M tokens
Cached
—
/M tokens
Output
$180.00
/M tokens
+ $0.010 per search
GPT-5.4
openai
gpt-5.4Streaming
Vision
Tools
Reasoning
JSON Output
Structured JSON Output
Native Web Search
Azure
Context: 1.1M
Input
$2.50
/M tokens
Cached
$0.25
/M tokens
Output
$15.00
/M tokens
+ $0.010 per search
Qwen3.5 397B A17B
alibaba
qwen35-397b-a17bStreaming
Vision
Tools
Reasoning
JSON Output
Native Web Search
Alibaba Cloud
Context: 262.1k
Input
$0.17
/M tokens
Cached
—
/M tokens
Output
$1.03
/M tokens
Tiered Pricing
IN
OUT
≤128K tokens
$0.17
$1.03
≤256K tokens
$0.43
$2.58
+ $0.010 per search
Gemini 3.1 Pro (Preview)
google
gemini-3.1-pro-previewStreaming
Vision
Tools
Reasoning
Reasoning Budget
JSON Output
Structured JSON Output
Native Web Search
Google AI Studio
Context: 1.0M
Input
$2.00
/M tokens
Cached
$0.20
/M tokens
Output
$12.00
/M tokens
Tiered Pricing
IN
CACHED
OUT
≤200K tokens
$2.00
$0.20
$12.00
>200K tokens
$4.00
$0.40
$18.00
+ $0.014 per search
Claude Sonnet 4.6
anthropic
claude-sonnet-4-6Streaming
Vision
Tools
Reasoning
Reasoning Budget
Structured JSON Output
Native Web Search
Anthropic
Context: 1M
Input
$3.00
/M tokens
Cached
$0.30
/M tokens
Output
$15.00
/M tokens
+ $0.010 per search
GLM-5
glm
glm-5Streaming
Tools
Reasoning
JSON Output
Structured JSON Output
Native Web Search
Alibaba Cloud
Context: 202.8k
Input
$0.57
/M tokens
Cached
—
/M tokens
Output
$2.58
/M tokens
Tiered Pricing
IN
OUT
≤32K tokens
$0.57
$2.58
>32K tokens
$0.86
$3.15
MiniMax M2.5
minimax
minimax-m2.5Streaming
Tools
Reasoning
JSON Output
Structured JSON Output
EmberCloud
Context: 196.6k
Deactivated since Jun 3, 2026
Input
$0.20
/M tokens
Cached
$0.04
/M tokens
Output
$1.20
/M tokens
Claude Opus 4.6
anthropic
claude-opus-4-6Streaming
Vision
Tools
Reasoning
Structured JSON Output
Native Web Search
Anthropic
Context: 1M
Input
$5.00
/M tokens
Cached
$0.50
/M tokens
Output
$25.00
/M tokens
+ $0.010 per search
Qwen3 VL 30B A3B Thinking
alibaba
qwen3-vl-30b-a3b-thinkingStreaming
Vision
Tools
Reasoning
JSON Output
NovitaAI
Context: 131.1k
Input
$0.20
/M tokens
Cached
—
/M tokens
Output
$1.00
/M tokens
Kimi K2.5
moonshot
kimi-k2.5Streaming
Vision
Tools
Reasoning
JSON Output
Alibaba Cloud
Context: 262.1k
Input
$0.57
/M tokens
Cached
—
/M tokens
Output
$3.01
/M tokens
Qwen3 Max 2026-01-23
alibaba
qwen3-max-2026-01-23Streaming
Vision
Tools
Reasoning
JSON Output
Alibaba Cloud
Context: 262.1k
Input
$0.36
/M tokens
Cached
$0.07
/M tokens
Output
$1.43
/M tokens
Tiered Pricing
IN
CACHED
OUT
≤32K tokens
$0.36
$0.07
$1.43
≤128K tokens
$0.57
$0.11
$2.29
≤252K tokens
$1.00
$0.20
$4.01
Qwen3 VL 235B A22B Thinking
alibaba
qwen3-vl-235b-a22b-thinkingStreaming
Vision
Reasoning
Alibaba Cloud
Context: 131.1k
Input
$0.50
/M tokens
Cached
—
/M tokens
Output
$2.00
/M tokens
QwQ Plus
alibaba
qwq-plusStreaming
Reasoning
Alibaba Cloud
Context: 131.1k
Input
$0.23
/M tokens
Cached
—
/M tokens
Output
$0.57
/M tokens
MiniMax Text 01
minimax
minimax-text-01Streaming
Tools
Reasoning
MiniMax
Context: 1M
Input
$0.20
/M tokens
Cached
—
/M tokens
Output
$1.10
/M tokens