Support

AI-powered help

Welcome!

Please introduce yourself before we start.

    LLM Gateway
    • Docs
    • Pricing
    • Pricing
    • Docs
    • Models
    1.3k
    Log InGet Started

    AI Models Directory

    Browse and compare 180+ AI models from OpenAI, Anthropic, Google, and 30+ providers — filter by capabilities, pricing, and context size.

    Compare

    Use Case

    Capabilities

    Provider

    Input Price ($/M tokens)

    Output Price ($/M tokens)

    Context Size (tokens)

    109/256
    Models
    28/38
    Providers
    69
    Vision Models (filtered)
    106
    Tool-enabled (filtered)
    2
    Free Models (filtered)

    Qwen3.7 Plus

    alibaba
    qwen3.7-plus
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Alibaba Cloud
    Context: 1M
    Input
    $0.40
    /M tokens
    Cached
    $0.08
    /M tokens
    Output
    $1.60
    /M tokens
    Tiered Pricing
    IN
    CACHED
    OUT
    ≤256K tokens
    $0.40
    $0.08
    $1.60
    >256K tokens
    $1.20
    $0.24
    $4.80
    Get Started

    MiniMax M3

    minimax
    minimax-m3
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    MiniMax
    Context: 1.0M
    Input
    $0.60
    /M tokens
    Cached
    $0.12
    /M tokens
    Output
    $2.40
    /M tokens
    Get Started

    Claude Opus 4.8

    anthropic
    claude-opus-4-8
    Streaming
    Vision
    Tools
    Reasoning
    Structured JSON Output
    Native Web Search
    Anthropic
    Context: 1M
    Input
    $5.00
    /M tokens
    Cached
    $0.50
    /M tokens
    Output
    $25.00
    /M tokens
    + $0.010 per search
    Get Started

    Qwen3.7 Max

    alibaba50% off
    qwen3.7-max
    Streaming
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Alibaba Cloud
    Context: 1M50% off
    Input
    $1.72$0.86
    -50% off
    /M tokens
    Cached
    $0.34$0.17
    -50% off
    /M tokens
    Output
    $5.17$2.58
    -50% off
    /M tokens
    + $0.010$0.005 per search
    Get Started

    Grok Build 0.1

    xai
    grok-build-0-1
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    xAI
    Context: 256k
    Input
    $1.00
    /M tokens
    Cached
    $0.20
    /M tokens
    Output
    $2.00
    /M tokens
    Tiered Pricing
    IN
    CACHED
    OUT
    ≤200K tokens
    $1.00
    $0.20
    $2.00
    >200K tokens
    $2.00
    $0.40
    $4.00
    Get Started

    Gemini 3.5 Flash

    google
    gemini-3.5-flash
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    JSON Output
    Structured JSON Output
    Native Web Search
    Google AI Studio
    Context: 1.0M
    Input
    $1.50
    /M tokens
    Cached
    $0.15
    /M tokens
    Output
    $9.00
    /M tokens
    + $0.014 per search
    Get Started

    Grok 4.20 Reasoning

    xai
    grok-4-20-reasoning
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Vertex AI (OpenAI-compatible)
    Context: 2M
    Input
    $2.00
    /M tokens
    Cached
    $0.20
    /M tokens
    Output
    $6.00
    /M tokens
    Tiered Pricing
    IN
    CACHED
    OUT
    ≤200K tokens
    $2.00
    $0.20
    $6.00
    >200K tokens
    $4.00
    $0.40
    $12.00
    Get Started

    MiMo V2.5

    xiaomi
    mimo-v2.5
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Xiaomi
    Context: 1M
    Input
    $0.14
    /M tokens
    Cached
    $0.03
    /M tokens
    Output
    $0.28
    /M tokens
    Get Started

    MiMo V2 Pro

    xiaomi
    mimo-v2-pro
    Streaming
    Tools
    Reasoning
    JSON Output
    Xiaomi
    Context: 1M
    Input
    $1.00
    /M tokens
    Cached
    $0.20
    /M tokens
    Output
    $3.00
    /M tokens
    Get Started

    MiMo V2.5 Pro

    xiaomi
    mimo-v2.5-pro
    Streaming
    Tools
    Reasoning
    JSON Output
    Xiaomi
    Context: 1M
    Input
    $0.43
    /M tokens
    Cached
    $0.09
    /M tokens
    Output
    $0.87
    /M tokens
    Get Started

    Gemini 3.1 Flash Lite

    google
    gemini-3.1-flash-lite
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    JSON Output
    Structured JSON Output
    Native Web Search
    Google AI Studio
    Context: 1.0M
    Input
    $0.25
    /M tokens
    Cached
    $0.02
    /M tokens
    Output
    $1.50
    /M tokens
    + $0.014 per search
    Get Started

    Grok 4.3

    xai
    grok-4-3
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    xAI
    Context: 1M
    Input
    $1.25
    /M tokens
    Cached
    $0.31
    /M tokens
    Output
    $2.50
    /M tokens
    Tiered Pricing
    IN
    CACHED
    OUT
    ≤200K tokens
    $1.25
    $0.31
    $2.50
    >200K tokens
    $2.50
    $0.00
    $5.00
    Get Started

    Qwen3.6 35B A3B

    alibaba
    qwen3.6-35b-a3b
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Alibaba Cloud
    Context: 262.1k
    Input
    $0.25
    /M tokens
    Cached
    —
    /M tokens
    Output
    $1.48
    /M tokens
    + $0.010 per search
    Get Started

    Qwen3.6 Plus

    alibaba
    qwen3.6-plus
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Alibaba Cloud
    Context: 262.1k
    Input
    $0.50
    /M tokens
    Cached
    $0.05
    /M tokens
    Output
    $3.00
    /M tokens
    + $0.010 per search
    Get Started

    Qwen3.6 Max Preview

    alibaba
    qwen3.6-max-preview
    Streaming
    Tools
    Reasoning
    JSON Output
    Alibaba Cloud
    Context: 262.1k
    Input
    $1.30
    /M tokens
    Cached
    $0.13
    /M tokens
    Output
    $7.80
    /M tokens
    Get Started

    GPT-5.5 Pro

    openai
    gpt-5.5-pro
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Native Web Search
    OpenAI
    Context: 1.1M
    Input
    $30.00
    /M tokens
    Cached
    —
    /M tokens
    Output
    $180.00
    /M tokens
    + $0.010 per search
    Get Started

    GPT-5.5

    openai
    gpt-5.5
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    Native Web Search
    Azure
    Context: 1.1M
    Input
    $5.00
    /M tokens
    Cached
    $0.50
    /M tokens
    Output
    $30.00
    /M tokens
    + $0.010 per search
    Get Started

    DeepSeek V4 Flash

    deepseek
    deepseek-v4-flash
    Streaming
    Tools
    Reasoning
    JSON Output
    Alibaba Cloud
    Context: 1M
    Input
    $0.14
    /M tokens
    Cached
    $0.03
    /M tokens
    Output
    $0.28
    /M tokens
    Get Started

    DeepSeek V4 Pro

    deepseek
    deepseek-v4-pro
    Streaming
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    Alibaba Cloud
    Context: 1M
    Input
    $1.65
    /M tokens
    Cached
    $0.14
    /M tokens
    Output
    $3.30
    /M tokens
    Get Started

    Kimi K2.6

    moonshot
    kimi-k2.6
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    CanopyWave
    Context: 262.1k
    Input
    $0.50
    /M tokens
    Cached
    $0.10
    /M tokens
    Output
    $2.80
    /M tokens
    Get Started

    Claude Opus 4.7

    anthropic
    claude-opus-4-7
    Streaming
    Vision
    Tools
    Reasoning
    Structured JSON Output
    Native Web Search
    Anthropic
    Context: 1M
    Input
    $5.00
    /M tokens
    Cached
    $0.50
    /M tokens
    Output
    $25.00
    /M tokens
    + $0.010 per search
    Get Started

    GLM-5.1

    glm
    glm-5.1
    Streaming
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    Native Web Search
    DeepInfra
    Context: 198k
    Input
    $1.05
    /M tokens
    Cached
    $0.20
    /M tokens
    Output
    $3.50
    /M tokens
    Get Started

    MiMo V2 Flash

    xiaomi
    mimo-v2-flash
    Streaming
    Tools
    Reasoning
    JSON Output
    Xiaomi
    Context: 256k
    Input
    $0.10
    /M tokens
    Cached
    $0.02
    /M tokens
    Output
    $0.30
    /M tokens
    Get Started

    MiniMax M2.5 Highspeed

    minimax
    minimax-m2.5-highspeed
    Streaming
    Tools
    Reasoning
    MiniMax
    Context: 204.8k
    Input
    $0.60
    /M tokens
    Cached
    $0.03
    /M tokens
    Output
    $2.40
    /M tokens
    Get Started

    MiniMax M2.7 Highspeed

    minimax
    minimax-m2.7-highspeed
    Streaming
    Tools
    Reasoning
    MiniMax
    Context: 204.8k
    Input
    $0.60
    /M tokens
    Cached
    $0.06
    /M tokens
    Output
    $2.40
    /M tokens
    Get Started

    MiniMax M2.7

    minimax
    minimax-m2.7
    Streaming
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    MiniMax
    Context: 204.8k
    Input
    $0.30
    /M tokens
    Cached
    $0.06
    /M tokens
    Output
    $1.20
    /M tokens
    Get Started

    Gemini Pro Latest

    google
    gemini-pro-latest
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    JSON Output
    Structured JSON Output
    Native Web Search
    Google AI Studio
    Context: 1.0M
    Input
    $2.00
    /M tokens
    Cached
    $0.20
    /M tokens
    Output
    $12.00
    /M tokens
    Tiered Pricing
    IN
    CACHED
    OUT
    ≤200K tokens
    $2.00
    $0.20
    $12.00
    >200K tokens
    $4.00
    $0.40
    $18.00
    + $0.014 per search
    Get Started

    GPT-5.4 Nano

    openai
    gpt-5.4-nano
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    Native Web Search
    Azure
    Context: 400k
    Input
    $0.20
    /M tokens
    Cached
    $0.02
    /M tokens
    Output
    $1.25
    /M tokens
    + $0.010 per search
    Get Started

    GPT-5.4 Mini

    openai
    gpt-5.4-mini
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    Native Web Search
    Azure
    Context: 400k
    Input
    $0.75
    /M tokens
    Cached
    $0.07
    /M tokens
    Output
    $4.50
    /M tokens
    + $0.010 per search
    Get Started

    Grok 4.20 Beta Reasoning (0309)

    xai
    grok-4-20-beta-0309-reasoning
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    xAI
    Context: 2M
    Input
    $2.00
    /M tokens
    Cached
    $0.20
    /M tokens
    Output
    $6.00
    /M tokens
    Tiered Pricing
    IN
    CACHED
    OUT
    ≤200K tokens
    $2.00
    $0.20
    $6.00
    >200K tokens
    $4.00
    $0.40
    $12.00
    Get Started

    Grok 4.20 Multi-Agent Beta (0309)

    xaiModel Deactivated
    grok-4-20-multi-agent-beta-0309
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    xAI
    Context: 2M
    Deactivated since Mar 27, 2026
    Input
    $2.00
    /M tokens
    Cached
    $0.20
    /M tokens
    Output
    $6.00
    /M tokens
    Tiered Pricing
    IN
    CACHED
    OUT
    ≤200K tokens
    $2.00
    $0.20
    $6.00
    >200K tokens
    $4.00
    $0.40
    $12.00
    Get Started

    GPT-5.3 Codex

    openai
    gpt-5.3-codex
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Azure
    Context: 400k
    Input
    $1.75
    /M tokens
    Cached
    $0.17
    /M tokens
    Output
    $14.00
    /M tokens
    Get Started

    GPT-5.2 Codex

    openai
    gpt-5.2-codex
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Azure
    Context: 400k
    Input
    $1.75
    /M tokens
    Cached
    $0.17
    /M tokens
    Output
    $14.00
    /M tokens
    Get Started

    o4 Mini

    openai
    o4-mini
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    Azure
    Context: 200k
    Input
    $1.10
    /M tokens
    Cached
    $0.28
    /M tokens
    Output
    $4.40
    /M tokens
    Get Started

    Grok 4.1 Fast

    xai
    grok-4-1-fast
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Azure AI Foundry
    Context: 2M
    Input
    $0.20
    /M tokens
    Cached
    —
    /M tokens
    Output
    $0.50
    /M tokens
    Get Started

    Grok 4 Fast

    xaiModel Deactivated
    grok-4-fast
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    xAI
    Context: 2M
    Deactivated since May 15, 2026
    Input
    $0.20
    /M tokens
    Cached
    $0.05
    /M tokens
    Output
    $0.50
    /M tokens
    Get Started

    GPT-5.4 Pro

    openai
    gpt-5.4-pro
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Azure
    Context: 1.1M
    Input
    $30.00
    /M tokens
    Cached
    —
    /M tokens
    Output
    $180.00
    /M tokens
    + $0.010 per search
    Get Started

    GPT-5.4

    openai
    gpt-5.4
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    Native Web Search
    Azure
    Context: 1.1M
    Input
    $2.50
    /M tokens
    Cached
    $0.25
    /M tokens
    Output
    $15.00
    /M tokens
    + $0.010 per search
    Get Started

    Qwen3.5 397B A17B

    alibaba
    qwen35-397b-a17b
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Alibaba Cloud
    Context: 262.1k
    Input
    $0.17
    /M tokens
    Cached
    —
    /M tokens
    Output
    $1.03
    /M tokens
    Tiered Pricing
    IN
    OUT
    ≤128K tokens
    $0.17
    $1.03
    ≤256K tokens
    $0.43
    $2.58
    + $0.010 per search
    Get Started

    Gemini 3.1 Pro (Preview)

    google
    gemini-3.1-pro-preview
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    JSON Output
    Structured JSON Output
    Native Web Search
    Google AI Studio
    Context: 1.0M
    Input
    $2.00
    /M tokens
    Cached
    $0.20
    /M tokens
    Output
    $12.00
    /M tokens
    Tiered Pricing
    IN
    CACHED
    OUT
    ≤200K tokens
    $2.00
    $0.20
    $12.00
    >200K tokens
    $4.00
    $0.40
    $18.00
    + $0.014 per search
    Get Started

    Claude Sonnet 4.6

    anthropic
    claude-sonnet-4-6
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    Structured JSON Output
    Native Web Search
    Anthropic
    Context: 1M
    Input
    $3.00
    /M tokens
    Cached
    $0.30
    /M tokens
    Output
    $15.00
    /M tokens
    + $0.010 per search
    Get Started

    GLM-5

    glm
    glm-5
    Streaming
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    Native Web Search
    Alibaba Cloud
    Context: 202.8k
    Input
    $0.57
    /M tokens
    Cached
    —
    /M tokens
    Output
    $2.58
    /M tokens
    Tiered Pricing
    IN
    OUT
    ≤32K tokens
    $0.57
    $2.58
    >32K tokens
    $0.86
    $3.15
    Get Started

    MiniMax M2.5

    minimax
    minimax-m2.5
    Streaming
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    EmberCloud
    Context: 196.6k
    Deactivated since Jun 3, 2026
    Input
    $0.20
    /M tokens
    Cached
    $0.04
    /M tokens
    Output
    $1.20
    /M tokens
    Get Started

    Claude Opus 4.6

    anthropic
    claude-opus-4-6
    Streaming
    Vision
    Tools
    Reasoning
    Structured JSON Output
    Native Web Search
    Anthropic
    Context: 1M
    Input
    $5.00
    /M tokens
    Cached
    $0.50
    /M tokens
    Output
    $25.00
    /M tokens
    + $0.010 per search
    Get Started

    Qwen3 VL 30B A3B Thinking

    alibaba
    qwen3-vl-30b-a3b-thinking
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    NovitaAI
    Context: 131.1k
    Input
    $0.20
    /M tokens
    Cached
    —
    /M tokens
    Output
    $1.00
    /M tokens
    Get Started

    Kimi K2.5

    moonshot
    kimi-k2.5
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Alibaba Cloud
    Context: 262.1k
    Input
    $0.57
    /M tokens
    Cached
    —
    /M tokens
    Output
    $3.01
    /M tokens
    Get Started

    Qwen3 Max 2026-01-23

    alibaba
    qwen3-max-2026-01-23
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Alibaba Cloud
    Context: 262.1k
    Input
    $0.36
    /M tokens
    Cached
    $0.07
    /M tokens
    Output
    $1.43
    /M tokens
    Tiered Pricing
    IN
    CACHED
    OUT
    ≤32K tokens
    $0.36
    $0.07
    $1.43
    ≤128K tokens
    $0.57
    $0.11
    $2.29
    ≤252K tokens
    $1.00
    $0.20
    $4.01
    Get Started

    Qwen3 VL 235B A22B Thinking

    alibaba
    qwen3-vl-235b-a22b-thinking
    Streaming
    Vision
    Reasoning
    Alibaba Cloud
    Context: 131.1k
    Input
    $0.50
    /M tokens
    Cached
    —
    /M tokens
    Output
    $2.00
    /M tokens
    Get Started

    QwQ Plus

    alibaba
    qwq-plus
    Streaming
    Reasoning
    Alibaba Cloud
    Context: 131.1k
    Input
    $0.23
    /M tokens
    Cached
    —
    /M tokens
    Output
    $0.57
    /M tokens
    Get Started

    MiniMax Text 01

    minimax
    minimax-text-01
    Streaming
    Tools
    Reasoning
    MiniMax
    Context: 1M
    Input
    $0.20
    /M tokens
    Cached
    —
    /M tokens
    Output
    $1.10
    /M tokens
    Get Started
    Page 1 of 3

    Newsletter

    Stay ahead of the curve

    Join developers who get weekly insights on LLM routing, new model launches, and cost optimization — straight to their inbox.

    • New models & providers as they drop
    • Tips to cut latency & costs
    • Early access to beta features

    No spam. Unsubscribe anytime.

    All systems operational

    Product

    • Features
    • Models
    • Providers
    • Chat Playground
    • Changelog
    • DevPass
    • Compare Models
    • Enterprise

    Resources

    • Apps
    • Templates
    • Agents
    • MCP Server
    • Use Cases
    • Blog
    • Documentation
    • Integrations
    • Guides
    • Brand Assets
    • Token Cost Calculator
    • Referral Program
    • GitHub
    • Contact Us

    Community

    • Twitter
    • Discord

    Compare

    • OpenRouter
    • LiteLLM
    • Portkey
    • Migration Guides

    Models

    • Text Generation
    • Text to Image
    • Image to Image
    • Video Generation
    • Embeddings
    • Vision
    • Reasoning
    • Tool Calling
    • Web Search
    • Discounted

    Providers

    • OpenAI
    • Anthropic
    • Google AI Studio
    • Glacier
    • Google Vertex AI
    • Vertex AI (OpenAI-compatible)
    • Vertex AI (Anthropic)
    • Quartz
    • Avalanche
    • Groq
    • Cerebras
    • xAI
    • DeepSeek
    • Alibaba Cloud
    • NovitaAI
    • AWS Bedrock
    • Azure
    • Azure AI Foundry
    • Z AI
    • Moonshot AI
    • Perplexity
    • Nebius AI
    • Mistral AI
    • CanopyWave
    • Inference.net
    • Together AI
    • Custom
    • NanoGPT
    • ByteDance
    • MiniMax
    • EmberCloud
    • Xiaomi
    • DeepInfra

    © 2026 LLM Gateway. All rights reserved.

    Privacy PolicyTerms of Use