Support

AI-powered help

Welcome!

Please introduce yourself before we start.

    LLM Gateway
    • Docs
    • Pricing
    • Pricing
    • Docs
    • Models
    1.1k
    Log InGet Started

    Models

    Comprehensive list of all supported models and their providers

    Compare

    Use Case

    Capabilities

    Provider

    Input Price ($/M tokens)

    Output Price ($/M tokens)

    Context Size (tokens)

    227/227
    Models
    30/34
    Providers
    107
    Vision Models (filtered)
    144
    Tool-enabled (filtered)
    3
    Free Models (filtered)

    Kimi K2.6

    moonshot30% off
    kimi-k2.6
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    CanopyWave
    Context: 262.1k30% off
    Input
    $0.50$0.35
    -30% off
    /M tokens
    Cached
    $0.10$0.07
    -30% off
    /M tokens
    Output
    $2.80$1.96
    -30% off
    /M tokens
    Get Started

    Claude Opus 4.7

    anthropic30% off
    claude-opus-4-7
    Streaming
    Vision
    Tools
    Reasoning
    Structured JSON Output
    Native Web Search
    Anthropic
    Context: 1M
    Input
    $5.00
    /M tokens
    Cached
    $0.50
    /M tokens
    Output
    $25.00
    /M tokens
    + $0.010 per search
    Get Started

    GLM-5.1

    glm10% off
    glm-5.1
    Streaming
    Tools
    Reasoning
    JSON Output
    Native Web Search
    NovitaAI
    Context: 204.8k
    Input
    $1.40
    /M tokens
    Cached
    $0.26
    /M tokens
    Output
    $4.40
    /M tokens
    Get Started

    Mimo V2 Flash

    mimo
    mimo-v2-flash
    Streaming
    CanopyWave
    Context: 256k
    Input
    $0.08
    /M tokens
    Cached
    $0.04
    /M tokens
    Output
    $0.24
    /M tokens
    Get Started

    Sora 2 Pro

    openaiModel Deactivated
    sora-2-pro
    Video Generation
    Avalanche
    Context: 32.8k
    Deactivated since Mar 24, 2026
    Per Second Pricing
    720p$0.24/sec
    hd$0.4/sec
    Get Started

    Sora 2

    openaiModel Deactivated
    sora-2
    Video Generation
    Avalanche
    Context: 32.8k
    Deactivated since Mar 24, 2026
    Per Second Pricing
    720p$0.08/sec
    Get Started

    Qwen3 Coder Next

    alibaba
    qwen3-coder-next
    Streaming
    Tools
    EmberCloud
    Context: 262.1k
    Input
    $0.11
    /M tokens
    Cached
    $0.06
    /M tokens
    Output
    $0.68
    /M tokens
    Get Started

    Veo 3.1 Fast

    google
    veo-3.1-fast-generate-preview
    Video Generation
    Avalanche
    Context: 32.8k
    Per Second Pricing
    Default$0.15/sec
    Get Started

    Veo 3.1

    google20% off
    veo-3.1-generate-preview
    Video Generation
    Avalanche
    Context: 32.8k20% off
    Per Second Pricing
    Video / Audio$0.2 – $0.4/sec
    Get Started

    MiniMax M2.5 Highspeed

    minimax
    minimax-m2.5-highspeed
    Streaming
    Reasoning
    MiniMax
    Context: 204.8k
    Input
    $0.60
    /M tokens
    Cached
    $0.03
    /M tokens
    Output
    $2.40
    /M tokens
    Get Started

    MiniMax M2.7 Highspeed

    minimax
    minimax-m2.7-highspeed
    Streaming
    Reasoning
    MiniMax
    Context: 204.8k
    Input
    $0.60
    /M tokens
    Cached
    $0.06
    /M tokens
    Output
    $2.40
    /M tokens
    Get Started

    MiniMax M2.7

    minimax
    minimax-m2.7
    Streaming
    Reasoning
    Tools
    JSON Output
    Structured JSON Output
    MiniMax
    Context: 204.8k
    Input
    $0.30
    /M tokens
    Cached
    $0.06
    /M tokens
    Output
    $1.20
    /M tokens
    Get Started

    Gemini Pro Latest

    google
    gemini-pro-latest
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    JSON Output
    Structured JSON Output
    Native Web Search
    Google AI Studio
    Context: 1.0M
    Input
    $2.00
    /M tokens
    Cached
    $0.20
    /M tokens
    Output
    $12.00
    /M tokens
    + $0.014 per search
    Get Started

    GPT-5.4 Nano

    openai30% off
    gpt-5.4-nano
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    Native Web Search
    Azure
    Context: 400k30% off
    Input
    $0.20$0.14
    -30% off
    /M tokens
    Cached
    $0.02$0.01
    -30% off
    /M tokens
    Output
    $1.25$0.88
    -30% off
    /M tokens
    + $0.010 per search
    Get Started

    GPT-5.4 Mini

    openai30% off
    gpt-5.4-mini
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    Native Web Search
    Azure
    Context: 400k30% off
    Input
    $0.75$0.52
    -30% off
    /M tokens
    Cached
    $0.07$0.05
    -30% off
    /M tokens
    Output
    $4.50$3.15
    -30% off
    /M tokens
    + $0.010 per search
    Get Started

    Grok 4.20 Beta Non-Reasoning (0309)

    xai
    grok-4-20-beta-0309-non-reasoning
    Streaming
    Vision
    Tools
    JSON Output
    xAI
    Context: 2M
    Input
    $2.00
    /M tokens
    Cached
    $0.20
    /M tokens
    Output
    $6.00
    /M tokens
    Get Started

    Grok 4.20 Beta Reasoning (0309)

    xai
    grok-4-20-beta-0309-reasoning
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    xAI
    Context: 2M
    Input
    $2.00
    /M tokens
    Cached
    $0.20
    /M tokens
    Output
    $6.00
    /M tokens
    Get Started

    Grok 4.20 Multi-Agent Beta (0309)

    xaiModel Deactivated
    grok-4-20-multi-agent-beta-0309
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    xAI
    Context: 2M
    Deactivated since Mar 27, 2026
    Input
    $2.00
    /M tokens
    Cached
    $0.20
    /M tokens
    Output
    $6.00
    /M tokens
    Get Started

    GPT-5.3 Chat

    openai
    gpt-5.3-chat-latest
    Streaming
    Vision
    Tools
    Reasoning
    Native Web Search
    Azure
    Context: 128k
    Input
    $1.75
    /M tokens
    Cached
    $0.17
    /M tokens
    Output
    $14.00
    /M tokens
    Get Started

    GPT-5.3 Codex

    openai
    gpt-5.3-codex
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Azure
    Context: 400k
    Input
    $1.75
    /M tokens
    Cached
    $0.17
    /M tokens
    Output
    $14.00
    /M tokens
    Get Started

    GPT-5.2 Codex

    openai
    gpt-5.2-codex
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Azure
    Context: 400k
    Input
    $1.75
    /M tokens
    Cached
    $0.17
    /M tokens
    Output
    $14.00
    /M tokens
    Get Started

    o4 Mini

    openai
    o4-mini
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    Azure
    Context: 200k
    Input
    $1.10
    /M tokens
    Cached
    $0.28
    /M tokens
    Output
    $4.40
    /M tokens
    Get Started

    Grok 4.1 Fast

    xai
    grok-4-1-fast
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    xAI
    Context: 2M
    Input
    $0.20
    /M tokens
    Cached
    $0.05
    /M tokens
    Output
    $0.50
    /M tokens
    Get Started

    Grok 4 Fast

    xai
    grok-4-fast
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    xAI
    Context: 2M
    Input
    $0.20
    /M tokens
    Cached
    $0.05
    /M tokens
    Output
    $0.50
    /M tokens
    Get Started

    GPT-5.4 Pro

    openai30% off
    gpt-5.4-pro
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Azure
    Context: 1.1M
    Input
    $30.00
    /M tokens
    Cached
    —
    /M tokens
    Output
    $180.00
    /M tokens
    + $0.010 per search
    Get Started

    GPT-5.4

    openai30% off
    gpt-5.4
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    Native Web Search
    Azure
    Context: 1.1M30% off
    Input
    $2.50$1.75
    -30% off
    /M tokens
    Cached
    $0.25$0.17
    -30% off
    /M tokens
    Output
    $15.00$10.50
    -30% off
    /M tokens
    + $0.010 per search
    Get Started

    Grok Imagine Image

    xai
    grok-imagine-image
    Vision
    Image Generation
    xAI
    Context: 2k
    Per image
    $0.0200
    Get Started

    Grok Imagine Image Pro

    xai
    grok-imagine-image-pro
    Vision
    Image Generation
    xAI
    Context: 2k
    Per image
    $0.0700
    Get Started

    Gemini 3.1 Flash Lite (Preview)

    google
    gemini-3.1-flash-lite-preview
    Streaming
    Vision
    Tools
    JSON Output
    Structured JSON Output
    Google AI Studio
    Context: 1.0M
    Input
    $0.25
    /M tokens
    Cached
    $0.03
    /M tokens
    Output
    $1.50
    /M tokens
    Get Started

    Gemini 3.1 Flash Image (Preview)

    google20% off
    gemini-3.1-flash-image-preview
    Streaming
    Vision
    JSON Output
    Structured JSON Output
    Image Generation
    Glacier
    Context: 65.5k20% off
    Per image (0.5K)
    $0.0448$0.0359
    Image Pricing (est. per image)
    Input
    any size~$0.0001~$0.0001
    Output
    0.5K~$0.0448~$0.0359
    1K~$0.0672~$0.0538
    2K~$0.1008~$0.0806
    4K~$0.1512~$0.1210
    Get Started

    Qwen3.5 397B A17B

    alibaba20% off
    qwen35-397b-a17b
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Alibaba Cloud
    Context: 262.1k20% off
    Input
    $0.17$0.14
    -20% off
    /M tokens
    Cached
    —
    /M tokens
    Output
    $1.03$0.83
    -20% off
    /M tokens
    + $0.010 per search
    Get Started

    Devstral Small 1.1

    mistral
    devstral-small-2507
    Streaming
    JSON Output
    Mistral AI
    Context: 131.1k
    Input
    $0.10
    /M tokens
    Cached
    —
    /M tokens
    Output
    $0.30
    /M tokens
    Get Started

    Devstral 2

    mistral
    devstral-2512
    Streaming
    JSON Output
    Mistral AI
    Context: 262.1k
    Input
    $0.40
    /M tokens
    Cached
    —
    /M tokens
    Output
    $2.00
    /M tokens
    Get Started

    Codestral

    mistral
    codestral-2508
    Streaming
    JSON Output
    Mistral AI
    Context: 256k
    Input
    $0.30
    /M tokens
    Cached
    —
    /M tokens
    Output
    $0.90
    /M tokens
    Get Started

    Ministral 3B

    mistral
    ministral-3b-2512
    Streaming
    Vision
    JSON Output
    Mistral AI
    Context: 131.1k
    Input
    $0.10
    /M tokens
    Cached
    —
    /M tokens
    Output
    $0.10
    /M tokens
    Get Started

    Ministral 8B

    mistral
    ministral-8b-2512
    Streaming
    Vision
    JSON Output
    Mistral AI
    Context: 262.1k
    Input
    $0.15
    /M tokens
    Cached
    —
    /M tokens
    Output
    $0.15
    /M tokens
    Get Started

    Ministral 14B

    mistral
    ministral-14b-2512
    Streaming
    Vision
    JSON Output
    Mistral AI
    Context: 262.1k
    Input
    $0.20
    /M tokens
    Cached
    —
    /M tokens
    Output
    $0.20
    /M tokens
    Get Started

    Mistral Small 3.2

    mistral
    mistral-small-2506
    Streaming
    Vision
    JSON Output
    Mistral AI
    Context: 128k
    Input
    $0.10
    /M tokens
    Cached
    —
    /M tokens
    Output
    $0.30
    /M tokens
    Get Started

    Mistral Large 3

    mistral
    mistral-large-2512
    Streaming
    Vision
    JSON Output
    Mistral AI
    Context: 262.1k
    Input
    $0.50
    /M tokens
    Cached
    —
    /M tokens
    Output
    $1.50
    /M tokens
    Get Started

    Gemini 3.1 Pro (Preview)

    google20% off
    gemini-3.1-pro-preview
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    JSON Output
    Structured JSON Output
    Native Web Search
    Google AI Studio
    Context: 1.0M
    Input
    $2.00
    /M tokens
    Cached
    $0.20
    /M tokens
    Output
    $12.00
    /M tokens
    + $0.014 per search
    Get Started

    Claude Sonnet 4.6

    anthropic30% off
    claude-sonnet-4-6
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    Structured JSON Output
    Native Web Search
    Anthropic
    Context: 200k
    Input
    $3.00
    /M tokens
    Cached
    $0.30
    /M tokens
    Output
    $15.00
    /M tokens
    + $0.010 per search
    Get Started

    GLM-5

    glm30% off
    glm-5
    Streaming
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    Native Web Search
    Alibaba Cloud
    Context: 202.8k
    Input
    $0.57
    /M tokens
    Cached
    —
    /M tokens
    Output
    $2.58
    /M tokens
    Get Started

    MiniMax M2.5

    minimax30% off
    minimax-m2.5
    Streaming
    Reasoning
    Tools
    JSON Output
    Structured JSON Output
    CanopyWave
    Context: 204.8k30% off
    Input
    $0.27$0.19
    -30% off
    /M tokens
    Cached
    $0.03$0.02
    -30% off
    /M tokens
    Output
    $1.08$0.76
    -30% off
    /M tokens
    Get Started

    Claude Opus 4.6

    anthropic30% off
    claude-opus-4-6
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    Structured JSON Output
    Native Web Search
    Anthropic
    Context: 1M
    Input
    $5.00
    /M tokens
    Cached
    $0.50
    /M tokens
    Output
    $25.00
    /M tokens
    + $0.010 per search
    Get Started

    Hermes 2 Pro Llama 3 8B

    nousresearch
    hermes-2-pro-llama-3-8b
    Streaming
    NovitaAI
    Context: 8.2k
    Input
    $0.14
    /M tokens
    Cached
    —
    /M tokens
    Output
    $0.14
    /M tokens
    Get Started

    Qwen3 4B FP8

    alibaba
    qwen3-4b-fp8
    Streaming
    NovitaAI
    Context: 128k
    Input
    $0.03
    /M tokens
    Cached
    —
    /M tokens
    Output
    $0.03
    /M tokens
    Get Started

    Qwen3 30B A3B FP8

    alibaba
    qwen3-30b-a3b-fp8
    Streaming
    NovitaAI
    Context: 41.0k
    Input
    $0.09
    /M tokens
    Cached
    —
    /M tokens
    Output
    $0.45
    /M tokens
    Get Started

    Qwen3 32B FP8

    alibaba
    qwen3-32b-fp8
    Streaming
    NovitaAI
    Context: 41.0k
    Input
    $0.10
    /M tokens
    Cached
    —
    /M tokens
    Output
    $0.45
    /M tokens
    Get Started

    Qwen3 VL 30B A3B Thinking

    alibaba
    qwen3-vl-30b-a3b-thinking
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    NovitaAI
    Context: 131.1k
    Input
    $0.20
    /M tokens
    Cached
    —
    /M tokens
    Output
    $1.00
    /M tokens
    Get Started

    Qwen3 VL 30B A3B Instruct

    alibaba
    qwen3-vl-30b-a3b-instruct
    Streaming
    Vision
    Tools
    NovitaAI
    Context: 131.1k
    Input
    $0.20
    /M tokens
    Cached
    —
    /M tokens
    Output
    $0.70
    /M tokens
    Get Started
    Page 1 of 5

    Newsletter

    Stay ahead of the curve

    Join developers who get weekly insights on LLM routing, new model launches, and cost optimization — straight to their inbox.

    • New models & providers as they drop
    • Tips to cut latency & costs
    • Early access to beta features

    No spam. Unsubscribe anytime.

    LLM Gateway

    Product

    • Features
    • Models
    • Providers
    • Chat Playground
    • Changelog
    • DevPass
    • Compare Models
    • Enterprise

    Resources

    • Templates
    • Agents
    • MCP Server
    • Blog
    • Documentation
    • Integrations
    • Guides
    • Brand Assets
    • Token Cost Calculator
    • Referral Program
    • GitHub
    • Contact Us

    Community

    • Twitter
    • Discord

    Compare

    • OpenRouter
    • LiteLLM

    Models

    • Text Generation
    • Text to Image
    • Image to Image
    • Vision
    • Reasoning
    • Tool Calling
    • Web Search
    • Discounted

    Providers

    • OpenAI
    • Anthropic
    • Google AI Studio
    • Glacier
    • Google Vertex AI
    • Quartz
    • Avalanche
    • Groq
    • Cerebras
    • xAI
    • DeepSeek
    • Bluestone
    • Alibaba Cloud
    • NovitaAI
    • AWS Bedrock
    • Azure
    • Z AI
    • Moonshot AI
    • Perplexity
    • Nebius AI
    • Mistral AI
    • CanopyWave
    • Inference.net
    • Together AI
    • Custom
    • NanoGPT
    • ByteDance
    • MiniMax
    • EmberCloud

    © 2026 LLM Gateway. All rights reserved.

    All systems operationalPrivacy PolicyTerms of Use