    LLM Gateway

    Models

    Comprehensive list of all supported models and their providers

    Showing 91 of 227 models from 24 of 34 providers:
    • Vision models (filtered): 58
    • Tool-enabled (filtered): 84
    • Free models (filtered): 2

    Kimi K2.6

    moonshot (30% off)
    kimi-k2.6
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    CanopyWave
    Context: 262.1k
    Input
    $0.35 (list price $0.50, 30% off)
    /M tokens
    Cached
    $0.07 (list price $0.10, 30% off)
    /M tokens
    Output
    $1.96 (list price $2.80, 30% off)
    /M tokens

    Claude Opus 4.7

    anthropic (30% off)
    claude-opus-4-7
    Streaming
    Vision
    Tools
    Reasoning
    Structured JSON Output
    Native Web Search
    Anthropic
    Context: 1M
    Input
    $5.00
    /M tokens
    Cached
    $0.50
    /M tokens
    Output
    $25.00
    /M tokens
    + $0.010 per search

    GLM-5.1

    glm (10% off)
    glm-5.1
    Streaming
    Tools
    Reasoning
    JSON Output
    Native Web Search
    NovitaAI
    Context: 204.8k
    Input
    $1.40
    /M tokens
    Cached
    $0.26
    /M tokens
    Output
    $4.40
    /M tokens

    MiniMax M2.5 Highspeed

    minimax
    minimax-m2.5-highspeed
    Streaming
    Reasoning
    MiniMax
    Context: 204.8k
    Input
    $0.60
    /M tokens
    Cached
    $0.03
    /M tokens
    Output
    $2.40
    /M tokens

    MiniMax M2.7 Highspeed

    minimax
    minimax-m2.7-highspeed
    Streaming
    Reasoning
    MiniMax
    Context: 204.8k
    Input
    $0.60
    /M tokens
    Cached
    $0.06
    /M tokens
    Output
    $2.40
    /M tokens

    MiniMax M2.7

    minimax
    minimax-m2.7
    Streaming
    Reasoning
    Tools
    JSON Output
    Structured JSON Output
    MiniMax
    Context: 204.8k
    Input
    $0.30
    /M tokens
    Cached
    $0.06
    /M tokens
    Output
    $1.20
    /M tokens

    Gemini Pro Latest

    google
    gemini-pro-latest
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    JSON Output
    Structured JSON Output
    Native Web Search
    Google AI Studio
    Context: 1.0M
    Input
    $2.00
    /M tokens
    Cached
    $0.20
    /M tokens
    Output
    $12.00
    /M tokens
    + $0.014 per search

    GPT-5.4 Nano

    openai (30% off)
    gpt-5.4-nano
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    Native Web Search
    Azure
    Context: 400k
    Input
    $0.14 (list price $0.20, 30% off)
    /M tokens
    Cached
    $0.01 (list price $0.02, 30% off)
    /M tokens
    Output
    $0.88 (list price $1.25, 30% off)
    /M tokens
    + $0.010 per search

    GPT-5.4 Mini

    openai (30% off)
    gpt-5.4-mini
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    Native Web Search
    Azure
    Context: 400k
    Input
    $0.52 (list price $0.75, 30% off)
    /M tokens
    Cached
    $0.05 (list price $0.07, 30% off)
    /M tokens
    Output
    $3.15 (list price $4.50, 30% off)
    /M tokens
    + $0.010 per search

    Grok 4.20 Beta Reasoning (0309)

    xai
    grok-4-20-beta-0309-reasoning
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    xAI
    Context: 2M
    Input
    $2.00
    /M tokens
    Cached
    $0.20
    /M tokens
    Output
    $6.00
    /M tokens

    Grok 4.20 Multi-Agent Beta (0309)

    xai (Model Deactivated)
    grok-4-20-multi-agent-beta-0309
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    xAI
    Context: 2M
    Deactivated since Mar 27, 2026
    Input
    $2.00
    /M tokens
    Cached
    $0.20
    /M tokens
    Output
    $6.00
    /M tokens

    GPT-5.3 Chat

    openai
    gpt-5.3-chat-latest
    Streaming
    Vision
    Tools
    Reasoning
    Native Web Search
    Azure
    Context: 128k
    Input
    $1.75
    /M tokens
    Cached
    $0.17
    /M tokens
    Output
    $14.00
    /M tokens

    GPT-5.3 Codex

    openai
    gpt-5.3-codex
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Azure
    Context: 400k
    Input
    $1.75
    /M tokens
    Cached
    $0.17
    /M tokens
    Output
    $14.00
    /M tokens

    GPT-5.2 Codex

    openai
    gpt-5.2-codex
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Azure
    Context: 400k
    Input
    $1.75
    /M tokens
    Cached
    $0.17
    /M tokens
    Output
    $14.00
    /M tokens

    o4 Mini

    openai
    o4-mini
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    Azure
    Context: 200k
    Input
    $1.10
    /M tokens
    Cached
    $0.28
    /M tokens
    Output
    $4.40
    /M tokens

    Grok 4.1 Fast

    xai
    grok-4-1-fast
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    xAI
    Context: 2M
    Input
    $0.20
    /M tokens
    Cached
    $0.05
    /M tokens
    Output
    $0.50
    /M tokens

    Grok 4 Fast

    xai
    grok-4-fast
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    xAI
    Context: 2M
    Input
    $0.20
    /M tokens
    Cached
    $0.05
    /M tokens
    Output
    $0.50
    /M tokens

    GPT-5.4 Pro

    openai (30% off)
    gpt-5.4-pro
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Azure
    Context: 1.1M
    Input
    $30.00
    /M tokens
    Cached
    —
    /M tokens
    Output
    $180.00
    /M tokens
    + $0.010 per search

    GPT-5.4

    openai (30% off)
    gpt-5.4
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    Native Web Search
    Azure
    Context: 1.1M
    Input
    $1.75 (list price $2.50, 30% off)
    /M tokens
    Cached
    $0.17 (list price $0.25, 30% off)
    /M tokens
    Output
    $10.50 (list price $15.00, 30% off)
    /M tokens
    + $0.010 per search

    Qwen3.5 397B A17B

    alibaba (20% off)
    qwen35-397b-a17b
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Alibaba Cloud
    Context: 262.1k
    Input
    $0.14 (list price $0.17, 20% off)
    /M tokens
    Cached
    —
    /M tokens
    Output
    $0.83 (list price $1.03, 20% off)
    /M tokens
    + $0.010 per search

    Gemini 3.1 Pro (Preview)

    google (20% off)
    gemini-3.1-pro-preview
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    JSON Output
    Structured JSON Output
    Native Web Search
    Google AI Studio
    Context: 1.0M
    Input
    $2.00
    /M tokens
    Cached
    $0.20
    /M tokens
    Output
    $12.00
    /M tokens
    + $0.014 per search

    Claude Sonnet 4.6

    anthropic (30% off)
    claude-sonnet-4-6
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    Structured JSON Output
    Native Web Search
    Anthropic
    Context: 200k
    Input
    $3.00
    /M tokens
    Cached
    $0.30
    /M tokens
    Output
    $15.00
    /M tokens
    + $0.010 per search

    GLM-5

    glm (30% off)
    glm-5
    Streaming
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    Native Web Search
    Alibaba Cloud
    Context: 202.8k
    Input
    $0.57
    /M tokens
    Cached
    —
    /M tokens
    Output
    $2.58
    /M tokens

    MiniMax M2.5

    minimax (30% off)
    minimax-m2.5
    Streaming
    Reasoning
    Tools
    JSON Output
    Structured JSON Output
    CanopyWave
    Context: 204.8k
    Input
    $0.19 (list price $0.27, 30% off)
    /M tokens
    Cached
    $0.02 (list price $0.03, 30% off)
    /M tokens
    Output
    $0.76 (list price $1.08, 30% off)
    /M tokens

    Claude Opus 4.6

    anthropic (30% off)
    claude-opus-4-6
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    Structured JSON Output
    Native Web Search
    Anthropic
    Context: 1M
    Input
    $5.00
    /M tokens
    Cached
    $0.50
    /M tokens
    Output
    $25.00
    /M tokens
    + $0.010 per search

    Qwen3 VL 30B A3B Thinking

    alibaba
    qwen3-vl-30b-a3b-thinking
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    NovitaAI
    Context: 131.1k
    Input
    $0.20
    /M tokens
    Cached
    —
    /M tokens
    Output
    $1.00
    /M tokens

    Kimi K2.5

    moonshot (30% off)
    kimi-k2.5
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Alibaba Cloud
    Context: 262.1k
    Input
    $0.57
    /M tokens
    Cached
    —
    /M tokens
    Output
    $3.01
    /M tokens

    Qwen3 Max 2026-01-23

    alibaba (20% off)
    qwen3-max-2026-01-23
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Alibaba Cloud
    Context: 262.1k
    Input
    $0.29 (list price $0.36, 20% off)
    /M tokens
    Cached
    $0.06 (list price $0.07, 20% off)
    /M tokens
    Output
    $1.15 (list price $1.43, 20% off)
    /M tokens

    Qwen3 VL 235B A22B Thinking

    alibaba (20% off)
    qwen3-vl-235b-a22b-thinking
    Streaming
    Vision
    Reasoning
    Alibaba Cloud
    Context: 131.1k
    Input
    $0.40 (list price $0.50, 20% off)
    /M tokens
    Cached
    —
    /M tokens
    Output
    $1.60 (list price $2.00, 20% off)
    /M tokens

    QwQ Plus

    alibaba (20% off)
    qwq-plus
    Streaming
    Reasoning
    Alibaba Cloud
    Context: 131.1k
    Input
    $0.18 (list price $0.23, 20% off)
    /M tokens
    Cached
    —
    /M tokens
    Output
    $0.46 (list price $0.57, 20% off)
    /M tokens

    MiniMax Text 01

    minimax
    minimax-text-01
    Streaming
    Reasoning
    MiniMax
    Context: 1M
    Input
    $0.20
    /M tokens
    Cached
    —
    /M tokens
    Output
    $1.10
    /M tokens

    MiniMax M2.1 Lightning

    minimax
    minimax-m2.1-lightning
    Streaming
    Reasoning
    MiniMax
    Context: 196.6k
    Input
    $0.12
    /M tokens
    Cached
    —
    /M tokens
    Output
    $0.48
    /M tokens

    GLM-4.7 Flash

    glm
    glm-4.7-flash
    Streaming
    Tools
    JSON Output
    Reasoning
    EmberCloud
    Context: 200k
    Input
    $0.06
    /M tokens
    Cached
    $0.01
    /M tokens
    Output
    $0.40
    /M tokens

    GLM-4.7 FlashX

    glm (10% off)
    glm-4.7-flashx
    Streaming
    Tools
    Reasoning
    JSON Output
    Z AI
    Context: 200k
    Input
    $0.06 (list price $0.07, 10% off)
    /M tokens
    Cached
    $0.01 (list price $0.01, 10% off)
    /M tokens
    Output
    $0.36 (list price $0.40, 10% off)
    /M tokens

    Seed 1.8 (251228)

    bytedance
    seed-1-8-251228
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    ByteDance
    Context: 256k
    Input
    $0.25
    /M tokens
    Cached
    $0.05
    /M tokens
    Output
    $2.00
    /M tokens

    Seed 1.6 Flash (250715)

    bytedance
    seed-1-6-flash-250715
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    ByteDance
    Context: 256k
    Input
    $0.07
    /M tokens
    Cached
    $0.02
    /M tokens
    Output
    $0.30
    /M tokens

    Seed 1.6 (250915)

    bytedance
    seed-1-6-250915
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    ByteDance
    Context: 256k
    Input
    $0.25
    /M tokens
    Cached
    $0.05
    /M tokens
    Output
    $2.00
    /M tokens

    Seed 1.6 (250615)

    bytedance
    seed-1-6-250615
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    ByteDance
    Context: 256k
    Input
    $0.25
    /M tokens
    Cached
    $0.05
    /M tokens
    Output
    $2.00
    /M tokens

    GLM-4.6V FlashX

    glm (10% off)
    glm-4.6v-flashx
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Z AI
    Context: 128k
    Input
    $0.04 (list price $0.04, 10% off)
    /M tokens
    Cached
    $0.00 (list price $0.00, 10% off)
    /M tokens
    Output
    $0.36 (list price $0.40, 10% off)
    /M tokens

    MiniMax M2.1

    minimax (30% off)
    minimax-m2.1
    Streaming
    Tools
    Reasoning
    JSON Output
    CanopyWave
    Context: 204.8k
    Deactivated since Mar 31, 2026
    Input
    $0.19 (list price $0.27, 30% off)
    /M tokens
    Cached
    $0.05 (list price $0.07, 30% off)
    /M tokens
    Output
    $0.76 (list price $1.08, 30% off)
    /M tokens

    GLM-4.7

    glm (30% off)
    glm-4.7
    Streaming
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Alibaba Cloud
    Context: 202.8k
    Input
    $0.43
    /M tokens
    Cached
    —
    /M tokens
    Output
    $2.01
    /M tokens

    Gemini 3 Flash (Preview)

    google
    gemini-3-flash-preview
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    JSON Output
    Structured JSON Output
    Native Web Search
    Google AI Studio
    Context: 1.0M
    Input
    $0.50
    /M tokens
    Cached
    $0.05
    /M tokens
    Output
    $3.00
    /M tokens
    + $0.014 per search

    GPT-5.2 Chat

    openai
    gpt-5.2-chat-latest
    Streaming
    Vision
    Tools
    Reasoning
    Native Web Search
    Azure
    Context: 128k
    Input
    $1.75
    /M tokens
    Cached
    $0.17
    /M tokens
    Output
    $14.00
    /M tokens

    GPT-5.2 Pro

    openai
    gpt-5.2-pro
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Azure
    Context: 400k
    Input
    $21.00
    /M tokens
    Cached
    —
    /M tokens
    Output
    $168.00
    /M tokens

    GPT-5.2

    openai (30% off)
    gpt-5.2
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    Native Web Search
    Azure
    Context: 400k
    Input
    $1.22 (list price $1.75, 30% off)
    /M tokens
    Cached
    $0.12 (list price $0.17, 30% off)
    /M tokens
    Output
    $9.80 (list price $14.00, 30% off)
    /M tokens

    Claude Sonnet 4.5 (2025-09-29)

    anthropic (30% off)
    claude-sonnet-4-5-20250929
    Streaming
    Tools
    Reasoning
    Reasoning Budget
    Structured JSON Output
    Native Web Search
    Anthropic
    Context: 200k
    Input
    $3.00
    /M tokens
    Cached
    $0.30
    /M tokens
    Output
    $15.00
    /M tokens
    + $0.010 per search

    GLM-4.6V Flash

    glm
    glm-4.6v-flash
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Z AI
    Context: 128k
    Input
    $0.00
    /M tokens
    Cached
    $0.00
    /M tokens
    Output
    $0.00
    /M tokens

    GLM-4.6V

    glm (10% off)
    glm-4.6v
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    NovitaAI
    Context: 131.1k
    Input
    $0.30
    /M tokens
    Cached
    $0.06
    /M tokens
    Output
    $0.90
    /M tokens

    DeepSeek V3.2

    deepseek (30% off)
    deepseek-v3.2
    Streaming
    Tools
    JSON Output
    Reasoning
    Alibaba Cloud
    Context: 131.1k
    Input
    $0.23 (list price $0.29, 20% off)
    /M tokens
    Cached
    $0.05 (list price $0.06, 20% off)
    /M tokens
    Output
    $0.34 (list price $0.43, 20% off)
    /M tokens

    Kimi K2 Thinking Turbo

    moonshot
    kimi-k2-thinking-turbo
    Streaming
    Tools
    Reasoning
    JSON Output
    Moonshot AI
    Context: 262.1k
    Input
    $1.15
    /M tokens
    Cached
    $0.15
    /M tokens
    Output
    $8.00
    /M tokens
    Page 1 of 2
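
The per-million-token prices in the listing can be turned into a rough per-request estimate. The following is a minimal Python sketch, not LLM Gateway's actual billing logic. The `Pricing` class and `estimate_cost` function are hypothetical names introduced here for illustration. It assumes cached input tokens are billed at the Cached rate and the remaining input tokens at the Input rate, and that a model with no listed Cached price bills all input at the Input rate. The sample figures are the Kimi K2 Thinking Turbo prices from this page.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Pricing:
    """Listed prices in USD per million tokens (hypothetical helper type)."""
    input_per_m: float
    output_per_m: float
    cached_per_m: Optional[float] = None  # None when no Cached price is listed
    per_search: float = 0.0               # flat fee per native web search, if any


def estimate_cost(pricing: Pricing, input_tokens: int, output_tokens: int,
                  cached_tokens: int = 0, searches: int = 0) -> float:
    """Estimate the USD cost of one request.

    Assumption: cached_tokens is the part of input_tokens served from cache
    and is billed at the cached rate instead of the input rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    # Fall back to the input rate when the listing shows no cached price.
    cached_rate = (pricing.cached_per_m
                   if pricing.cached_per_m is not None
                   else pricing.input_per_m)
    uncached_tokens = input_tokens - cached_tokens
    token_cost = (uncached_tokens * pricing.input_per_m
                  + cached_tokens * cached_rate
                  + output_tokens * pricing.output_per_m) / 1_000_000
    return token_cost + searches * pricing.per_search


# Prices copied from the Kimi K2 Thinking Turbo card on this page.
KIMI_K2_THINKING_TURBO = Pricing(input_per_m=1.15, output_per_m=8.00,
                                 cached_per_m=0.15)

if __name__ == "__main__":
    cost = estimate_cost(KIMI_K2_THINKING_TURBO, input_tokens=200_000,
                         output_tokens=10_000, cached_tokens=50_000)
    print(f"Estimated cost: ${cost:.4f}")
```

For a discounted model, pass the discounted figure rather than the list price. How a given provider actually counts cached tokens may differ from the assumption made here, so treat the result as an approximation.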

    © 2026 LLM Gateway. All rights reserved.
