Support

AI-powered help

Welcome!

Please introduce yourself before we start.

    LLM Gateway
    • Docs
    • Pricing
    • Pricing
    • Docs
    • Models
    1.2k
    Log InGet Started

    Models

    Comprehensive list of all supported models and their providers

    Compare

    Use Case

    Capabilities

    Provider

    Input Price ($/M tokens)

    Output Price ($/M tokens)

    Context Size (tokens)

    247
    Models
    36
    Providers
    115
    Vision Models
    156
    Tool-enabled
    3
    Free Models
    Features
    ByteDance
    kimi-k2-thinking
    $0.60$2.50$0.12
    Moonshot AI
    kimi-k2-thinking
    $0.60$2.50$0.15
    Anthropic
    claude-haiku-4-5
    $1.00$5.00$0.10
    AWS Bedrock
    claude-haiku-4-5
    $1.00$0.80
    -20% off
    $5.00$4.00
    -20% off
    $0.10$0.08
    -20% off
    Vertex AI (Anthropic)
    claude-haiku-4-5
    $1.00$5.00$0.10
    AWS Bedrock
    llama-3.1-70b-instruct
    $0.72$0.72—
    AWS Bedrock
    llama-4-maverick-17b-instruct
    $0.24$0.97—
    NovitaAI
    llama-4-maverick-17b-instruct
    $0.27$0.85—
    AWS Bedrock
    llama-4-scout-17b-instruct
    $0.17$0.66—
    NovitaAI
    llama-4-scout-17b-instruct
    $0.18$0.59—
    Alibaba Cloud(cn-beijing)
    glm-4.6
    $0.43$2.01—
    Alibaba Cloud
    glm-4.6
    $0.43$2.01—
    Cerebras
    glm-4.6
    $2.25$2.75—
    NovitaAI
    glm-4.6
    $0.55$2.20$0.11
    Z AI
    glm-4.6
    $0.60$0.54
    -10% off
    $2.20$1.98
    -10% off
    $0.11$0.10
    -10% off
    Anthropic
    claude-sonnet-4-5
    $3.00$15.00$0.30
    AWS Bedrock
    claude-sonnet-4-5
    $3.00$2.40
    -20% off
    $15.00$12.00
    -20% off
    $0.30$0.24
    -20% off
    Vertex AI (Anthropic)
    claude-sonnet-4-5
    $3.00$15.00$0.30
    Google AI Studio
    gemini-2.5-flash-lite-preview-09-2025
    $0.10$0.40$0.01
    Google Vertex AI
    gemini-2.5-flash-lite-preview-09-2025
    $0.10$0.40$0.01
    Google AI Studio
    gemini-2.5-flash-preview-09-2025
    $0.30$2.50$0.03
    Google Vertex AI
    gemini-2.5-flash-preview-09-2025
    $0.30$2.50$0.03
    Google AI Studio
    gemini-2.5-flash-lite
    $0.10$0.40$0.01
    Google Vertex AI
    gemini-2.5-flash-lite
    $0.10$0.40$0.01
    xAI
    grok-4-fast-non-reasoning
    $0.20$0.50$0.05
    xAI
    grok-4-fast-reasoning
    $0.20$0.50$0.05
    xAI
    grok-4
    $3.00$15.00$0.75
    Anthropic
    claude-3-5-sonnet-20240620
    $3.00$15.00$0.30
    Anthropic
    claude-opus-4-1-20250805
    $15.00$75.00$1.50
    AWS Bedrock
    claude-opus-4-1-20250805
    $15.00$12.00
    -20% off
    $75.00$60.00
    -20% off
    $1.50$1.20
    -20% off
    Z AI
    glm-4-32b-0414-128k
    $0.10$0.09
    -10% off
    $0.10$0.09
    -10% off
    $0.00$0.00
    -10% off
    Z AI
    glm-4.5-flash
    $0.00$0.00$0.00
    Z AI
    glm-4.5-airx
    $1.10$0.99
    -10% off
    $4.50$4.05
    -10% off
    $0.22$0.20
    -10% off
    Z AI
    glm-4.5-x
    $2.20$1.98
    -10% off
    $8.90$8.01
    -10% off
    $0.45$0.40
    -10% off
    EmberCloud
    glm-4.5-air
    $0.13$0.85$0.02
    Z AI
    glm-4.5-air
    $0.20$0.18
    -10% off
    $1.10$0.99
    -10% off
    $0.03$0.03
    -10% off
    NovitaAI
    glm-4.5v
    $0.60$1.80$0.11
    Z AI
    glm-4.5v
    $0.60$0.54
    -10% off
    $1.80$1.62
    -10% off
    $0.11$0.10
    -10% off
    EmberCloud
    glm-4.5
    $0.60$2.20$0.11
    Z AI
    glm-4.5
    $0.60$0.54
    -10% off
    $2.20$1.98
    -10% off
    $0.11$0.10
    -10% off
    Nebius AI
    hermes-3-llama-405b
    $1.00$3.00—
    Alibaba Cloud
    qwen3-max
    $3.00$2.40
    -20% off
    $15.00$12.00
    -20% off
    $0.60$0.48
    -20% off
    NovitaAI
    qwen3-max
    $0.84$3.38—
    Alibaba Cloud
    qwen3-next-80b-a3b-instruct
    $0.50$0.40
    -20% off
    $2.00$1.60
    -20% off
    —
    NovitaAI
    qwen3-next-80b-a3b-instruct
    $0.15$1.50—
    Alibaba Cloud
    qwen3-next-80b-a3b-thinking
    $0.50$0.40
    -20% off
    $6.00$4.80
    -20% off
    —
    Nebius AI
    qwen3-next-80b-a3b-thinking
    $0.15$1.20—
    NovitaAI
    qwen3-next-80b-a3b-thinking
    $0.15$1.50—
    Alibaba Cloud
    qwen-vl-plus
    $0.21$0.17
    -20% off
    $0.64$0.51
    -20% off
    —
    Alibaba Cloud
    qwen-vl-max
    $0.80$0.64
    -20% off
    $3.20$2.56
    -20% off
    —
    Page 6 of 10

    Newsletter

    Stay ahead of the curve

    Join developers who get weekly insights on LLM routing, new model launches, and cost optimization — straight to their inbox.

    • New models & providers as they drop
    • Tips to cut latency & costs
    • Early access to beta features

    No spam. Unsubscribe anytime.

    All systems operational

    Product

    • Features
    • Models
    • Providers
    • Chat Playground
    • Changelog
    • DevPass
    • Compare Models
    • Enterprise

    Resources

    • Apps
    • Templates
    • Agents
    • MCP Server
    • Blog
    • Documentation
    • Integrations
    • Guides
    • Brand Assets
    • Token Cost Calculator
    • Referral Program
    • GitHub
    • Contact Us

    Community

    • Twitter
    • Discord

    Compare

    • OpenRouter
    • LiteLLM

    Models

    • Text Generation
    • Text to Image
    • Image to Image
    • Vision
    • Reasoning
    • Tool Calling
    • Web Search
    • Discounted

    Providers

    • OpenAI
    • Anthropic
    • Google AI Studio
    • Glacier
    • Google Vertex AI
    • Vertex AI (OpenAI-compatible)
    • Vertex AI (Anthropic)
    • Quartz
    • Avalanche
    • Groq
    • Cerebras
    • xAI
    • DeepSeek
    • Alibaba Cloud
    • NovitaAI
    • AWS Bedrock
    • Azure
    • Azure AI Foundry
    • Z AI
    • Moonshot AI
    • Perplexity
    • Nebius AI
    • Mistral AI
    • Inference.net
    • Together AI
    • Custom
    • NanoGPT
    • ByteDance
    • MiniMax
    • EmberCloud
    • Xiaomi
    • DeepInfra

    © 2026 LLM Gateway. All rights reserved.

    Privacy PolicyTerms of Use