LLM Gateway
    • Pricing
    • Docs
    • Models
    • Pricing
    • Docs
    • Models
    Star926
    Get Started

    Models

    Comprehensive list of all supported models and their providers

    Compare

    Use Case

    Capabilities

    Provider

    Input Price ($/M tokens)

    Output Price ($/M tokens)

    Context Size (tokens)

    204/204
    Models
    25/30
    Providers
    92
    Vision Models (filtered)
    127
    Tool-enabled (filtered)
    3
    Free Models (filtered)

    Gemini 3.1 Flash Lite (Preview)

    google
    gemini-3.1-flash-lite-preview

    Providers

    Google AI Studio
    google-ai-studio/gemini-3.1-flash-lite-preview
    Context Size
    1.0M
    Stability
    stable
    Pricing
    Input
    $0.25/M
    Cached
    $0.03/M
    Output
    $1.50/M
    Capabilities
    Streaming
    Vision
    Tools
    JSON Output
    Structured JSON Output
    Try in Playground

    Grok Imagine Image

    xai
    grok-imagine-image

    Providers

    xAI
    xai/grok-imagine-image
    Context Size
    2k
    Stability
    stable
    Pricing
    Input
    $0.00/M
    Cached
    —/M
    Output
    $0.00/M
    Per Request
    $0.020/req
    Capabilities
    Image Generation
    Try in Playground

    Grok Imagine Image Pro

    xai
    grok-imagine-image-pro

    Providers

    xAI
    xai/grok-imagine-image-pro
    Context Size
    2k
    Stability
    stable
    Pricing
    Input
    $0.00/M
    Cached
    —/M
    Output
    $0.00/M
    Per Request
    $0.070/req
    Capabilities
    Image Generation
    Try in Playground

    Gemini 3.1 Flash Image (Preview)

    google
    gemini-3.1-flash-image-preview

    Providers

    Google AI Studio
    google-ai-studio/gemini-3.1-flash-image-preview
    Context Size
    65.5k
    Stability
    stable
    Pricing
    Input
    $0.25/M
    Cached
    —/M
    Output
    $1.50/M
    Capabilities
    Streaming
    Vision
    JSON Output
    Structured JSON Output
    Image Generation
    Try in Playground

    Gemini 3.1 Pro (Preview)

    google
    gemini-3.1-pro-preview

    Providers

    Google AI Studio
    google-ai-studio/gemini-3.1-pro-preview
    Context Size
    1.0M
    Stability
    stable
    Pricing
    Input
    $2.00/M
    Cached
    $0.20/M
    Output
    $12.00/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    JSON Output
    Structured JSON Output
    Native Web Search
    Try in Playground

    Claude Sonnet 4.6

    anthropic
    claude-sonnet-4-6

    Providers

    Anthropic
    anthropic/claude-sonnet-4-6
    Context Size
    200k
    Stability
    stable
    Pricing
    Input
    $3.00/M
    Cached
    $0.30/M
    Output
    $15.00/M
    Capabilities
    Streaming
    Tools
    Reasoning
    Reasoning Budget
    Structured JSON Output
    Native Web Search
    Try in Playground

    Qwen3.5 397B A17B

    alibaba
    qwen35-397b-a17b

    Providers

    Alibaba Cloud
    alibaba/qwen35-397b-a17b
    Context Size
    262.1k
    Stability
    stable
    Pricing
    20% off
    Input
    $0.60$0.48
    -20% off
    /M
    Cached
    —/M
    Output
    $3.60$2.88
    -20% off
    /M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Try in Playground

    GLM-5

    glm
    glm-5

    Providers

    CanopyWave
    canopywave/glm-5
    Context Size
    200k
    Stability
    stable
    Pricing
    30% off
    Input
    $0.90$0.63
    -30% off
    /M
    Cached
    —/M
    Output
    $3.10$2.17
    -30% off
    /M
    Capabilities
    Streaming
    Tools
    Reasoning
    Try in Playground

    MiniMax M2.5

    minimax
    minimax-m2.5

    Providers

    CanopyWave
    canopywave/minimax-m2.5
    Context Size
    204.8k
    Stability
    stable
    Pricing
    30% off
    Input
    $0.27$0.19
    -30% off
    /M
    Cached
    —/M
    Output
    $1.08$0.76
    -30% off
    /M
    Capabilities
    Streaming
    Reasoning
    Try in Playground

    Claude Opus 4.6

    anthropic
    claude-opus-4-6

    Providers

    Anthropic
    anthropic/claude-opus-4-6
    Context Size
    1M
    Stability
    stable
    Pricing
    Input
    $5.00/M
    Cached
    $0.50/M
    Output
    $25.00/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    Structured JSON Output
    Native Web Search
    Try in Playground

    Kimi K2.5

    moonshot
    kimi-k2.5

    Providers

    CanopyWave
    canopywave/kimi-k2.5
    Context Size
    262.1k
    Stability
    stable
    Pricing
    30% off
    Input
    $0.50$0.35
    -30% off
    /M
    Cached
    —/M
    Output
    $2.80$1.96
    -30% off
    /M
    Capabilities
    Streaming
    Vision
    Tools
    JSON Output
    Try in Playground

    Qwen3 Max 2026-01-23

    alibaba
    qwen3-max-2026-01-23

    Providers

    Alibaba Cloud
    alibaba/qwen3-max-2026-01-23
    Context Size
    262.1k
    Stability
    stable
    Pricing
    20% off
    Input
    $1.20$0.96
    -20% off
    /M
    Cached
    $0.24$0.19
    -20% off
    /M
    Output
    $6.00$4.80
    -20% off
    /M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Try in Playground

    Qwen Image Edit Max

    alibaba
    qwen-image-edit-max

    Providers

    Alibaba Cloud
    alibaba/qwen-image-edit-max
    Context Size
    2k
    Stability
    stable
    Pricing
    20% off
    Input
    $0.00$0.00
    -20% off
    /M
    Cached
    —/M
    Output
    $0.00$0.00
    -20% off
    /M
    Per Request
    $0.080/req
    Capabilities
    Vision
    Image Generation
    Try in Playground

    Qwen Image Max 2025-12-30

    alibaba
    qwen-image-max-2025-12-30

    Providers

    Alibaba Cloud
    alibaba/qwen-image-max-2025-12-30
    Context Size
    2k
    Stability
    stable
    Pricing
    Input
    $0.00/M
    Cached
    —/M
    Output
    $0.00/M
    Per Request
    $0.075/req
    Capabilities
    Image Generation
    Try in Playground

    MiniMax M2.1 Lightning

    minimax
    minimax-m2.1-lightning

    Providers

    MiniMax
    minimax/minimax-m2.1-lightning
    Context Size
    196.6k
    Stability
    stable
    Pricing
    Input
    $0.12/M
    Cached
    —/M
    Output
    $0.48/M
    Capabilities
    Streaming
    Reasoning
    Try in Playground

    MiniMax M2.1

    minimax
    minimax-m2.1

    Providers

    CanopyWave
    canopywave/minimax-m2.1
    Context Size
    204.8k
    Stability
    stable
    Pricing
    30% off
    Input
    $0.27$0.19
    -30% off
    /M
    Cached
    $0.07$0.05
    -30% off
    /M
    Output
    $1.08$0.76
    -30% off
    /M
    Capabilities
    Streaming
    Tools
    Reasoning
    JSON Output
    Try in Playground

    GLM-4.7 Flash

    glm
    glm-4.7-flash

    Providers

    Z AI
    zai/glm-4.7-flash
    Context Size
    200k
    Stability
    stable
    Pricing
    Input
    $0.00/M
    Cached
    $0.00/M
    Output
    $0.00/M
    Capabilities
    Streaming
    Tools
    Reasoning
    JSON Output
    Try in Playground

    GLM-4.7 FlashX

    glm
    glm-4.7-flashx

    Providers

    Z AI
    zai/glm-4.7-flashx
    Context Size
    200k
    Stability
    stable
    Pricing
    10% off
    Input
    $0.07$0.06
    -10% off
    /M
    Cached
    $0.01$0.01
    -10% off
    /M
    Output
    $0.40$0.36
    -10% off
    /M
    Capabilities
    Streaming
    Tools
    Reasoning
    JSON Output
    Try in Playground

    GLM-4.7

    glm
    glm-4.7

    Providers

    ByteDance
    bytedance/glm-4.7
    Context Size
    200k
    Stability
    stable
    Pricing
    Input
    $0.60/M
    Cached
    $0.11/M
    Output
    $2.20/M
    Capabilities
    Streaming
    Tools
    Reasoning
    Try in Playground

    Seed 1.8 (251228)

    bytedance
    seed-1-8-251228

    Providers

    ByteDance
    bytedance/seed-1-8-251228
    Context Size
    256k
    Stability
    stable
    Pricing
    Input
    $0.25/M
    Cached
    $0.05/M
    Output
    $2.00/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Try in Playground

    Gemini 3 Flash (Preview)

    google
    gemini-3-flash-preview

    Providers

    Google AI Studio
    google-ai-studio/gemini-3-flash-preview
    Context Size
    1.0M
    Stability
    stable
    Pricing
    Input
    $0.50/M
    Cached
    $0.05/M
    Output
    $3.00/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    JSON Output
    Structured JSON Output
    Native Web Search
    Try in Playground

    GPT-5.2 Chat

    openai
    gpt-5.2-chat-latest

    Providers

    Azure
    azure/gpt-5.2-chat-latest
    Context Size
    128k
    Stability
    unstable
    Pricing
    Input
    $1.75/M
    Cached
    $0.17/M
    Output
    $14.00/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    Try in Playground

    GPT-5.2 Pro

    openai
    gpt-5.2-pro

    Providers

    Azure
    azure/gpt-5.2-pro
    Context Size
    400k
    Stability
    unstable
    Pricing
    Input
    $21.00/M
    Cached
    —/M
    Output
    $168.00/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Try in Playground

    GPT-5.2

    openai
    gpt-5.2

    Providers

    Azure
    azure/gpt-5.2
    Context Size
    400k
    Stability
    stable
    Pricing
    30% off
    Input
    $1.75$1.22
    -30% off
    /M
    Cached
    $0.17$0.12
    -30% off
    /M
    Output
    $14.00$9.80
    -30% off
    /M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    Try in Playground

    Devstral 2

    mistral
    devstral-2512

    Providers

    Mistral AI
    mistral/devstral-2512
    Context Size
    262.1k
    Stability
    stable
    Pricing
    Input
    $0.40/M
    Cached
    —/M
    Output
    $2.00/M
    Capabilities
    Streaming
    JSON Output
    Try in Playground

    GLM-4.6V FlashX

    glm
    glm-4.6v-flashx

    Providers

    Z AI
    zai/glm-4.6v-flashx
    Context Size
    128k
    Stability
    stable
    Pricing
    10% off
    Input
    $0.04$0.04
    -10% off
    /M
    Cached
    $0.00$0.00
    -10% off
    /M
    Output
    $0.40$0.36
    -10% off
    /M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Try in Playground

    GLM-4.6V Flash

    glm
    glm-4.6v-flash

    Providers

    Z AI
    zai/glm-4.6v-flash
    Context Size
    128k
    Stability
    stable
    Pricing
    Input
    $0.00/M
    Cached
    $0.00/M
    Output
    $0.00/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Try in Playground

    GLM-4.6V

    glm
    glm-4.6v

    Providers

    NovitaAI
    novita/glm-4.6v
    Context Size
    131.1k
    Stability
    stable
    Pricing
    Input
    $0.30/M
    Cached
    $0.06/M
    Output
    $0.90/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Try in Playground

    Seedream 4.5

    bytedance
    seedream-4-5

    Providers

    ByteDance
    bytedance/seedream-4-5
    Context Size
    2k
    Stability
    stable
    Pricing
    10% off
    Input
    $0.00$0.00
    -10% off
    /M
    Cached
    —/M
    Output
    $0.00$0.00
    -10% off
    /M
    Per Request
    $0.045/req
    Capabilities
    Image Generation
    Try in Playground

    Ministral 3B

    mistral
    ministral-3b-2512

    Providers

    Mistral AI
    mistral/ministral-3b-2512
    Context Size
    131.1k
    Stability
    stable
    Pricing
    Input
    $0.10/M
    Cached
    —/M
    Output
    $0.10/M
    Capabilities
    Streaming
    Vision
    JSON Output
    Try in Playground

    Ministral 8B

    mistral
    ministral-8b-2512

    Providers

    Mistral AI
    mistral/ministral-8b-2512
    Context Size
    262.1k
    Stability
    stable
    Pricing
    Input
    $0.15/M
    Cached
    —/M
    Output
    $0.15/M
    Capabilities
    Streaming
    Vision
    JSON Output
    Try in Playground

    Ministral 14B

    mistral
    ministral-14b-2512

    Providers

    Mistral AI
    mistral/ministral-14b-2512
    Context Size
    262.1k
    Stability
    stable
    Pricing
    Input
    $0.20/M
    Cached
    —/M
    Output
    $0.20/M
    Capabilities
    Streaming
    Vision
    JSON Output
    Try in Playground

    Mistral Large 3

    mistral
    mistral-large-2512

    Providers

    Mistral AI
    mistral/mistral-large-2512
    Context Size
    262.1k
    Stability
    stable
    Pricing
    Input
    $0.50/M
    Cached
    —/M
    Output
    $1.50/M
    Capabilities
    Streaming
    Vision
    JSON Output
    Try in Playground

    Mistral Large Latest

    mistral
    mistral-large-latest

    Providers

    Mistral AI
    mistral/mistral-large-latest
    Context Size
    128k
    Stability
    stable
    Pricing
    Input
    $4.00/M
    Cached
    —/M
    Output
    $12.00/M
    Capabilities
    Streaming
    Try in Playground

    Claude Opus 4.5

    anthropic
    claude-opus-4-5-20251101

    Providers

    Anthropic
    anthropic/claude-opus-4-5-20251101
    Context Size
    200k
    Stability
    stable
    Pricing
    Input
    $5.00/M
    Cached
    $0.50/M
    Output
    $25.00/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    Structured JSON Output
    Native Web Search
    Try in Playground

    Gemini 3 Pro Image (Preview)

    google
    gemini-3-pro-image-preview

    Providers

    Google AI Studio
    google-ai-studio/gemini-3-pro-image-preview
    Context Size
    65.5k
    Stability
    stable
    Pricing
    Input
    $2.00/M
    Cached
    $0.20/M
    Output
    $12.00/M
    Capabilities
    Streaming
    Vision
    JSON Output
    Structured JSON Output
    Image Generation
    Try in Playground

    Grok 4.1 Fast Non-Reasoning

    xai
    grok-4-1-fast-non-reasoning

    Providers

    xAI
    xai/grok-4-1-fast-non-reasoning
    Context Size
    2M
    Stability
    stable
    Pricing
    Input
    $0.20/M
    Cached
    $0.05/M
    Output
    $0.50/M
    Capabilities
    Streaming
    Vision
    Tools
    JSON Output
    Try in Playground

    Grok 4.1 Fast Reasoning

    xai
    grok-4-1-fast-reasoning

    Providers

    xAI
    xai/grok-4-1-fast-reasoning
    Context Size
    2M
    Stability
    stable
    Pricing
    Input
    $0.20/M
    Cached
    $0.05/M
    Output
    $0.50/M
    Capabilities
    Streaming
    Vision
    Tools
    JSON Output
    Try in Playground

    Gemini 3 Pro (Preview)

    google
    gemini-3-pro-preview

    Providers

    Google AI Studio
    google-ai-studio/gemini-3-pro-preview
    Context Size
    1.0M
    Stability
    stable
    Pricing
    Input
    $2.00/M
    Cached
    $0.20/M
    Output
    $12.00/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    JSON Output
    Structured JSON Output
    Native Web Search
    Try in Playground

    GPT-5.1 Codex

    openai
    gpt-5.1-codex

    Providers

    Azure
    azure/gpt-5.1-codex
    Context Size
    400k
    Stability
    unstable
    Pricing
    Input
    $1.25/M
    Cached
    —/M
    Output
    $10.00/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Try in Playground

    GPT-5.1 Codex mini

    openai
    gpt-5.1-codex-mini

    Providers

    Azure
    azure/gpt-5.1-codex-mini
    Context Size
    400k
    Stability
    unstable
    Pricing
    Input
    $0.25/M
    Cached
    $0.03/M
    Output
    $2.00/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Try in Playground

    Kimi K2 Thinking Turbo

    moonshot
    kimi-k2-thinking-turbo

    Providers

    Moonshot AI
    moonshot/kimi-k2-thinking-turbo
    Context Size
    262.1k
    Stability
    stable
    Pricing
    Input
    $1.15/M
    Cached
    $0.15/M
    Output
    $8.00/M
    Capabilities
    Streaming
    Tools
    Reasoning
    JSON Output
    Try in Playground

    Kimi K2 Thinking

    moonshot
    kimi-k2-thinking

    Providers

    ByteDance
    bytedance/kimi-k2-thinking
    Context Size
    256k
    Stability
    stable
    Pricing
    Input
    $0.60/M
    Cached
    $0.12/M
    Output
    $2.50/M
    Capabilities
    Streaming
    Tools
    Reasoning
    Try in Playground

    GPT-5.1

    openai
    gpt-5.1

    Providers

    Azure
    azure/gpt-5.1
    Context Size
    400k
    Stability
    stable
    Pricing
    30% off
    Input
    $1.25$0.88
    -30% off
    /M
    Cached
    $0.13$0.09
    -30% off
    /M
    Output
    $10.00$7.00
    -30% off
    /M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    Try in Playground

    MiniMax M2

    minimax
    minimax-m2

    Providers

    CanopyWave
    canopywave/minimax-m2
    Context Size
    196.6k
    Stability
    stable
    Deactivated since Jan 1, 2026
    Pricing
    30% off
    Input
    $0.25$0.17
    -30% off
    /M
    Cached
    —/M
    Output
    $1.00$0.70
    -30% off
    /M
    Capabilities
    Streaming
    Tools
    Reasoning
    JSON Output
    Try in Playground

    Qwen3 VL Flash

    alibaba
    qwen3-vl-flash

    Providers

    Alibaba Cloud
    alibaba/qwen3-vl-flash
    Context Size
    262.1k
    Stability
    stable
    Pricing
    20% off
    Input
    $0.05$0.04
    -20% off
    /M
    Cached
    $0.01$0.01
    -20% off
    /M
    Output
    $0.40$0.32
    -20% off
    /M
    Capabilities
    Streaming
    Vision
    Tools
    JSON Output
    Try in Playground

    Claude Haiku 4.5 (2025-10-01)

    anthropic
    claude-haiku-4-5-20251001

    Providers

    Anthropic
    anthropic/claude-haiku-4-5-20251001
    Context Size
    200k
    Stability
    stable
    Pricing
    Input
    $1.00/M
    Cached
    $0.10/M
    Output
    $5.00/M
    Capabilities
    Streaming
    Tools
    Structured JSON Output
    Native Web Search
    Try in Playground

    Claude Haiku 4.5

    anthropic
    claude-haiku-4-5

    Providers

    Anthropic
    anthropic/claude-haiku-4-5
    Context Size
    200k
    Stability
    stable
    Pricing
    Input
    $1.00/M
    Cached
    $0.10/M
    Output
    $5.00/M
    Capabilities
    Streaming
    Tools
    Structured JSON Output
    Native Web Search
    Try in Playground

    Qwen3 VL 8B Instruct

    alibaba
    qwen3-vl-8b-instruct

    Providers

    NovitaAI
    novita/qwen3-vl-8b-instruct
    Context Size
    131.1k
    Stability
    stable
    Pricing
    Input
    $0.08/M
    Cached
    —/M
    Output
    $0.50/M
    Capabilities
    Streaming
    Vision
    JSON Output
    Try in Playground

    Qwen3 VL 30B A3B Thinking

    alibaba
    qwen3-vl-30b-a3b-thinking

    Providers

    NovitaAI
    novita/qwen3-vl-30b-a3b-thinking
    Context Size
    131.1k
    Stability
    stable
    Pricing
    Input
    $0.20/M
    Cached
    —/M
    Output
    $1.00/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Try in Playground

    Grok 4 Fast Non-Reasoning

    xai
    grok-4-fast-non-reasoning

    Providers

    xAI
    xai/grok-4-fast-non-reasoning
    Context Size
    2M
    Stability
    stable
    Pricing
    Input
    $0.20/M
    Cached
    $0.05/M
    Output
    $0.50/M
    Capabilities
    Streaming
    Vision
    Tools
    JSON Output
    Try in Playground

    Qwen3 VL 30B A3B Instruct

    alibaba
    qwen3-vl-30b-a3b-instruct

    Providers

    NovitaAI
    novita/qwen3-vl-30b-a3b-instruct
    Context Size
    131.1k
    Stability
    stable
    Pricing
    Input
    $0.20/M
    Cached
    —/M
    Output
    $0.70/M
    Capabilities
    Streaming
    Vision
    Tools
    Try in Playground

    Gemini 2.5 Flash Image

    google
    gemini-2.5-flash-image

    Providers

    Google AI Studio
    google-ai-studio/gemini-2.5-flash-image
    Context Size
    32.8k
    Stability
    stable
    Pricing
    Input
    $0.30/M
    Cached
    $0.03/M
    Output
    $30.00/M
    Capabilities
    Streaming
    Vision
    JSON Output
    Structured JSON Output
    Image Generation
    Try in Playground

    Gemini 2.5 Flash Image (Preview)

    google
    gemini-2.5-flash-image-preview

    Providers

    Google Vertex AI
    google-vertex/gemini-2.5-flash-image-preview
    Context Size
    32.8k
    Stability
    stable
    Pricing
    Input
    $0.30/M
    Cached
    —/M
    Output
    $2.50/M
    Capabilities
    Streaming
    Vision
    JSON Output
    Structured JSON Output
    Image Generation
    Try in Playground

    GLM-4.6

    glm
    glm-4.6

    Providers

    CanopyWave
    canopywave/glm-4.6
    Context Size
    202.8k
    Stability
    stable
    Deactivated since Jan 1, 2026
    Pricing
    30% off
    Input
    $0.45$0.32
    -30% off
    /M
    Cached
    —/M
    Output
    $1.50$1.05
    -30% off
    /M
    Capabilities
    Streaming
    Tools
    Reasoning
    JSON Output
    Try in Playground

    Claude Sonnet 4.5 (2025-09-29)

    anthropic
    claude-sonnet-4-5-20250929

    Providers

    Anthropic
    anthropic/claude-sonnet-4-5-20250929
    Context Size
    200k
    Stability
    stable
    Pricing
    Input
    $3.00/M
    Cached
    $0.30/M
    Output
    $15.00/M
    Capabilities
    Streaming
    Tools
    Reasoning
    Reasoning Budget
    Structured JSON Output
    Native Web Search
    Try in Playground

    DeepSeek V3.2

    deepseek
    deepseek-v3.2

    Providers

    ByteDance
    bytedance/deepseek-v3.2
    Context Size
    131.1k
    Stability
    stable
    Pricing
    Input
    $0.28/M
    Cached
    $0.06/M
    Output
    $0.42/M
    Capabilities
    Streaming
    Tools
    Reasoning
    Try in Playground

    Claude Sonnet 4.5

    anthropic
    claude-sonnet-4-5

    Providers

    Anthropic
    anthropic/claude-sonnet-4-5
    Context Size
    200k
    Stability
    stable
    Pricing
    Input
    $3.00/M
    Cached
    $0.30/M
    Output
    $15.00/M
    Capabilities
    Streaming
    Tools
    Reasoning
    Reasoning Budget
    Structured JSON Output
    Native Web Search
    Try in Playground

    Gemini 2.5 Flash Lite Preview (09-2025)

    google
    gemini-2.5-flash-lite-preview-09-2025

    Providers

    Google AI Studio
    google-ai-studio/gemini-2.5-flash-lite-preview-09-2025
    Context Size
    1.0M
    Stability
    stable
    Pricing
    Input
    $0.10/M
    Cached
    $0.01/M
    Output
    $0.40/M
    Capabilities
    Streaming
    Vision
    Tools
    JSON Output
    Structured JSON Output
    Try in Playground

    Gemini 2.5 Flash Preview (09-2025)

    googleModel Deactivated
    gemini-2.5-flash-preview-09-2025

    Providers

    Google AI Studio
    google-ai-studio/gemini-2.5-flash-preview-09-2025
    Context Size
    1.0M
    Stability
    stable
    Deactivated since Jan 17, 2026
    Pricing
    Input
    $0.30/M
    Cached
    $0.03/M
    Output
    $2.50/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    JSON Output
    Structured JSON Output
    Try in Playground

    Qwen3 Max

    alibaba
    qwen3-max

    Providers

    Alibaba Cloud
    alibaba/qwen3-max
    Context Size
    256k
    Stability
    stable
    Pricing
    20% off
    Input
    $3.00$2.40
    -20% off
    /M
    Cached
    $0.60$0.48
    -20% off
    /M
    Output
    $15.00$12.00
    -20% off
    /M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Try in Playground

    Qwen3 VL 235B A22B Thinking

    alibaba
    qwen3-vl-235b-a22b-thinking

    Providers

    Alibaba Cloud
    alibaba/qwen3-vl-235b-a22b-thinking
    Context Size
    131.1k
    Stability
    stable
    Pricing
    20% off
    Input
    $0.50$0.40
    -20% off
    /M
    Cached
    —/M
    Output
    $2.00$1.60
    -20% off
    /M
    Capabilities
    Streaming
    Vision
    Reasoning
    Try in Playground

    Qwen3 VL 235B A22B Instruct

    alibaba
    qwen3-vl-235b-a22b-instruct

    Providers

    Alibaba Cloud
    alibaba/qwen3-vl-235b-a22b-instruct
    Context Size
    131.1k
    Stability
    stable
    Pricing
    20% off
    Input
    $0.50$0.40
    -20% off
    /M
    Cached
    —/M
    Output
    $2.00$1.60
    -20% off
    /M
    Capabilities
    Streaming
    Vision
    Tools
    JSON Output
    Try in Playground

    Qwen3 VL Plus

    alibaba
    qwen3-vl-plus

    Providers

    Alibaba Cloud
    alibaba/qwen3-vl-plus
    Context Size
    262.1k
    Stability
    stable
    Pricing
    20% off
    Input
    $0.20$0.16
    -20% off
    /M
    Cached
    $0.04$0.03
    -20% off
    /M
    Output
    $1.60$1.28
    -20% off
    /M
    Capabilities
    Streaming
    Vision
    JSON Output
    Try in Playground

    Qwen3 Coder Plus

    alibaba
    qwen3-coder-plus

    Providers

    Alibaba Cloud
    alibaba/qwen3-coder-plus
    Context Size
    1M
    Stability
    stable
    Pricing
    20% off
    Input
    $6.00$4.80
    -20% off
    /M
    Cached
    —/M
    Output
    $60.00$48.00
    -20% off
    /M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    Seedream 4.0

    bytedance
    seedream-4-0

    Providers

    ByteDance
    bytedance/seedream-4-0
    Context Size
    2k
    Stability
    stable
    Pricing
    10% off
    Input
    $0.00$0.00
    -10% off
    /M
    Cached
    —/M
    Output
    $0.00$0.00
    -10% off
    /M
    Per Request
    $0.035/req
    Capabilities
    Image Generation
    Try in Playground

    Seed 1.6 (250915)

    bytedance
    seed-1-6-250915

    Providers

    ByteDance
    bytedance/seed-1-6-250915
    Context Size
    256k
    Stability
    stable
    Pricing
    Input
    $0.25/M
    Cached
    $0.05/M
    Output
    $2.00/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Try in Playground

    Qwen3 Next 80B A3B Instruct

    alibaba
    qwen3-next-80b-a3b-instruct

    Providers

    Alibaba Cloud
    alibaba/qwen3-next-80b-a3b-instruct
    Context Size
    129.0k
    Stability
    stable
    Pricing
    20% off
    Input
    $0.50$0.40
    -20% off
    /M
    Cached
    —/M
    Output
    $2.00$1.60
    -20% off
    /M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    Qwen3 Next 80B A3B Thinking

    alibaba
    qwen3-next-80b-a3b-thinking

    Providers

    Alibaba Cloud
    alibaba/qwen3-next-80b-a3b-thinking
    Context Size
    131.1k
    Stability
    unstable
    Pricing
    20% off
    Input
    $0.50$0.40
    -20% off
    /M
    Cached
    —/M
    Output
    $6.00$4.80
    -20% off
    /M
    Capabilities
    Streaming
    Tools
    Reasoning
    Try in Playground

    Qwen Max

    alibaba
    qwen-max

    Providers

    Alibaba Cloud
    alibaba/qwen-max
    Context Size
    131.1k
    Stability
    stable
    Pricing
    20% off
    Input
    $1.60$1.28
    -20% off
    /M
    Cached
    —/M
    Output
    $6.40$5.12
    -20% off
    /M
    Capabilities
    Streaming
    Vision
    Tools
    JSON Output
    Try in Playground

    Grok Code Fast 1

    xai
    grok-code-fast-1

    Providers

    xAI
    xai/grok-code-fast-1
    Context Size
    256k
    Stability
    stable
    Pricing
    Input
    $0.20/M
    Cached
    —/M
    Output
    $1.50/M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    Gemini 2.5 Flash

    google
    gemini-2.5-flash

    Providers

    Google AI Studio
    google-ai-studio/gemini-2.5-flash
    Context Size
    1.0M
    Stability
    stable
    Pricing
    Input
    $0.30/M
    Cached
    $0.03/M
    Output
    $2.50/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    JSON Output
    Structured JSON Output
    Native Web Search
    Try in Playground

    DeepSeek V3.1

    deepseek
    deepseek-v3.1

    Providers

    ByteDance
    bytedance/deepseek-v3.1
    Context Size
    128k
    Stability
    stable
    Pricing
    Input
    $0.56/M
    Cached
    $0.11/M
    Output
    $1.68/M
    Capabilities
    Streaming
    Tools
    Reasoning
    Try in Playground

    Qwen Image Edit Plus

    alibaba
    qwen-image-edit-plus

    Providers

    Alibaba Cloud
    alibaba/qwen-image-edit-plus
    Context Size
    2k
    Stability
    stable
    Pricing
    20% off
    Input
    $0.00$0.00
    -20% off
    /M
    Cached
    —/M
    Output
    $0.00$0.00
    -20% off
    /M
    Per Request
    $0.040/req
    Capabilities
    Vision
    Image Generation
    Try in Playground

    GLM-4.5 Flash

    glm
    glm-4.5-flash

    Providers

    Z AI
    zai/glm-4.5-flash
    Context Size
    128k
    Stability
    stable
    Pricing
    Input
    $0.00/M
    Cached
    $0.00/M
    Output
    $0.00/M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    GLM-4.5V

    glm
    glm-4.5v

    Providers

    NovitaAI
    novita/glm-4.5v
    Context Size
    65.5k
    Stability
    stable
    Pricing
    Input
    $0.60/M
    Cached
    $0.11/M
    Output
    $1.80/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Try in Playground

    Claude Opus 4.1

    anthropic
    claude-opus-4-1-20250805

    Providers

    Anthropic
    anthropic/claude-opus-4-1-20250805
    Context Size
    200k
    Stability
    stable
    Pricing
    Input
    $15.00/M
    Cached
    $1.50/M
    Output
    $75.00/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    Structured JSON Output
    Native Web Search
    Try in Playground

    GPT OSS 20B

    openai
    gpt-oss-20b

    Providers

    Groq
    groq/gpt-oss-20b
    Context Size
    131.1k
    Stability
    stable
    Pricing
    Input
    $0.10/M
    Cached
    —/M
    Output
    $0.50/M
    Capabilities
    Streaming
    Tools
    Reasoning
    JSON Output
    Try in Playground

    GPT OSS 120B

    openai
    gpt-oss-120b

    Providers

    ByteDance
    bytedance/gpt-oss-120b
    Context Size
    128k
    Stability
    stable
    Pricing
    Input
    $0.10/M
    Cached
    $0.02/M
    Output
    $0.50/M
    Capabilities
    Streaming
    Tools
    Reasoning
    Try in Playground

    Qwen Image

    alibaba
    qwen-image

    Providers

    Alibaba Cloud
    alibaba/qwen-image
    Context Size
    2k
    Stability
    stable
    Pricing
    20% off
    Input
    $0.00$0.00
    -20% off
    /M
    Cached
    —/M
    Output
    $0.00$0.00
    -20% off
    /M
    Per Request
    $0.035/req
    Capabilities
    Image Generation
    Try in Playground

    Qwen Image Max

    alibaba
    qwen-image-max

    Providers

    Alibaba Cloud
    alibaba/qwen-image-max
    Context Size
    2k
    Stability
    stable
    Pricing
    Input
    $0.00/M
    Cached
    —/M
    Output
    $0.00/M
    Per Request
    $0.075/req
    Capabilities
    Image Generation
    Try in Playground

    Qwen Image Plus

    alibaba
    qwen-image-plus

    Providers

    Alibaba Cloud
    alibaba/qwen-image-plus
    Context Size
    2k
    Stability
    stable
    Pricing
    20% off
    Input
    $0.00$0.00
    -20% off
    /M
    Cached
    —/M
    Output
    $0.00$0.00
    -20% off
    /M
    Per Request
    $0.030/req
    Capabilities
    Image Generation
    Try in Playground

    GPT-5 Pro

    openai
    gpt-5-pro

    Providers

    OpenAI
    openai/gpt-5-pro
    Context Size
    400k
    Stability
    stable
    Pricing
    Input
    $15.00/M
    Cached
    —/M
    Output
    $120.00/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    Native Web Search
    Try in Playground

    GPT-5 Chat Latest

    openai
    gpt-5-chat-latest

    Providers

    OpenAI
    openai/gpt-5-chat-latest
    Context Size
    400k
    Stability
    stable
    Pricing
    Input
    $1.25/M
    Cached
    $0.13/M
    Output
    $10.00/M
    Capabilities
    Streaming
    Vision
    JSON Output
    Structured JSON Output
    Try in Playground

    GPT-5 Nano

    openai
    gpt-5-nano

    Providers

    Azure
    azure/gpt-5-nano
    Context Size
    400k
    Stability
    unstable
    Pricing
    Input
    $0.05/M
    Cached
    $0.01/M
    Output
    $0.40/M
    Capabilities
    Streaming
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    Try in Playground

    GPT-5 Mini

    openai
    gpt-5-mini

    Providers

    Azure
    azure/gpt-5-mini
    Context Size
    400k
    Stability
    unstable
    Pricing
    Input
    $0.25/M
    Cached
    $0.03/M
    Output
    $2.00/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    Try in Playground

    GPT-5

    openai
    gpt-5

    Providers

    Azure
    azure/gpt-5
    Context Size
    400k
    Stability
    unstable
    Pricing
    Input
    $1.25/M
    Cached
    $0.13/M
    Output
    $10.00/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    Try in Playground

    Qwen3 Coder 30B A3B Instruct

    alibaba
    qwen3-coder-30b-a3b-instruct

    Providers

    Nebius AI
    nebius/qwen3-coder-30b-a3b-instruct
    Context Size
    262k
    Stability
    stable
    Pricing
    Input
    $0.10/M
    Cached
    —/M
    Output
    $0.30/M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    Codestral

    mistral
    codestral-2508

    Providers

    Mistral AI
    mistral/codestral-2508
    Context Size
    256k
    Stability
    stable
    Pricing
    Input
    $0.30/M
    Cached
    —/M
    Output
    $0.90/M
    Capabilities
    Streaming
    JSON Output
    Try in Playground

    Qwen3 30B A3B Thinking 2507

    alibaba
    qwen3-30b-a3b-thinking-2507

    Providers

    Nebius AI
    nebius/qwen3-30b-a3b-thinking-2507
    Context Size
    262k
    Stability
    stable
    Pricing
    Input
    $0.10/M
    Cached
    —/M
    Output
    $0.30/M
    Capabilities
    Streaming
    Tools
    Reasoning
    JSON Output
    Try in Playground

    Qwen3 30B A3B Instruct 2507

    alibaba
    qwen3-30b-a3b-instruct-2507

    Providers

    Nebius AI
    nebius/qwen3-30b-a3b-instruct-2507
    Context Size
    262k
    Stability
    stable
    Pricing
    Input
    $0.10/M
    Cached
    —/M
    Output
    $0.30/M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    GLM-4.5 AirX

    glm
    glm-4.5-airx

    Providers

    Z AI
    zai/glm-4.5-airx
    Context Size
    128k
    Stability
    stable
    Pricing
    10% off
    Input
    $1.10$0.99
    -10% off
    /M
    Cached
    $0.22$0.20
    -10% off
    /M
    Output
    $4.50$4.05
    -10% off
    /M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    GLM-4.5 X

    glm
    glm-4.5-x

    Providers

    Z AI
    zai/glm-4.5-x
    Context Size
    128k
    Stability
    unstable
    Pricing
    10% off
    Input
    $2.20$1.98
    -10% off
    /M
    Cached
    $0.45$0.41
    -10% off
    /M
    Output
    $8.90$8.01
    -10% off
    /M
    Capabilities
    Streaming
    Tools
    Reasoning
    JSON Output
    Try in Playground

    GLM-4.5

    glm
    glm-4.5

    Providers

    Z AI
    zai/glm-4.5
    Context Size
    128k
    Stability
    stable
    Pricing
    10% off
    Input
    $0.60$0.54
    -10% off
    /M
    Cached
    $0.11$0.10
    -10% off
    /M
    Output
    $2.20$1.98
    -10% off
    /M
    Capabilities
    Streaming
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Try in Playground

    Seed 1.6 Flash (250715)

    bytedance
    seed-1-6-flash-250715

    Providers

    ByteDance
    bytedance/seed-1-6-flash-250715
    Context Size
    256k
    Stability
    stable
    Pricing
    Input
    $0.07/M
    Cached
    $0.02/M
    Output
    $0.30/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Try in Playground

    GLM-4.5 Air

    glm
    glm-4.5-air

    Providers

    Z AI
    zai/glm-4.5-air
    Context Size
    128k
    Stability
    stable
    Pricing
    10% off
    Input
    $0.20$0.18
    -10% off
    /M
    Cached
    $0.03$0.03
    -10% off
    /M
    Output
    $1.10$0.99
    -10% off
    /M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    Qwen3 235B A22B Thinking 2507

    alibaba
    qwen3-235b-a22b-thinking-2507

    Providers

    Nebius AI
    nebius/qwen3-235b-a22b-thinking-2507
    Context Size
    262k
    Stability
    unstable
    Pricing
    Input
    $0.20/M
    Cached
    —/M
    Output
    $0.60/M
    Capabilities
    Streaming
    Tools
    Reasoning
    JSON Output
    Try in Playground

    Qwen3 Coder

    alibabaModel Deactivated
    qwen3-coder

    Providers

    CanopyWave
    canopywave/qwen3-coder
    Context Size
    262k
    Stability
    stable
    Deactivated since Feb 1, 2026
    Pricing
    30% off
    Input
    $0.22$0.15
    -30% off
    /M
    Cached
    —/M
    Output
    $0.95$0.66
    -30% off
    /M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    Qwen3 Coder Flash

    alibaba
    qwen3-coder-flash

    Providers

    Alibaba Cloud
    alibaba/qwen3-coder-flash
    Context Size
    1M
    Stability
    stable
    Pricing
    20% off
    Input
    $0.30$0.24
    -20% off
    /M
    Cached
    $0.06$0.05
    -20% off
    /M
    Output
    $1.50$1.20
    -20% off
    /M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    Gemini 2.5 Flash Lite

    google
    gemini-2.5-flash-lite

    Providers

    Google AI Studio
    google-ai-studio/gemini-2.5-flash-lite
    Context Size
    1.0M
    Stability
    stable
    Pricing
    Input
    $0.10/M
    Cached
    $0.01/M
    Output
    $0.40/M
    Capabilities
    Streaming
    Vision
    Tools
    JSON Output
    Structured JSON Output
    Try in Playground

    Devstral Small 1.1

    mistral
    devstral-small-2507

    Providers

    Mistral AI
    mistral/devstral-small-2507
    Context Size
    131.1k
    Stability
    stable
    Pricing
    Input
    $0.10/M
    Cached
    —/M
    Output
    $0.30/M
    Capabilities
    Streaming
    JSON Output
    Try in Playground

    Qwen3 235B A22B Instruct 2507

    alibaba
    qwen3-235b-a22b-instruct-2507

    Providers

    Cerebras
    cerebras/qwen3-235b-a22b-instruct-2507
    Context Size
    262k
    Stability
    stable
    Pricing
    Input
    $0.60/M
    Cached
    —/M
    Output
    $1.20/M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    Kimi K2

    moonshot
    kimi-k2

    Providers

    ByteDance
    bytedance/kimi-k2
    Context Size
    256k
    Stability
    stable
    Pricing
    Input
    $0.60/M
    Cached
    $0.12/M
    Output
    $2.50/M
    Capabilities
    Streaming
    Tools
    Try in Playground

    Grok 4 Fast Reasoning

    xai
    grok-4-fast-reasoning

    Providers

    xAI
    xai/grok-4-fast-reasoning
    Context Size
    2M
    Stability
    stable
    Pricing
    Input
    $0.20/M
    Cached
    $0.05/M
    Output
    $0.50/M
    Capabilities
    Streaming
    Vision
    Tools
    JSON Output
    Try in Playground

    Grok 4

    xai
    grok-4

    Providers

    xAI
    xai/grok-4
    Context Size
    256k
    Stability
    stable
    Pricing
    Input
    $3.00/M
    Cached
    —/M
    Output
    $15.00/M
    Capabilities
    Streaming
    Vision
    Tools
    JSON Output
    Try in Playground

    Grok 4 (0709)

    xai
    grok-4-0709

    Providers

    xAI
    xai/grok-4-0709
    Context Size
    256k
    Stability
    stable
    Pricing
    Input
    $3.00/M
    Cached
    —/M
    Output
    $15.00/M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    Gemma 3n E4B IT

    google
    gemma-3n-e4b-it

    Providers

    Google AI Studio
    google-ai-studio/gemma-3n-e4b-it
    Context Size
    1M
    Stability
    stable
    Pricing
    Input
    $0.07/M
    Cached
    —/M
    Output
    $0.30/M
    Capabilities
    Streaming
    Try in Playground

    Gemma 3n E2B IT

    google
    gemma-3n-e2b-it

    Providers

    Google AI Studio
    google-ai-studio/gemma-3n-e2b-it
    Context Size
    1M
    Stability
    stable
    Pricing
    Input
    $0.07/M
    Cached
    —/M
    Output
    $0.30/M
    Capabilities
    Streaming
    Try in Playground

    Seed 1.6 (250615)

    bytedance
    seed-1-6-250615

    Providers

    ByteDance
    bytedance/seed-1-6-250615
    Context Size
    256k
    Stability
    stable
    Pricing
    Input
    $0.25/M
    Cached
    $0.05/M
    Output
    $2.00/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Try in Playground

    Mistral Small 3.2

    mistral
    mistral-small-2506

    Providers

    Mistral AI
    mistral/mistral-small-2506
    Context Size
    128k
    Stability
    stable
    Pricing
    Input
    $0.10/M
    Cached
    —/M
    Output
    $0.30/M
    Capabilities
    Streaming
    Vision
    JSON Output
    Try in Playground

    Gemini 2.5 Pro Preview (06-05)

    googleModel Deactivated
    gemini-2.5-pro-preview-06-05

    Providers

    Google AI Studio
    google-ai-studio/gemini-2.5-pro-preview-06-05
    Context Size
    1M
    Stability
    stable
    Deactivated since Jul 15, 2025
    Pricing
    Input
    $1.25/M
    Cached
    —/M
    Output
    $10.00/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    JSON Output
    Structured JSON Output
    Try in Playground

    o3 Mini

    openai
    o3-mini

    Providers

    Azure
    azure/o3-mini
    Context Size
    200k
    Stability
    unstable
    Pricing
    Input
    $1.10/M
    Cached
    —/M
    Output
    $4.40/M
    Capabilities
    Streaming
    JSON Output
    Structured JSON Output
    Try in Playground

    o3

    openai
    o3

    Providers

    Azure
    azure/o3
    Context Size
    200k
    Stability
    unstable
    Pricing
    Input
    $2.00/M
    Cached
    —/M
    Output
    $8.00/M
    Capabilities
    Vision
    JSON Output
    Structured JSON Output
    Try in Playground

    DeepSeek R1 (0528)

    deepseek
    deepseek-r1-0528

    Providers

    DeepSeek
    deepseek/deepseek-r1-0528
    Context Size
    64k
    Stability
    stable
    Pricing
    Input
    $0.55/M
    Cached
    —/M
    Output
    $2.19/M
    Capabilities
    Streaming
    Try in Playground

    Claude Opus 4 (2025-05-14)

    anthropic
    claude-opus-4-20250514

    Providers

    Anthropic
    anthropic/claude-opus-4-20250514
    Context Size
    200k
    Stability
    stable
    Pricing
    Input
    $15.00/M
    Cached
    $1.50/M
    Output
    $75.00/M
    Capabilities
    Streaming
    Tools
    Reasoning
    Reasoning Budget
    Native Web Search
    Try in Playground

    Gemini 2.5 Flash Preview (05-20)

    googleModel Deactivated
    gemini-2.5-flash-preview-05-20

    Providers

    Google AI Studio
    google-ai-studio/gemini-2.5-flash-preview-05-20
    Context Size
    1M
    Stability
    stable
    Deactivated since Jul 15, 2025
    Pricing
    Input
    $0.15/M
    Cached
    —/M
    Output
    $0.60/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    JSON Output
    Structured JSON Output
    Try in Playground

    Claude Sonnet 4 (2025-05-14)

    anthropic
    claude-sonnet-4-20250514

    Providers

    Anthropic
    anthropic/claude-sonnet-4-20250514
    Context Size
    200k
    Stability
    stable
    Pricing
    Input
    $3.00/M
    Cached
    $0.30/M
    Output
    $15.00/M
    Capabilities
    Streaming
    Tools
    Reasoning
    Reasoning Budget
    Native Web Search
    Try in Playground

    Gemini 2.5 Pro Preview (05-06)

    googleModel Deactivated
    gemini-2.5-pro-preview-05-06

    Providers

    Google AI Studio
    google-ai-studio/gemini-2.5-pro-preview-05-06
    Context Size
    1M
    Stability
    stable
    Deactivated since Jul 15, 2025
    Pricing
    Input
    $1.25/M
    Cached
    —/M
    Output
    $10.00/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    JSON Output
    Structured JSON Output
    Try in Playground

    Llama Guard 4 12B

    meta
    llama-guard-4-12b

    Providers

    Groq
    groq/llama-guard-4-12b
    Context Size
    131.1k
    Stability
    stable
    Pricing
    Input
    $0.20/M
    Cached
    —/M
    Output
    $0.20/M
    Capabilities
    Streaming
    Try in Playground

    Qwen3 4B FP8

    alibaba
    qwen3-4b-fp8

    Providers

    NovitaAI
    novita/qwen3-4b-fp8
    Context Size
    128k
    Stability
    stable
    Pricing
    Input
    $0.03/M
    Cached
    —/M
    Output
    $0.03/M
    Capabilities
    Streaming
    Try in Playground

    Qwen3 30B A3B FP8

    alibaba
    qwen3-30b-a3b-fp8

    Providers

    NovitaAI
    novita/qwen3-30b-a3b-fp8
    Context Size
    41.0k
    Stability
    stable
    Pricing
    Input
    $0.09/M
    Cached
    —/M
    Output
    $0.45/M
    Capabilities
    Streaming
    Try in Playground

    Qwen3 32B FP8

    alibaba
    qwen3-32b-fp8

    Providers

    NovitaAI
    novita/qwen3-32b-fp8
    Context Size
    41.0k
    Stability
    stable
    Pricing
    Input
    $0.10/M
    Cached
    —/M
    Output
    $0.45/M
    Capabilities
    Streaming
    Try in Playground

    Qwen3 235B A22B FP8

    alibaba
    qwen3-235b-a22b-fp8

    Providers

    NovitaAI
    novita/qwen3-235b-a22b-fp8
    Context Size
    41.0k
    Stability
    stable
    Pricing
    Input
    $0.20/M
    Cached
    —/M
    Output
    $0.80/M
    Capabilities
    Streaming
    JSON Output
    Try in Playground

    Qwen3 30B A3B

    alibabaModel Deactivated
    qwen3-30b-a3b

    Providers

    Nebius AI
    nebius/qwen3-30b-a3b
    Context Size
    32.8k
    Stability
    stable
    Deactivated since Nov 3, 2025
    Pricing
    Input
    $0.10/M
    Cached
    —/M
    Output
    $0.30/M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    Qwen3 32B

    alibaba
    qwen3-32b

    Providers

    Cerebras
    cerebras/qwen3-32b
    Context Size
    32.8k
    Stability
    stable
    Deactivated since Feb 16, 2026
    Pricing
    Input
    $0.40/M
    Cached
    —/M
    Output
    $0.80/M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    Qwen3 14B

    alibabaModel Deactivated
    qwen3-14b

    Providers

    Nebius AI
    nebius/qwen3-14b
    Context Size
    32.8k
    Stability
    stable
    Deactivated since Nov 3, 2025
    Pricing
    Input
    $0.08/M
    Cached
    —/M
    Output
    $0.24/M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    Gemini 2.5 Flash Preview Thinking (04-17)

    googleModel Deactivated
    gemini-2.5-flash-preview-04-17-thinking

    Providers

    Google AI Studio
    google-ai-studio/gemini-2.5-flash-preview-04-17-thinking
    Context Size
    1M
    Stability
    stable
    Deactivated since Jul 22, 2025
    Pricing
    Input
    $0.15/M
    Cached
    —/M
    Output
    $0.60/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    JSON Output
    Structured JSON Output
    Try in Playground

    Gemini 2.5 Flash Preview (04-17)

    googleModel Deactivated
    gemini-2.5-flash-preview-04-17

    Providers

    Google AI Studio
    google-ai-studio/gemini-2.5-flash-preview-04-17
    Context Size
    1M
    Stability
    stable
    Deactivated since Jul 15, 2025
    Pricing
    Input
    $0.15/M
    Cached
    —/M
    Output
    $0.60/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    JSON Output
    Structured JSON Output
    Try in Playground

    GLM-4 32B (0414-128k)

    glm
    glm-4-32b-0414-128k

    Providers

    Z AI
    zai/glm-4-32b-0414-128k
    Context Size
    128k
    Stability
    stable
    Pricing
    10% off
    Input
    $0.10$0.09
    -10% off
    /M
    Cached
    $0.00$0.00
    -10% off
    /M
    Output
    $0.10$0.09
    -10% off
    /M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    GPT-4.1 Nano

    openai
    gpt-4.1-nano

    Providers

    Azure
    azure/gpt-4.1-nano
    Context Size
    1M
    Stability
    unstable
    Pricing
    Input
    $0.10/M
    Cached
    —/M
    Output
    $0.40/M
    Capabilities
    Streaming
    Vision
    Tools
    JSON Output
    Structured JSON Output
    Try in Playground

    GPT-4.1 Mini

    openai
    gpt-4.1-mini

    Providers

    Azure
    azure/gpt-4.1-mini
    Context Size
    1M
    Stability
    unstable
    Pricing
    Input
    $0.40/M
    Cached
    —/M
    Output
    $1.60/M
    Capabilities
    Streaming
    Vision
    Tools
    JSON Output
    Structured JSON Output
    Try in Playground

    GPT-4.1

    openai
    gpt-4.1

    Providers

    Azure
    azure/gpt-4.1
    Context Size
    1M
    Stability
    unstable
    Pricing
    Input
    $2.00/M
    Cached
    —/M
    Output
    $8.00/M
    Capabilities
    Streaming
    Vision
    Tools
    JSON Output
    Structured JSON Output
    Try in Playground

    Llama 3.1 Nemotron Ultra 253B

    meta
    llama-3.1-nemotron-ultra-253b

    Providers

    Nebius AI
    nebius/llama-3.1-nemotron-ultra-253b
    Context Size
    128k
    Stability
    stable
    Pricing
    Input
    $0.60/M
    Cached
    —/M
    Output
    $1.80/M
    Capabilities
    Streaming
    JSON Output
    Try in Playground

    Llama 4 Maverick 17B Instruct

    meta
    llama-4-maverick-17b-instruct

    Providers

    AWS Bedrock
    aws-bedrock/llama-4-maverick-17b-instruct
    Context Size
    8.2k
    Stability
    unstable
    Pricing
    Input
    $0.24/M
    Cached
    —/M
    Output
    $0.97/M
    Capabilities
    Streaming
    Vision
    Try in Playground

    Llama 4 Scout 17B Instruct

    meta
    llama-4-scout-17b-instruct

    Providers

    AWS Bedrock
    aws-bedrock/llama-4-scout-17b-instruct
    Context Size
    8.2k
    Stability
    unstable
    Pricing
    Input
    $0.17/M
    Cached
    —/M
    Output
    $0.66/M
    Capabilities
    Streaming
    Vision
    Try in Playground

    Llama 4 Scout

    meta
    llama-4-scout

    Providers

    Together AI
    together.ai/llama-4-scout
    Context Size
    32.8k
    Stability
    unstable
    Pricing
    Input
    $0.18/M
    Cached
    —/M
    Output
    $0.59/M
    Capabilities
    Streaming
    Tools
    Try in Playground

    Llama 3 8B Instruct

    meta
    llama-3-8b-instruct

    Providers

    NovitaAI
    novita/llama-3-8b-instruct
    Context Size
    8.2k
    Stability
    stable
    Pricing
    Input
    $0.04/M
    Cached
    —/M
    Output
    $0.04/M
    Capabilities
    Streaming
    JSON Output
    Try in Playground

    Qwen Omni Turbo

    alibaba
    qwen-omni-turbo

    Providers

    Alibaba Cloud
    alibaba/qwen-omni-turbo
    Context Size
    32.8k
    Stability
    stable
    Pricing
    20% off
    Input
    $0.20$0.16
    -20% off
    /M
    Cached
    —/M
    Output
    $0.80$0.64
    -20% off
    /M
    Capabilities
    Streaming
    Vision
    JSON Output
    Try in Playground

    Gemini 2.5 Pro

    google
    gemini-2.5-pro

    Providers

    Google AI Studio
    google-ai-studio/gemini-2.5-pro
    Context Size
    1.0M
    Stability
    stable
    Pricing
    Input
    $1.25/M
    Cached
    $0.13/M
    Output
    $10.00/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    JSON Output
    Structured JSON Output
    Native Web Search
    Try in Playground

    Gemma 3 27B

    google
    gemma-3-27b

    Providers

    Nebius AI
    nebius/gemma-3-27b
    Context Size
    128k
    Stability
    stable
    Pricing
    Input
    $0.27/M
    Cached
    —/M
    Output
    $0.27/M
    Capabilities
    Streaming
    Vision
    Try in Playground

    Gemma 3 1B IT

    google
    gemma-3-1b-it

    Providers

    Google AI Studio
    google-ai-studio/gemma-3-1b-it
    Context Size
    1M
    Stability
    stable
    Pricing
    Input
    $0.07/M
    Cached
    —/M
    Output
    $0.30/M
    Capabilities
    Streaming
    Try in Playground

    Gemma 3 12B IT

    google
    gemma-3-12b-it

    Providers

    Google AI Studio
    google-ai-studio/gemma-3-12b-it
    Context Size
    1M
    Stability
    stable
    Pricing
    Input
    $0.07/M
    Cached
    —/M
    Output
    $0.30/M
    Capabilities
    Streaming
    Try in Playground

    Gemma 3 4B IT

    google
    gemma-3-4b-it

    Providers

    Google AI Studio
    google-ai-studio/gemma-3-4b-it
    Context Size
    1M
    Stability
    stable
    Pricing
    Input
    $0.07/M
    Cached
    —/M
    Output
    $0.30/M
    Capabilities
    Streaming
    Try in Playground

    Sonar Pro

    perplexity
    sonar-pro

    Providers

    Perplexity
    perplexity/sonar-pro
    Context Size
    200k
    Stability
    stable
    Pricing
    Input
    $3.00/M
    Cached
    —/M
    Output
    $15.00/M
    Per Request
    $0.005/req
    Capabilities
    Streaming
    Structured JSON Output
    Try in Playground

    Sonar Reasoning Pro

    perplexity
    sonar-reasoning-pro

    Providers

    Perplexity
    perplexity/sonar-reasoning-pro
    Context Size
    128k
    Stability
    stable
    Pricing
    Input
    $2.00/M
    Cached
    —/M
    Output
    $8.00/M
    Per Request
    $0.005/req
    Capabilities
    Streaming
    Structured JSON Output
    Try in Playground

    QwQ Plus

    alibaba
    qwq-plus

    Providers

    Alibaba Cloud
    alibaba/qwq-plus
    Context Size
    131.1k
    Stability
    stable
    Pricing
    20% off
    Input
    $0.80$0.64
    -20% off
    /M
    Cached
    —/M
    Output
    $2.40$1.92
    -20% off
    /M
    Capabilities
    Streaming
    Reasoning
    Try in Playground

    CogView-4

    zai
    cogview-4

    Providers

    Z AI
    zai/cogview-4
    Context Size
    2k
    Stability
    stable
    Pricing
    10% off
    Input
    $0.00$0.00
    -10% off
    /M
    Cached
    —/M
    Output
    $0.00$0.00
    -10% off
    /M
    Per Request
    $0.010/req
    Capabilities
    Image Generation
    Try in Playground

    Qwen QwQ 32B

    alibabaModel Deactivated
    qwen-qwq-32b

    Providers

    Nebius AI
    nebius/qwen-qwq-32b
    Context Size
    32.8k
    Stability
    stable
    Deactivated since Nov 3, 2025
    Pricing
    Input
    $0.15/M
    Cached
    —/M
    Output
    $0.45/M
    Capabilities
    Streaming
    JSON Output
    Try in Playground

    Claude 3.7 Sonnet

    anthropic
    claude-3-7-sonnet

    Providers

    Anthropic
    anthropic/claude-3-7-sonnet
    Context Size
    200k
    Stability
    stable
    Pricing
    Input
    $3.00/M
    Cached
    $0.30/M
    Output
    $15.00/M
    Capabilities
    Streaming
    Tools
    Reasoning
    Reasoning Budget
    Native Web Search
    Try in Playground

    Qwen2.5 VL 32B Instruct

    alibaba
    qwen2-5-vl-32b-instruct

    Providers

    Alibaba Cloud
    alibaba/qwen2-5-vl-32b-instruct
    Context Size
    131.1k
    Stability
    stable
    Pricing
    20% off
    Input
    $1.40$1.12
    -20% off
    /M
    Cached
    —/M
    Output
    $4.20$3.36
    -20% off
    /M
    Capabilities
    Streaming
    Vision
    JSON Output
    Try in Playground

    Claude 3.7 Sonnet (2025-02-19)

    anthropic
    claude-3-7-sonnet-20250219

    Providers

    Anthropic
    anthropic/claude-3-7-sonnet-20250219
    Context Size
    200k
    Stability
    stable
    Pricing
    Input
    $3.00/M
    Cached
    $0.30/M
    Output
    $15.00/M
    Capabilities
    Streaming
    Tools
    Reasoning
    Reasoning Budget
    Native Web Search
    Try in Playground

    Grok-3

    xai
    grok-3

    Providers

    xAI
    xai/grok-3
    Context Size
    131.1k
    Stability
    stable
    Pricing
    Input
    $3.00/M
    Cached
    —/M
    Output
    $15.00/M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    Qwen VL Plus

    alibaba
    qwen-vl-plus

    Providers

    Alibaba Cloud
    alibaba/qwen-vl-plus
    Context Size
    131.1k
    Stability
    stable
    Pricing
    20% off
    Input
    $0.21$0.17
    -20% off
    /M
    Cached
    —/M
    Output
    $0.64$0.51
    -20% off
    /M
    Capabilities
    Streaming
    Vision
    JSON Output
    Try in Playground

    Qwen VL Max

    alibaba
    qwen-vl-max

    Providers

    Alibaba Cloud
    alibaba/qwen-vl-max
    Context Size
    131.1k
    Stability
    stable
    Pricing
    20% off
    Input
    $0.80$0.64
    -20% off
    /M
    Cached
    —/M
    Output
    $3.20$2.56
    -20% off
    /M
    Capabilities
    Streaming
    Vision
    JSON Output
    Try in Playground

    Qwen Turbo

    alibaba
    qwen-turbo

    Providers

    Alibaba Cloud
    alibaba/qwen-turbo
    Context Size
    1M
    Stability
    stable
    Pricing
    20% off
    Input
    $0.05$0.04
    -20% off
    /M
    Cached
    —/M
    Output
    $0.20$0.16
    -20% off
    /M
    Capabilities
    Streaming
    JSON Output
    Try in Playground

    Qwen3 Coder 480B A35B Instruct

    alibaba
    qwen3-coder-480b-a35b-instruct

    Providers

    CanopyWave
    canopywave/qwen3-coder-480b-a35b-instruct
    Context Size
    262.1k
    Stability
    stable
    Deactivated since Feb 1, 2026
    Pricing
    30% off
    Input
    $0.30$0.21
    -30% off
    /M
    Cached
    —/M
    Output
    $1.30$0.91
    -30% off
    /M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    Qwen2.5 VL 72B Instruct

    alibaba
    qwen2-5-vl-72b-instruct

    Providers

    Nebius AI
    nebius/qwen2-5-vl-72b-instruct
    Context Size
    32.8k
    Stability
    stable
    Pricing
    Input
    $0.13/M
    Cached
    —/M
    Output
    $0.40/M
    Capabilities
    Streaming
    Vision
    JSON Output
    Try in Playground

    Qwen Plus

    alibaba
    qwen-plus

    Providers

    Alibaba Cloud
    alibaba/qwen-plus
    Context Size
    131.1k
    Stability
    stable
    Pricing
    20% off
    Input
    $0.40$0.32
    -20% off
    /M
    Cached
    $0.08$0.06
    -20% off
    /M
    Output
    $1.20$0.96
    -20% off
    /M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    Qwen Max Latest

    alibaba
    qwen-max-latest

    Providers

    Alibaba Cloud
    alibaba/qwen-max-latest
    Context Size
    131.1k
    Stability
    stable
    Pricing
    20% off
    Input
    $1.60$1.28
    -20% off
    /M
    Cached
    —/M
    Output
    $6.40$5.12
    -20% off
    /M
    Capabilities
    Streaming
    Vision
    Tools
    JSON Output
    Try in Playground

    DeepSeek R1 Distill Llama 70B

    deepseekModel Deactivated
    deepseek-r1-distill-llama-70b

    Providers

    Groq
    groq/deepseek-r1-distill-llama-70b
    Context Size
    131.1k
    Stability
    stable
    Deactivated since Oct 9, 2025
    Pricing
    Input
    $0.75/M
    Cached
    —/M
    Output
    $0.99/M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    MiniMax Text 01

    minimax
    minimax-text-01

    Providers

    MiniMax
    minimax/minimax-text-01
    Context Size
    1M
    Stability
    stable
    Pricing
    Input
    $0.20/M
    Cached
    —/M
    Output
    $1.10/M
    Capabilities
    Streaming
    Try in Playground

    GLM-Image

    glm
    glm-image

    Providers

    Z AI
    zai/glm-image
    Context Size
    2k
    Stability
    stable
    Pricing
    10% off
    Input
    $0.00$0.00
    -10% off
    /M
    Cached
    —/M
    Output
    $0.00$0.00
    -10% off
    /M
    Per Request
    $0.015/req
    Capabilities
    Image Generation
    Try in Playground

    Sonar

    perplexity
    sonar

    Providers

    Perplexity
    perplexity/sonar
    Context Size
    130k
    Stability
    stable
    Pricing
    Input
    $1.00/M
    Cached
    —/M
    Output
    $1.00/M
    Per Request
    $0.005/req
    Capabilities
    Streaming
    Structured JSON Output
    Try in Playground

    DeepSeek V3

    deepseekModel Deactivated
    deepseek-v3

    Providers

    Nebius AI
    nebius/deepseek-v3
    Context Size
    64k
    Stability
    unstable
    Deactivated since Nov 3, 2025
    Pricing
    Input
    $0.50/M
    Cached
    —/M
    Output
    $1.50/M
    Capabilities
    Streaming
    Try in Playground

    Llama 3.3 70B Instruct

    meta
    llama-3.3-70b-instruct

    Providers

    Cerebras
    cerebras/llama-3.3-70b-instruct
    Context Size
    128k
    Stability
    stable
    Pricing
    Input
    $0.85/M
    Cached
    —/M
    Output
    $1.20/M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    Pixtral Large Latest

    mistral
    pixtral-large-latest

    Providers

    Mistral AI
    mistral/pixtral-large-latest
    Context Size
    128k
    Stability
    stable
    Pricing
    Input
    $4.00/M
    Cached
    —/M
    Output
    $12.00/M
    Capabilities
    Streaming
    Vision
    Try in Playground

    Claude 3.5 Sonnet (2024-10-22)

    anthropicModel Deactivated
    claude-3-5-sonnet-20241022

    Providers

    Anthropic
    anthropic/claude-3-5-sonnet-20241022
    Context Size
    200k
    Stability
    stable
    Deactivated since Oct 22, 2025
    Pricing
    Input
    $3.00/M
    Cached
    $0.30/M
    Output
    $15.00/M
    Capabilities
    Streaming
    Tools
    Native Web Search
    Try in Playground

    Gemini 1.5 Flash 8B

    googleModel Deactivated
    gemini-1.5-flash-8b

    Providers

    Google AI Studio
    google-ai-studio/gemini-1.5-flash-8b
    Context Size
    1M
    Stability
    stable
    Deactivated since Sep 20, 2025
    Pricing
    Input
    $0.04/M
    Cached
    —/M
    Output
    $0.15/M
    Capabilities
    Streaming
    Tools
    Reasoning
    Reasoning Budget
    JSON Output
    Structured JSON Output
    Try in Playground

    GPT-4o Mini Search Preview

    openai
    gpt-4o-mini-search-preview

    Providers

    OpenAI
    openai/gpt-4o-mini-search-preview
    Context Size
    128k
    Stability
    stable
    Pricing
    Input
    $0.15/M
    Cached
    —/M
    Output
    $0.60/M
    Capabilities
    Streaming
    Vision
    Native Web Search
    Try in Playground

    GPT-4o Search Preview

    openai
    gpt-4o-search-preview

    Providers

    OpenAI
    openai/gpt-4o-search-preview
    Context Size
    128k
    Stability
    stable
    Pricing
    Input
    $2.50/M
    Cached
    —/M
    Output
    $10.00/M
    Capabilities
    Streaming
    Vision
    Native Web Search
    Try in Playground

    Llama 3.2 11B Instruct

    meta
    llama-3.2-11b-instruct

    Providers

    Inference.net
    inference.net/llama-3.2-11b-instruct
    Context Size
    128k
    Stability
    unstable
    Pricing
    Input
    $0.07/M
    Cached
    —/M
    Output
    $0.33/M
    Capabilities
    Streaming
    JSON Output
    Try in Playground

    Qwen2 VL 72B Instruct

    alibabaModel Deactivated
    qwen2-vl-72b-instruct

    Providers

    Nebius AI
    nebius/qwen2-vl-72b-instruct
    Context Size
    32.8k
    Stability
    stable
    Deactivated since Sep 10, 2025
    Pricing
    Input
    $0.13/M
    Cached
    —/M
    Output
    $0.40/M
    Capabilities
    Streaming
    Vision
    JSON Output
    Try in Playground

    Qwen2.5 72B Instruct

    alibabaModel Deactivated
    qwen25-72b-instruct

    Providers

    Nebius AI
    nebius/qwen25-72b-instruct
    Context Size
    32.8k
    Stability
    stable
    Deactivated since Nov 3, 2025
    Pricing
    Input
    $0.13/M
    Cached
    —/M
    Output
    $0.40/M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    Qwen2.5 32B Instruct

    alibabaModel Deactivated
    qwen25-32b-instruct

    Providers

    Nebius AI
    nebius/qwen25-32b-instruct
    Context Size
    32.8k
    Stability
    stable
    Deactivated since Sep 10, 2025
    Pricing
    Input
    $0.06/M
    Cached
    —/M
    Output
    $0.20/M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    Qwen2.5 Coder 7B

    alibaba
    qwen25-coder-7b

    Providers

    Nebius AI
    nebius/qwen25-coder-7b
    Context Size
    32.8k
    Stability
    stable
    Pricing
    Input
    $0.01/M
    Cached
    —/M
    Output
    $0.03/M
    Capabilities
    Streaming
    JSON Output
    Try in Playground

    Llama 3.2 3B Instruct

    meta
    llama-3.2-3b-instruct

    Providers

    NovitaAI
    novita/llama-3.2-3b-instruct
    Context Size
    32.8k
    Stability
    unstable
    Pricing
    Input
    $0.03/M
    Cached
    —/M
    Output
    $0.05/M
    Capabilities
    Streaming
    JSON Output
    Try in Playground

    Qwen Coder Plus

    alibaba
    qwen-coder-plus

    Providers

    Alibaba Cloud
    alibaba/qwen-coder-plus
    Context Size
    131.1k
    Stability
    stable
    Pricing
    20% off
    Input
    $1.00$0.80
    -20% off
    /M
    Cached
    —/M
    Output
    $5.00$4.00
    -20% off
    /M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    o1 Mini

    openai
    o1-mini

    Providers

    Azure
    azure/o1-mini
    Context Size
    128k
    Stability
    unstable
    Pricing
    Input
    $1.10/M
    Cached
    —/M
    Output
    $4.40/M
    Capabilities
    Try in Playground

    o1

    openai
    o1

    Providers

    Azure
    azure/o1
    Context Size
    200k
    Stability
    unstable
    Pricing
    Input
    $15.00/M
    Cached
    —/M
    Output
    $60.00/M
    Capabilities
    Streaming
    Vision
    Reasoning
    JSON Output
    Structured JSON Output
    Try in Playground

    Qwen Flash

    alibaba
    qwen-flash

    Providers

    Alibaba Cloud
    alibaba/qwen-flash
    Context Size
    1M
    Stability
    stable
    Pricing
    20% off
    Input
    $0.05$0.04
    -20% off
    /M
    Cached
    $0.01$0.01
    -20% off
    /M
    Output
    $0.40$0.32
    -20% off
    /M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    Qwen Plus Latest

    alibaba
    qwen-plus-latest

    Providers

    Alibaba Cloud
    alibaba/qwen-plus-latest
    Context Size
    1M
    Stability
    stable
    Pricing
    20% off
    Input
    $0.40$0.32
    -20% off
    /M
    Cached
    $0.08$0.06
    -20% off
    /M
    Output
    $1.20$0.96
    -20% off
    /M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    Hermes 3 Llama 405B

    nousresearchModel Deactivated
    hermes-3-llama-405b

    Providers

    Nebius AI
    nebius/hermes-3-llama-405b
    Context Size
    131.1k
    Stability
    stable
    Deactivated since Nov 3, 2025
    Pricing
    Input
    $1.00/M
    Cached
    —/M
    Output
    $3.00/M
    Capabilities
    Streaming
    JSON Output
    Try in Playground

    Llama 3.1 70B Instruct

    meta
    llama-3.1-70b-instruct

    Providers

    AWS Bedrock
    aws-bedrock/llama-3.1-70b-instruct
    Context Size
    128k
    Stability
    unstable
    Pricing
    Input
    $0.72/M
    Cached
    —/M
    Output
    $0.72/M
    Capabilities
    Streaming
    Try in Playground

    Llama 3.1 405B Instruct

    metaModel Deactivated
    llama-3.1-405b-instruct

    Providers

    Nebius AI
    nebius/llama-3.1-405b-instruct
    Context Size
    128k
    Stability
    stable
    Deactivated since Nov 3, 2025
    Pricing
    Input
    $1.00/M
    Cached
    —/M
    Output
    $3.00/M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground

    Llama 3.1 8B Instruct

    meta
    llama-3.1-8b-instruct

    Providers

    AWS Bedrock
    aws-bedrock/llama-3.1-8b-instruct
    Context Size
    128k
    Stability
    unstable
    Pricing
    Input
    $0.22/M
    Cached
    —/M
    Output
    $0.22/M
    Capabilities
    Streaming
    Try in Playground

    GPT-4o Mini

    openai
    gpt-4o-mini

    Providers

    OpenAI
    openai/gpt-4o-mini
    Context Size
    128k
    Stability
    stable
    Pricing
    Input
    $0.15/M
    Cached
    $0.07/M
    Output
    $0.60/M
    Capabilities
    Streaming
    Tools
    JSON Output
    Structured JSON Output
    Try in Playground

    Gemma 2 27B IT

    google
    gemma-2-27b-it-together

    Providers

    Together AI
    together.ai/gemma-2-27b-it-together
    Context Size
    8.2k
    Stability
    stable
    Pricing
    Input
    $0.08/M
    Cached
    —/M
    Output
    $0.08/M
    Capabilities
    Streaming
    Try in Playground

    Gemma2 9B IT

    googleModel Deactivated
    gemma2-9b-it

    Providers

    Groq
    groq/gemma2-9b-it
    Context Size
    8.1k
    Stability
    unstable
    Deactivated since Oct 8, 2025
    Pricing
    Input
    $0.20/M
    Cached
    —/M
    Output
    $0.20/M
    Capabilities
    Streaming
    Tools
    Try in Playground

    Claude 3.5 Sonnet (Old)

    anthropicModel Deactivated
    claude-3-5-sonnet-20240620

    Providers

    Anthropic
    anthropic/claude-3-5-sonnet-20240620
    Context Size
    200k
    Stability
    stable
    Deactivated since Feb 19, 2026
    Pricing
    Input
    $3.00/M
    Cached
    $0.30/M
    Output
    $15.00/M
    Capabilities
    Streaming
    Vision
    Tools
    Try in Playground

    Claude 3.5 Sonnet

    anthropic
    claude-3-5-sonnet

    Providers

    Anthropic
    anthropic/claude-3-5-sonnet
    Context Size
    200k
    Stability
    stable
    Pricing
    Input
    $3.00/M
    Cached
    $0.30/M
    Output
    $15.00/M
    Capabilities
    Streaming
    Tools
    Native Web Search
    Try in Playground

    Hermes 2 Pro Llama 3 8B

    nousresearch
    hermes-2-pro-llama-3-8b

    Providers

    NovitaAI
    novita/hermes-2-pro-llama-3-8b
    Context Size
    8.2k
    Stability
    unstable
    Pricing
    Input
    $0.14/M
    Cached
    —/M
    Output
    $0.14/M
    Capabilities
    Streaming
    Try in Playground

    Gemini 1.5 Pro

    googleModel Deactivated
    gemini-1.5-pro

    Providers

    Google AI Studio
    google-ai-studio/gemini-1.5-pro
    Context Size
    1M
    Stability
    stable
    Deactivated since Sep 20, 2025
    Pricing
    Input
    $2.50/M
    Cached
    —/M
    Output
    $10.00/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    JSON Output
    Structured JSON Output
    Try in Playground

    GPT-4o

    openai
    gpt-4o

    Providers

    Azure
    azure/gpt-4o
    Context Size
    128k
    Stability
    unstable
    Pricing
    Input
    $2.50/M
    Cached
    $1.25/M
    Output
    $10.00/M
    Capabilities
    Streaming
    Vision
    Tools
    JSON Output
    Try in Playground

    Gemini 1.5 Flash

    googleModel Deactivated
    gemini-1.5-flash

    Providers

    Google AI Studio
    google-ai-studio/gemini-1.5-flash
    Context Size
    1M
    Stability
    stable
    Deactivated since Sep 20, 2025
    Pricing
    Input
    $0.04/M
    Cached
    —/M
    Output
    $0.15/M
    Capabilities
    Streaming
    Vision
    Tools
    Reasoning
    Reasoning Budget
    JSON Output
    Structured JSON Output
    Try in Playground

    Llama 3 70B Instruct

    meta
    llama-3-70b-instruct

    Providers

    NovitaAI
    novita/llama-3-70b-instruct
    Context Size
    8.2k
    Stability
    stable
    Pricing
    Input
    $0.51/M
    Cached
    —/M
    Output
    $0.74/M
    Capabilities
    Streaming
    JSON Output
    Try in Playground

    Claude 3 Haiku

    anthropic
    claude-3-haiku

    Providers

    Anthropic
    anthropic/claude-3-haiku
    Context Size
    200k
    Stability
    stable
    Pricing
    Input
    $0.25/M
    Cached
    $0.03/M
    Output
    $1.25/M
    Capabilities
    Streaming
    Vision
    Tools
    Try in Playground

    Claude 3 Opus

    anthropic
    claude-3-opus

    Providers

    Anthropic
    anthropic/claude-3-opus
    Context Size
    200k
    Stability
    stable
    Pricing
    Input
    $15.00/M
    Cached
    $1.50/M
    Output
    $75.00/M
    Capabilities
    Streaming
    Vision
    Tools
    Try in Playground

    Auto Route

    llmgateway
    auto

    Providers

    LLM Gateway
    llmgateway/auto
    Context Size
    —
    Stability
    stable
    Pricing
    Input
    —/M
    Cached
    —/M
    Output
    —/M
    Capabilities
    Streaming
    Vision
    Tools
    JSON Output
    Try in Playground

    Custom Model

    llmgateway
    custom

    Providers

    LLM Gateway
    llmgateway/custom
    Context Size
    —
    Stability
    stable
    Pricing
    Input
    —/M
    Cached
    —/M
    Output
    —/M
    Capabilities
    Streaming
    Vision
    Tools
    JSON Output
    Try in Playground

    Mixtral 8x7B Instruct

    mistral
    mixtral-8x7b-instruct-together

    Providers

    Together AI
    together.ai/mixtral-8x7b-instruct-together
    Context Size
    32.8k
    Stability
    stable
    Pricing
    Input
    $0.06/M
    Cached
    —/M
    Output
    $0.06/M
    Capabilities
    Streaming
    JSON Output
    Try in Playground

    GPT-4 Turbo

    openai
    gpt-4-turbo

    Providers

    Azure
    azure/gpt-4-turbo
    Context Size
    128k
    Stability
    unstable
    Pricing
    Input
    $10.00/M
    Cached
    —/M
    Output
    $30.00/M
    Capabilities
    Streaming
    Vision
    Tools
    JSON Output
    Try in Playground

    Mistral 7B Instruct

    mistralModel Deactivated
    mistral-7b-instruct-together

    Providers

    Together AI
    together.ai/mistral-7b-instruct-together
    Context Size
    8.2k
    Stability
    stable
    Deactivated since Nov 13, 2025
    Pricing
    Input
    $0.06/M
    Cached
    —/M
    Output
    $0.06/M
    Capabilities
    Streaming
    JSON Output
    Try in Playground

    GPT-4

    openai
    gpt-4

    Providers

    Azure
    azure/gpt-4
    Context Size
    8.2k
    Stability
    unstable
    Pricing
    Input
    $30.00/M
    Cached
    —/M
    Output
    $60.00/M
    Capabilities
    Streaming
    Tools
    Try in Playground

    GPT-3.5 Turbo

    openai
    gpt-3.5-turbo

    Providers

    Azure
    azure/gpt-3.5-turbo
    Context Size
    16.4k
    Stability
    unstable
    Pricing
    Input
    $0.50/M
    Cached
    —/M
    Output
    $1.50/M
    Capabilities
    Streaming
    Tools
    JSON Output
    Try in Playground
    LLM Gateway

    © 2026 LLM Gateway. All rights reserved.

    Product

    • Features
    • Models
    • Providers
    • Chat Playground
    • Changelog
    • Compare Models
    • Enterprise

    Resources

    • Templates
    • Agents
    • MCP Server
    • Blog
    • Documentation
    • Integrations
    • Guides
    • Brand Assets
    • Referral Program
    • GitHub
    • Contact Us
    • Privacy Policy
    • Terms of Use

    Community

    • Twitter
    • Discord

    Compare

    • OpenRouter
    • LiteLLM

    Models

    • Text Generation
    • Text to Image
    • Image to Image
    • Vision
    • Reasoning
    • Tool Calling
    • Web Search
    • Discounted

    Providers

    • OpenAI
    • Anthropic
    • Google AI Studio
    • Google Vertex AI
    • Obsidian
    • Groq
    • Cerebras
    • xAI
    • DeepSeek
    • Alibaba Cloud
    • NovitaAI
    • AWS Bedrock
    • Azure
    • Z AI
    • Moonshot AI
    • Perplexity
    • Nebius AI
    • Mistral AI
    • CanopyWave
    • Inference.net
    • Together AI
    • Custom
    • NanoGPT
    • ByteDance
    • MiniMax