Changelog — New Features, Improvements, and Fixes

May 17, 2026

Seedance Video Models, Pinned Chats, Sharing Across Orgs & More

ByteDance Seedance video models land in the gateway, Chat gets pinning and cross-org sharing, plus vertex-anthropic, grok-4.20, and a stack of fixes.

Read more→

LLM Gateway now supports all ByteDance Seedance video generation models

May 15, 2026

OpenAI-Compatible Embeddings

Turn text into vectors for semantic search, clustering, and RAG — through the same gateway you already use for chat.

Read more→

LLM Gateway now supports OpenAI-compatible embeddings

May 7, 2026

DevPass is live on Product Hunt

DevPass by LLM Gateway is live on Product Hunt today. One subscription, every coding model, three flat prices.

Read more→

DevPass by LLM Gateway — $1 in, $3 out — live on Product Hunt

April 12, 2026

Sessions Rebranded to Agents

Sessions are now Agents — monitor your AI coding agents, track costs per agent, and drill into individual sessions.

Read more→

Agents page on LLM Gateway showing AI coding agent monitoring

April 2, 2026

Multi-Region Routing, Content Filters & More

Route requests to regional providers, protect your apps with built-in content moderation, enforce API key rate limits, and explore new models.

Read more→

Multi-region routing and content filters on LLM Gateway

March 23, 2026

Video Generation, Sessions & More

Generate videos via the API, track conversations with sessions, and more — plus new models and providers.

Read more→

Video generation and sessions now available on LLM Gateway

March 6, 2026

GPT-5.4 and GPT-5.4 Pro Now Available

Access OpenAI's most capable models — GPT-5.4 for complex professional work and GPT-5.4 Pro for smarter, more precise responses — with 1.05M context windows and reasoning support.

Read more→

GPT-5.4 and GPT-5.4 Pro models now available on LLM Gateway

February 27, 2026

Image Studio, Image Edits API & More

A dedicated Image Studio in the Playground for gallery-based generation with multi-model comparison, an OpenAI-compatible /v1/images/edits endpoint, and a wave of image generation improvements.

Read more→

Image Studio in the Playground comparing generations across Gemini 3.1 Flash Image, Gemini 3 Pro Image, and Qwen Image side by side

February 12, 2026

Automatic Retry & Fallback with Full Routing Transparency

When a provider fails, LLMGateway now automatically retries your request on another provider. Every attempt is logged with full routing visibility, so you always know what happened.

Read more→

January 30, 2026

AI Agent skills, Agents, Templates & CLI

Build AI-powered applications faster with pre-built agents, production-ready templates, and a new CLI tool for scaffolding projects.

Read more→

LLM Gateway agents and templates showcase

January 30, 2026

Unified Reasoning Configuration

New unified reasoning object for precise control over reasoning models. Specify exact token budgets with max_tokens or use effort levels — all in one consistent API.

Read more→

January 29, 2026

Dev Plans, Native Web Search, and MiniMax Provider

Ship faster with Dev Plans — AI-powered development planning now in beta. Plus native web search for real-time data, MiniMax provider, structured outputs for Anthropic & Perplexity, and a redesigned models experience.

Read more→

Dev Plans dashboard and web search capabilities

January 26, 2026

Enterprise Audit Logs

Track all organization activity with comprehensive audit logs. See who did what, when, and to which resource — available for Enterprise customers.

Read more→

Audit logs dashboard showing organization activity

January 26, 2026

Enterprise Guardrails

Protect your LLM usage with content guardrails. Detect and block prompt injections, PII, secrets, and more — available for Enterprise customers.

Read more→

January 21, 2026

Pro Features Now Free for Everyone

We're simplifying our pricing. All paid subscription features are now free for everyone — BYOK, team management, 30-day data retention, and more.

Read more→

January 2, 2026

Alibaba Cloud Qwen Image Models: Advanced Image Generation and Editing

Introducing Alibaba Cloud's Qwen Image model family - powerful models for text-to-image generation and image editing, now available in four variants: Qwen Image, Qwen Image Max, Qwen Image Max 2025-12-30, and Qwen Image Plus.

Read more→

Alibaba Cloud Qwen Image models now available on LLM Gateway

December 14, 2025

Cerebras: Ultra-Fast Inference with 6 New Models

New Cerebras provider with six high-performance models, including GPT-OSS 120B and Qwen 3, now available through LLM Gateway.

Read more→

Cerebras: Ultra-Fast Inference with 6 New Models

November 18, 2025

Gemini 3 Pro Preview: 20% Off Launch Discount

Google's latest Gemini 3 Pro Preview is now available with an exclusive 20% launch discount, featuring 1M context window and prompt caching.

Read more→

Gemini 3 Pro Preview: 20% Off Launch Discount

November 17, 2025

Sherlock: Two New Stealth Alpha Models

Introducing Sherlock Dash Alpha and Sherlock Think Alpha (Grok 4.1) - free stealth models with 1.8M context, reasoning, vision, and advanced capabilities.

Read more→

November 11, 2025

CanopyWave: 75% Off Kimi K2 Thinking

CanopyWave brings Kimi K2 Thinking to LLM Gateway with an exclusive 75% discount.

Read more→

November 8, 2025

CanopyWave: 3 New Models with 75% Off

CanopyWave brings Qwen3 Coder, MiniMax M2, and GLM-4.6 to LLM Gateway with an exclusive 75% discount on all three models.

Read more→

November 7, 2025

Kimi K2 Thinking Model Support

Added support for Moonshot AI's Kimi K2 Thinking model with 262K context window, advanced reasoning capabilities, and prompt caching for cost-effective thinking tasks.

Read more→

Kimi K2 Thinking model now available on LLM Gateway

November 5, 2025

Z.ai 10% Off & Google 20% Off All Models

Save on all Z.ai models with 10% off and get 20% off all Google models through LLM Gateway.

Read more→

Z.ai 10% off and Google 20% off all models

October 26, 2025

AWS Bedrock, Google Vertex AI and Microsoft Azure

Added native support for AWS Bedrock, Google Vertex AI, and Microsoft Azure.

Read more→

AWS Bedrock, Google Vertex AI and Microsoft Azure providers available in LLM Gateway

October 24, 2025

Team Members: Roles, Seats, and Access Controls

Invite teammates, assign roles (Owner, Admin, Developer), track included seats, and add more seats as your org grows. Pro includes team management; Enterprise adds SSO/SAML, SCIM, audit logs, and advanced permissions.

Read more→

Team Members management with role controls and seat usage

October 18, 2025

CanopyWave Partnership: 90% Off DeepSeek v3.1

Exclusive partnership with CanopyWave brings massive 90% discount on DeepSeek v3.1, making advanced reasoning capabilities more accessible than ever.

Read more→

CanopyWave partnership offering 90% off DeepSeek v3.1

September 29, 2025

Claude Sonnet 4.5 Model Support

Added support for Anthropic's Claude Sonnet 4.5

Read more→

Claude Sonnet models available via Anthropic provider on LLM Gateway

September 20, 2025

Grok 4 Fast Models: Flagship and Fast Variants Now Available

Added support for Grok 4 Fast Reasoning, and Grok 4 Fast Non-Reasoning models via xAI provider.

Read more→

Dashboard showing new Grok 4 Fast models via xAI provider

September 11, 2025

New Alibaba Qwen Models: Qwen3 Next, Max, Plus, Flash, Vision

Added support for qwen-max, qwen-max-latest, qwen-plus-latest, qwen-flash, qwen-vl-max, qwen-vl-plus and the new Qwen3 Next 80B A3B Instruct and Thinking models.

Read more→

Alibaba Qwen models now available on LLM Gateway

September 8, 2025

Claude Code Configuration Now Supported

Configure Claude Code to use any LLM model through LLMGateway's unified API with simple environment variable setup.

Read more→

Claude Code configuration support on LLM Gateway

September 5, 2025

Qwen3 Max Model Now Available

Access Alibaba's powerful Qwen3 Max model with 256K context window, advanced reasoning capabilities, vision support, and function calling - all at competitive pricing.

Read more→

Qwen3 Max model now available on LLM Gateway

August 26, 2025

Introducing Our First Image Generation Model: Gemini 2.5 Flash Image Preview

Generate stunning images with Google's Gemini 2.5 Flash Image Preview - our first image generation model with 32.8k context window and competitive pricing.

Read more→

Gemini 2.5 Flash Image Preview - First image generation model on LLM Gateway

August 24, 2025

DeepSeek v3.1 Model Support

Added support for DeepSeek's latest v3.1 model with 128K context window and competitive pricing for advanced reasoning capabilities.

Read more→

DeepSeek v3.1 model available in LLM Gateway with 128K context

August 16, 2025

New Models Directory & Free Llama 3.1 70B via CloudRift

Browse and compare 100+ AI models with advanced filtering, plus access Meta Llama 3.1 70B Instruct FP8 completely free through our CloudRift partnership.

Read more→

New models directory showing comprehensive model comparison and free Llama 3.1 70B

August 11, 2025

API Key Usage Limits & Credit Controls

Set individual credit limits for API keys to better control spending and prevent unexpected overages.

Read more→

API Keys dashboard showing usage limits and credit controls

August 7, 2025

GPT-5 Model Family Now Available

Get instant access to OpenAI's powerful new GPT-5 model family including gpt-5, gpt-5-mini, gpt-5-nano, and gpt-5-chat-latest with 400k context windows.

Read more→

GPT-5 models now available on LLM Gateway

August 7, 2025

AI SDK Provider v2.0 Released

Released v2.0 of our @llmgateway/ai-sdk-provider npm package with improved Vercel AI SDK integration and simplified model access.

Read more→

AI SDK Provider v2.0 package integration with code examples

August 6, 2025

Claude 4.1 Models: Opus and Sonnet Now Available

Added support for Claude 4.1 models including claude-opus-4-1, claude-opus-4-20250514, and claude-sonnet-4-20250514 via Anthropic provider.

Read more→

Dashboard showing new Claude 4.1 models via Anthropic provider

August 5, 2025

New GPT-OSS Models: 120B and 20B via Groq

Added support for GPT-OSS-120B and GPT-OSS-20B models via Groq, offering powerful open-source alternatives with extensive context windows and competitive pricing.

Read more→

Dashboard showing new GPT-OSS models via Groq provider

July 28, 2025

Next.js migration

We’ve moved from TanStack Start to Next.js. Here’s why it matters

Read more→

July 13, 2025

New Providers: Cloudrift, Moonshot AI and Novita AI Support

Added support for Cloudrift, Moonshot AI and Novita AI providers, both offering the powerful kimi-k2 model with extensive context windows and competitive pricing.

Read more→

Dashboard showing new Cloudrift Moonshot AI and Novita AI providers

June 26, 2025

New Providers: Groq and xAI Support + Always-Visible Credits

Added support for Groq and xAI providers with their latest models, plus credits are now always visible in the sidebar for easy access.

Read more→

Updated dashboard showing organization and project selection

June 22, 2025

Dashboard UI Improvements & Project Context

Introducing organizations and projects for clearer controls and statistics.

Read more→

June 15, 2025

Pro Subscription Launch

Bring your own LLM provider keys or use credits with reduced gateway fees (2.5% vs 5%). Includes premium analytics, higher rate limits, and priority email support.

Read more→