Kimi K2 Thinking Model Support

Added support for Moonshot AI's Kimi K2 Thinking model with 262K context window, advanced reasoning capabilities, and prompt caching for cost-effective thinking tasks.

November 7, 2025

Kimi K2 Thinking model now available on LLM Gateway

We're excited to announce support for Kimi K2 Thinking, Moonshot AI's advanced reasoning model, now available through LLM Gateway. Experience powerful thinking capabilities with extensive context and efficient prompt caching.

📊 Model Specifications

Kimi K2 Thinking

1moonshot/kimi-k2-thinking

1moonshot/kimi-k2-thinking

Provider: Moonshot AI
Context Window: 262,144 tokens (262.1K)
Input Price: $0.60 per million tokens
Cached Input Price: $0.15 per million tokens
Output Price: $2.50 per million tokens
Capabilities: Streaming, Tools, JSON Output, Advanced Reasoning

✨ Key Capabilities

🧠 Thinking-First Design: Optimized for tasks requiring deep reasoning, logical analysis, and structured thinking processes.

🔧 Tool Support: Native support for function calling and tool use, enabling complex workflows and integrations.

⚡ Streaming: Real-time response generation for interactive applications and chat interfaces.

📝 JSON Output: Structured output support for seamless integration with your applications.

Try it now in the Playground 🚀

Kimi K2 Thinking Model Support

📊 Model Specifications

✨ Key Capabilities

Stay ahead of the curve

Support

Welcome!