Kimi K2 Thinking Model Support

Added support for Moonshot AI's Kimi K2 Thinking model with 262K context window, advanced reasoning capabilities, and prompt caching for cost-effective thinking tasks.

Kimi K2 Thinking model now available on LLM Gateway

We're excited to announce support for Kimi K2 Thinking, Moonshot AI's advanced reasoning model, now available through LLM Gateway. Experience powerful thinking capabilities with extensive context and efficient prompt caching.

📊 Model Specifications

Kimi K2 Thinking

  • Model ID: moonshot/kimi-k2-thinking
  • Provider: Moonshot AI
  • Context Window: 262,144 tokens (262.1K)
  • Input Price: $0.60 per million tokens
  • Cached Input Price: $0.15 per million tokens
  • Output Price: $2.50 per million tokens
  • Capabilities: Streaming, Tools, JSON Output, Advanced Reasoning

✨ Key Capabilities

🧠 Thinking-First Design: Optimized for tasks requiring deep reasoning, logical analysis, and structured thinking processes.

🔧 Tool Support: Native support for function calling and tool use, enabling complex workflows and integrations.

⚡ Streaming: Real-time response generation for interactive applications and chat interfaces.

📝 JSON Output: Structured output support for seamless integration with your applications.


Try it now in the Playground 🚀

    Kimi K2 Thinking Model Support - Changelog - LLM Gateway