Moonshot Kimi
Connect Moonshot's Kimi Coding 2.6 (and Kimi K2 Thinking) to Fabric Agents. Setup, model ids, and notes on the unified kimi-for-coding endpoint.
Moonshot's Kimi Coding 2.6 (released April 2026) is available as a Pi-SDK-backed provider in Fabric Agents. Access is via API key.
Get an API key
- Sign in at platform.kimi.com (or
platform.moonshot.cnif you're on the Chinese mainland instance). - Create a key under API Keys — it starts with
sk-kimi-. - Top up a balance or confirm you're on Moonshot's free trial — keys with zero quota silently return "quota exceeded" on first send.
Connect in Fabric Agents
- Open Settings → AI → Connections → Add.
- From the provider picker, choose Kimi (Coding) — this maps to endpoint
https://api.kimi.com/coding. - Paste your
sk-kimi-…key. - Click Test connection. You should see the model catalog populate with Kimi Coding 2.6 and Kimi K2 Thinking.
- Save.
The connection shows up in the model picker. Sessions created with this connection are routed through the Pi SDK using Moonshot's coding-optimized inference backend.
Model ids
Moonshot moved off version-stamped ids for this product line in April 2026 — the API now exposes a single endpoint-name id that always points at the latest coding model.
| Id | Maps to | Notes |
|---|---|---|
kimi-for-coding | Kimi Coding 2.6 (current) | Unified endpoint. Server-side routing — when Moonshot releases 2.7 this same id will upgrade automatically. |
kimi-k2-thinking | Kimi K2 Thinking | Reasoning-focused variant. Addressable separately. |
k2p5 | (deprecated) | The former version-stamped name for 2.5. Phased out; new configurations should use kimi-for-coding instead. |
Fabric Agents auto-migrates older configs — if you set up Kimi before April 2026 with k2p5 in your model list, the next launch replaces it with kimi-for-coding and marks that as the Best-tier default.
Model capabilities
Both Kimi models under this provider:
- Use the
anthropic-messagesAPI family, so tool calling and streaming behave the same as Claude connections. - Accept text and image inputs (
kimi-for-coding) or text only (kimi-k2-thinking). - Support a 256 K token context window with a 32 K max-output per turn.
- Support extended reasoning / thinking. In Fabric Agents, the thinking-level selector in the input bar maps to Moonshot's reasoning_effort.
Tier picker defaults
When you connect Kimi for the first time, the onboarding 3-tier defaults resolve to:
| Tier | Default |
|---|---|
| Best | Kimi Coding 2.6 |
| Balanced | Kimi Coding 2.6 |
| Fast | Kimi K2 Thinking |
You can override any of them in Settings → AI → Kimi (Coding) → Models. Sessions then use the Balanced tier by default; pick Best or Fast from the model picker per session.
Regional notes
- Moonshot runs two separate clouds — global (
platform.kimi.com, endpointapi.kimi.com) and China (platform.moonshot.cn, endpointapi.moonshot.cn). Yoursk-kimi-…key is tied to one of them; they don't cross-authenticate. If you've got the wrong region on file,Test connectionreturns 401. - The China cloud's rate limits are stricter; for heavy automated use, Moonshot's docs recommend a dedicated enterprise key.
Troubleshooting
Settings shows "Kimi K2.5" instead of 2.6 — you're on a build prior to v0.8.12. Upgrade the desktop app (the in-app auto-updater picks up stable releases from agents.fabric.pro/electron/latest/). The fix adds the kimi-for-coding id to the picker and migrates older k2p5 saves.
Test connection returns 401 — wrong cloud (see Regional notes) or the key was revoked. Cycle it on the Moonshot dashboard and paste the new one.
"model not found" when sending a message — your connection's model list still has k2p5 but Moonshot's API no longer accepts it. Open Settings → AI → Kimi (Coding) → Models, remove k2p5, add kimi-for-coding, save. (The auto-migration in v0.8.12+ handles this on launch; this is only a manual step if you skipped the upgrade.)
Long tool-using sessions degrade / lose context — bump the thinking level for that session. Kimi's reasoning_effort tiers substantially affect chain-of-thought depth on complex agentic tasks.
Related
- LLM Providers overview — other provider setups
- Interactions reference — thinking levels, model picker behaviour
- Release notes for v0.8.12 — the version that added Kimi Coding 2.6
LLM Providers
Fabric Agents works with every frontier model — Anthropic, OpenAI, Google, Moonshot Kimi, and any OpenAI-compatible endpoint. This page covers setup for each.
Azure AI Foundry
Connect Azure-hosted OpenAI-compatible endpoints to Fabric Agents using Microsoft Entra ID (Azure AD) Bearer-token auth. Resource discovery, deployment selection, and token refresh are handled automatically.