Skip to content

Kimi K2 0905

View Status

Kimi K2 0905 is Moonshot AI's September 2025 K2 checkpoint, a refined release focused on agentic coding workflows with a context window of 256K tokens, available through AI Gateway via fireworks.

Tool Use
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'moonshotai/kimi-k2-0905',
prompt: 'Why is the sky blue?'
})

What To Consider When Choosing a Provider

  • Configuration: This checkpoint routes across fewer providers than the base K2. Monitor provider-level status during high-demand periods if you observe elevated latency.
  • Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
  • Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.

When to Use Kimi K2 0905

Best For

  • Long agentic sessions: Accumulated context (tool outputs, file contents, multi-turn history) pushes beyond the base K2 context window
  • September 5, 2025 training refinements: Workloads targeting the newer checkpoint's agentic coding improvements
  • Full-codebase review: Multi-file code review or generation where the context of 256K tokens enables a complete codebase view in one call
  • Drop-in upgrade: Existing base K2 integrations seeking a direct upgrade to the newer checkpoint

Consider Alternatives When

  • Chain-of-thought traces: Kimi K2 Thinking variants are designed for explicit reasoning output
  • Maximum routing redundancy: Base Kimi K2 routes across a wider provider set than this checkpoint
  • Fastest K2 inference: Kimi K2 Turbo is the speed-optimized variant
  • Shorter context needs: Tasks that don't require the full 256K tokens benefit from base K2's broader failover pool

Conclusion

Kimi K2 0905 delivers September 5, 2025 training refinements for agentic coding alongside a context window of 256K tokens that accommodates the long histories of extended coding agent sessions. For teams running base K2 in agentic coding workflows, it's the checkpoint update with the larger context window. Switch by changing the model string to moonshotai/kimi-k2-0905 with no other integration changes.

Frequently Asked Questions

  • What was the focus of the 0905 checkpoint update?

    Agentic coding. The checkpoint refines multi-step development tasks, tool use in coding workflows, and sustained context across long coding sessions.

  • Why does the context window of 256K tokens matter for agentic coding specifically?

    Coding agents accumulate context rapidly: file contents, function signatures, test outputs, error logs, and multi-turn reasoning traces all consume tokens. A window of 256K tokens keeps a much larger project scope in context at once, which cuts truncation workarounds.

  • How does switching from base K2 to kimi-k2-0905 work?

    Update the model string in your API call to moonshotai/kimi-k2-0905. Authentication, tool-calling format, and the rest of the integration stay the same.

  • What providers serve this checkpoint through AI Gateway?

    AI Gateway routes Kimi K2 0905 across fireworks. Failover between them is automatic.

  • Is the 0905 checkpoint open-weight?

    Yes, in the same lineage as other open-weight K2-family models. Check Moonshot AI's Hugging Face repository for license terms specific to this checkpoint.

  • Does kimi-k2-0905 support tool calling?

    Yes. Tool calling through the standard function-calling interface matches the agentic coding focus of the 0905 training refinements.

  • What if the context of 256K tokens is more than my tasks need?

    If context length isn't a constraint, the base Kimi K2 routes across a wider provider set and may give more availability headroom for high-uptime production use.