Kimi K2 Instruct

Moonshot AI · China
1T-parameter mixture-of-experts (MoE) model with 32B parameters active per token, tuned for agentic and tool-use workflows. The Modified MIT license permits commercial use. It carries the same Chinese-origin alignment and supply-chain considerations as DeepSeek and Qwen.
License characteristics
License: Modified MIT
Commercial use: Unrestricted
Derivatives: Allowed
Attribution: Minimal
Parameters: 1T MoE (32B active; 384 experts, 8 active + 1 shared)
Context: 128K tokens
Architecture: MLA attention, 61 layers
Training data: Not disclosed
Last updated: 2025-Q3
Known risks
  • Chinese-origin alignment biases
  • Self-hosting 1T MoE is infrastructure-heavy
  • Modified MIT — read the specific modifications
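To put a rough number on the "infrastructure-heavy" risk, here is a back-of-the-envelope sketch of the memory needed for the weights alone. The parameter counts come from the card; the bytes-per-parameter widths and the 80 GB device size are illustrative assumptions, and the estimate ignores KV cache, activations, and runtime overhead, so real deployments need more.

```python
import math

TOTAL_PARAMS = 1e12    # "1T" from the card (rounded)
ACTIVE_PARAMS = 32e9   # "32B active" from the card

# Assumed storage widths in bytes per parameter (not stated in the card).
BYTES_PER_PARAM = {"bf16": 2.0, "fp8": 1.0, "int4": 0.5}


def weight_memory_gb(params: float, bytes_per_param: float) -> float:
    """Memory for the weights alone, in decimal gigabytes."""
    return params * bytes_per_param / 1e9


def min_accelerators(params: float, bytes_per_param: float,
                     device_gb: float = 80.0) -> int:
    """Lower bound on devices needed just to hold the weights.

    device_gb is a hypothetical per-device memory size.
    """
    return math.ceil(weight_memory_gb(params, bytes_per_param) / device_gb)


for name, width in BYTES_PER_PARAM.items():
    gb = weight_memory_gb(TOTAL_PARAMS, width)
    n = min_accelerators(TOTAL_PARAMS, width)
    print(f"{name}: {gb:.0f} GB of weights, at least {n} x 80 GB devices")

# Share of parameters used per token.
print(f"active fraction per token: {ACTIVE_PARAMS / TOTAL_PARAMS:.1%}")
```

Under these assumptions the weights take about 2,000 GB at 16-bit, 1,000 GB at 8-bit, and 500 GB at 4-bit precision. Only about 3.2% of parameters are active per token, which reduces compute, but every expert still has to stay resident in memory, so the footprint scales with the 1T total rather than the 32B active count.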
Reviewed by Ali Madjaji · Last reviewed 2026-04-16