Moonshot AIReasoning

Moonshot AI Releases Kimi-K2 Reasoning Model

The new Mixture-of-Experts model is designed for complex tasks but arrives in a custom compressed format with a restrictive license.

Nov 4, 2025

Major releaseOther

Chinese AI startup Moonshot AI has released Kimi-K2-Thinking, a new open-weight model aimed at tackling complex reasoning problems. The company describes the model as being designed for tasks that require “extended thinking,” positioning it as a specialized tool rather than a general-purpose chatbot. Architecturally, it is a Mixture-of-Experts (MoE) model, a design known for efficient scaling.

Uniquely, the model is not distributed in a standard format like SafeTensors. Instead, Moonshot has released the weights in a proprietary compressed format with a ‘.sbs’ extension. To use the model, developers must rely on the company's corresponding open-source inference framework, Kimi-Inference, which is designed to handle this specific format. The full model is available for download on Hugging Face.

Licensing and Limitations

The release is governed by the custom "Kimi-K2-Thinking License." While it permits academic research and general commercial use, it includes a significant restriction: the model cannot be used for any commercial purpose that directly competes with Moonshot AI's own products or services. This kind of non-compete clause places it in a category of controlled-use models, distinct from permissively licensed open-source software.

This release highlights a growing trend of companies making powerful models publicly available under specific, often restrictive, terms. For researchers, Kimi-K2-Thinking offers a new architecture to explore for advanced reasoning. However, the non-standard weight format and licensing terms present hurdles for broad commercial adoption and straightforward integration into the existing open-source ecosystem.

Sources

moonshotai/Kimi-K2-Thinking
Hugging Face
Visit

0 comments

No comments yet. Be the first to weigh in.

DeepSeek Ships V4-Flash, a 304B MoE Tuned for Agents

The latest checkpoint in DeepSeek's V4 line leans into agentic workflows while keeping the permissive MIT license.

Jul 31, 2026

DeepSeek/Text / LLM

DeepSeek Refreshes V4-Flash With New 0731 Checkpoint

The MIT-licensed mixture-of-experts model returns in an updated build shipping with FP8 weights for cheaper inference.

Jul 31, 2026

LGAI EXAONE/Text / LLM

LG AI Research debuts K-EXAONE 2.0, a 750B MoE model

The new mixture-of-experts model activates 37B parameters per token and targets English, Korean, and Spanish reasoning tasks.

Jul 29, 2026

Licensing and Limitations