MiniMax Debuts M2.1, an MoE Model Optimized with FP8
The new Mixture of Experts model from the Chinese AI firm uses 8-bit floating-point precision for a smaller memory footprint and faster inference.

AI research company MiniMax has released MiniMax-M2.1, a new large language model built on a Mixture of Experts (MoE) architecture. Unlike many contemporary models that use 16-bit precision, M2.1 is notable for its use of FP8 weights, a strategic choice aimed at significantly boosting computational efficiency.
Precision and Performance
The adoption of 8-bit floating-point (FP8) precision is a key trend in making large models more practical to deploy. By representing model weights with fewer bits, FP8 reduces the model's memory footprint and can accelerate inference speed on compatible hardware. This approach allows powerful MoE models, which are often very large, to run on less demanding infrastructure, though it can introduce a trade-off with numerical precision.
As a Mixture of Experts model, M2.1 is designed for efficiency at its core. MoE architectures process inputs by routing them to specialized "expert" subnetworks, activating only a fraction of the model's total parameters for any given task. This sparse activation, combined with the FP8 data type, positions M2.1 as a model focused on efficient scaling.
The model is available for download and experimentation on the Hugging Face Hub, but it comes with a custom license. The terms restrict usage to non-commercial and research purposes only, prohibiting commercial deployment and the redistribution of the model or any derivatives.
Sources
- Visit
MiniMaxAI/MiniMax-M2.1
Hugging Face
0 comments
No comments yet. Be the first to weigh in.
More in Text / LLM

Zhipu AI Releases MIT-Licensed GLM-5.2 MoE Model
The new bilingual model from the Chinese AI firm uses a Mixture of Experts architecture and sparse attention under a fully permissive license.

Weibo AI Releases VibeThinker-3B, a Compact Reasoning Model
The new 3-billion-parameter model from the Chinese tech giant focuses on challenging benchmarks in mathematics, coding, and graduate-level questions.
Moonshot AI Releases Kimi, a Multimodal Coding Model
The new Mixture-of-Experts model from the Chinese AI company can generate code while also understanding visual inputs, a rare combination in open models.