Qwen Releases 80B Mixture-of-Experts Model
The new Qwen3-Next model from Alibaba combines a large parameter count with an efficient MoE architecture to balance performance and computational cost.
The Qwen team at Alibaba has released Qwen3-Next-80B-A3B-Instruct, a new large language model that employs a Mixture-of-Experts (MoE) architecture. This release marks the introduction of the Qwen3-Next series, signaling a focus on more computationally efficient designs for powerful models.
The key feature of this model is its MoE structure. While it contains a total of 80 billion parameters, only about 3 billion are activated for any given token, which is what the "A3B" in the model name refers to. This design aims to provide the knowledge and nuance of a very large model while keeping per-token compute closer to that of a much smaller dense model, lowering inference cost for a wider range of applications. The full set of weights still has to be stored, so the savings apply to compute per token rather than to memory.
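To make the idea of sparse activation concrete, here is a minimal sketch of top-k expert routing in plain Python. It is a toy illustration under stated assumptions, not Qwen's implementation: the expert count, the value of k, the toy router, and all function names (route_top_k, moe_layer, active_fraction) are hypothetical and chosen only for readability.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def route_top_k(router_logits, k):
    """Pick the k highest-scoring experts and normalize their weights."""
    order = sorted(range(len(router_logits)),
                   key=lambda i: router_logits[i], reverse=True)
    top = order[:k]
    weights = softmax([router_logits[i] for i in top])
    return list(zip(top, weights))

def moe_layer(x, experts, router, k):
    """Run only the selected experts and mix their outputs by router weight."""
    chosen = route_top_k(router(x), k)
    out = [0.0] * len(x)
    for idx, w in chosen:
        y = experts[idx](x)  # experts that were not chosen are never evaluated
        out = [o + w * yi for o, yi in zip(out, y)]
    return out, [idx for idx, _ in chosen]

def active_fraction(total_params, active_params):
    """Share of parameters used per token."""
    return active_params / total_params

# Hypothetical toy setup: 8 experts, 2 active per token.
NUM_EXPERTS = 8
TOP_K = 2

# Each toy expert just scales its input by a different constant.
experts = [lambda x, s=i + 1: [s * v for v in x] for i in range(NUM_EXPERTS)]

def router(x):
    """Toy deterministic router; a real router is a learned linear layer."""
    s = sum(x)
    return [math.sin(s * (i + 1)) for i in range(NUM_EXPERTS)]

if __name__ == "__main__":
    token = [0.5, -1.0, 2.0]
    output, used = moe_layer(token, experts, router, TOP_K)
    print("experts used:", used)
    print("output:", output)
    print("active share for 3B of 80B:", active_fraction(80e9, 3e9))
```

Only the selected experts run for each token, which is why compute scales with the active parameters rather than the total. With the figures quoted above, 3 billion of 80 billion parameters works out to 3.75% of the model per token.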
Technical Specifications
Beyond its MoE design, Qwen3-Next-80B-A3B-Instruct is an instruction-tuned model designed for chat and task completion. According to its model card, it supports a native context length of 262,144 tokens, making it suitable for tasks requiring long-form context and analysis. Rather than a standard Transformer stack, the model uses a hybrid attention design that combines Gated DeltaNet layers with gated attention layers for efficiency on long inputs.
Released under the permissive Apache 2.0 license, the Qwen3-Next-80B-A3B-Instruct model is available for both research and commercial use. This continues the trend of major AI labs contributing powerful, open models that allow developers to build without restrictive licensing, fostering broader innovation in the ecosystem.
Sources
- Qwen/Qwen3-Next-80B-A3B-Instruct (Hugging Face)