The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

Refreshed every 12 hours

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.


Qwen Releases 80B Mixture-of-Experts Model

The new Qwen3-Next model from Alibaba combines a large parameter count with an efficient MoE architecture to balance performance and computational cost.

Sep 9, 2025
Major release · Apache 2.0
Qwen · Alibaba · Text / LLM
Qwen3-Next-80B-A3B-Instruct

The Qwen team at Alibaba has released Qwen3-Next-80B-A3B-Instruct, a new large language model that employs a Mixture-of-Experts (MoE) architecture. This release marks the introduction of the Qwen3-Next series, signaling a focus on more computationally efficient designs for powerful models.

The key feature of this model is its MoE structure. While it contains a total of 80 billion parameters, only about 3 billion (roughly 3.75%) are activated for processing any given token. This design aims to provide the capacity of a very large model while keeping per-token compute far lower. All 80 billion parameters still have to be held in memory, so the savings show up mainly in inference speed and cost rather than in memory footprint.
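The "80B total, 3B active" split can be illustrated with a small sketch. The parameter counts come from the article; everything else here (the toy expert count, the router scores, `k=2`, and the renormalized top-k gating) is a hypothetical illustration of generic MoE routing, not a description of how Qwen3-Next's router is actually configured.

```python
import math

TOTAL_PARAMS = 80e9   # total parameters, as stated in the article
ACTIVE_PARAMS = 3e9   # parameters activated per token, as stated in the article


def active_fraction(total: float = TOTAL_PARAMS, active: float = ACTIVE_PARAMS) -> float:
    """Share of the model's weights that take part in one token's forward pass."""
    return active / total


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(router_scores, k):
    """Pick the k highest-scoring experts and renormalize their gate weights.

    Returns {expert_index: weight}. Experts missing from the dict are skipped
    for this token, which is where the compute saving comes from.
    """
    probs = softmax(router_scores)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return {i: probs[i] / norm for i in top}


if __name__ == "__main__":
    print(f"active share per token: {active_fraction():.2%}")
    # Hypothetical router scores for one token over 8 toy experts.
    gates = route_top_k([0.1, 2.0, -1.0, 0.5, 1.5, 0.0, -0.3, 0.2], k=2)
    print(gates)
```

In this toy setup only two of the eight experts run for the token, and their outputs would be combined using the returned weights. A different token can select a different pair, so every expert still has to be resident in memory even though each token touches only a few.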

Technical Specifications

Beyond its MoE design, Qwen3-Next-80B-A3B-Instruct is an instruction-tuned model designed for chat and task completion. Its model card lists a native context length of 262,144 tokens, which suits tasks requiring long-form context and analysis. Rather than a standard Transformer stack, the model uses a hybrid attention design that interleaves Gated DeltaNet (linear attention) layers with gated full-attention layers, paired with sparse MoE feed-forward blocks.

Released under the permissive Apache 2.0 license, the Qwen3-Next-80B-A3B-Instruct model is available for both research and commercial use. This continues the trend of major AI labs contributing powerful, open models that allow developers to build without restrictive licensing, fostering broader innovation in the ecosystem.

Sources

  • Qwen/Qwen3-Next-80B-A3B-Instruct (Hugging Face)


Specs

Parameters: 80B · MoE
Context window: —
License: Apache-2.0
Downloads: 218.5K

Modalities

Text / LLM

More in Text / LLM

GLM-5.2
Zhipu AI · Text / LLM

Zhipu AI Releases MIT-Licensed GLM-5.2 MoE Model

The new bilingual model from the Chinese AI firm uses a Mixture of Experts architecture and sparse attention under a fully permissive license.

Jun 17, 2026

VibeThinker-3B
Weibo AI · Reasoning

Weibo AI Releases VibeThinker-3B, a Compact Reasoning Model

The new 3-billion-parameter model from the Chinese tech giant focuses on challenging benchmarks in mathematics, coding, and graduate-level questions.

Jun 12, 2026

Kimi-K2.7-Code
Moonshot AI · Code

Moonshot AI Releases Kimi, a Multimodal Coding Model

The new Mixture-of-Experts model from the Chinese AI company can generate code while also understanding visual inputs, a rare combination in open models.

Jun 11, 2026