Qwen · AlibabaVision-Language

Qwen releases flagship 397B multimodal MoE

The new open-source model from Alibaba uses a Mixture-of-Experts architecture to balance massive scale with efficient inference.

Feb 16, 2026

Major releaseApache 2.0

The Qwen team at Alibaba has released Qwen3.5-397B-A17B, a powerful new open-source model that pushes the boundaries of scale and efficiency. As detailed on its Hugging Face repository, the model features a staggering 397 billion total parameters, making it one of the largest open models available.

A Sparse Architecture at Scale

What makes this scale manageable is its Mixture-of-Experts (MoE) architecture. Instead of activating all 397 billion parameters for every task, the model intelligently routes queries through a smaller subset, using only 17 billion active parameters at inference time. This "sparse" approach allows for the vast knowledge capacity of a huge model while keeping computational demands relatively low.

Beyond its scale, Qwen3.5 is also a capable vision-language model (VLM). This multimodal capability means it can process and understand both text and images, enabling more complex applications in areas like image captioning, visual question answering, and content analysis.

Released under the permissive Apache 2.0 license, Qwen3.5-397B-A17B represents a significant contribution to the open-source AI ecosystem. By providing access to a flagship-class MoE model, Alibaba is enabling developers and researchers to build on top of state-of-the-art multimodal AI without the need to train such a massive model from scratch.

Sources

Qwen/Qwen3.5-397B-A17B
Hugging Face
Visit

0 comments

No comments yet. Be the first to weigh in.

Thinking Machines Debuts Inkling Small, a Compact Multimodal MoE

The Apache-2.0 model brings mixture-of-experts efficiency to image, audio, and text tasks in a smaller footprint.

Jul 27, 2026

Microsoft/Vision-Language

Microsoft's Mage-VL Streams Video Natively

A codec-native multimodal foundation model aims to understand live video and vision-language input in real time.

Jul 26, 2026

Swiss Ai/Text / LLM

Apertus v1.5 70B arrives with an Apache-2.0 license

Switzerland's open-model effort ships a 70-billion-parameter, multilingual and multimodal system that anyone can use, modify, and deploy.

Jul 24, 2026