Qwen · AlibabaVision-Language

Qwen Releases 30B MoE Vision Model, Qwen3-VL

The new open-source model from Alibaba uses a Mixture-of-Experts architecture to make its powerful vision-language capabilities more efficient to run.

Sep 30, 2025

Major releaseApache 2.0

Alibaba's Qwen team has released Qwen3-VL, a new open-source vision-language model (VLM) that combines high performance with computational efficiency. This instruction-tuned model is designed to understand and process both text and images, making it suitable for a wide range of multimodal tasks.

The model's key innovation is its Mixture-of-Experts (MoE) architecture. While it contains a total of 30 billion parameters, only 3 billion are activated during inference for any given input. This design allows it to achieve the performance associated with a much larger model while maintaining the speed and lower resource requirements of a smaller one, a significant advantage for developers and researchers.

As an instruction-tuned model, Qwen3-VL is optimized for conversational and task-oriented applications. It can follow complex commands that involve analyzing visual content, such as answering detailed questions about an image or generating descriptive captions. This makes it a powerful tool for building more sophisticated AI assistants and applications.

The model is released under the permissive Apache 2.0 license, encouraging broad adoption for both academic and commercial projects. Full details and model weights are available on its Hugging Face repository.

Sources

Qwen/Qwen3-VL-30B-A3B-Instruct
Hugging Face
Visit

0 comments

No comments yet. Be the first to weigh in.

Thinking Machines Debuts Inkling Small, a Compact Multimodal MoE

The Apache-2.0 model brings mixture-of-experts efficiency to image, audio, and text tasks in a smaller footprint.

Jul 27, 2026

Microsoft/Vision-Language

Microsoft's Mage-VL Streams Video Natively

A codec-native multimodal foundation model aims to understand live video and vision-language input in real time.

Jul 26, 2026

Swiss Ai/Text / LLM

Apertus v1.5 70B arrives with an Apache-2.0 license

Switzerland's open-model effort ships a 70-billion-parameter, multilingual and multimodal system that anyone can use, modify, and deploy.

Jul 24, 2026