Qwen · AlibabaVision-Language

Alibaba Releases Qwen3-VL, an 8B Open-Source Vision Model

The latest vision-language model from the popular Qwen series is instruction-tuned and available under an Apache 2.0 license.

Oct 11, 2025

NotableApache 2.0

Alibaba's Qwen team has launched Qwen3-VL-8B-Instruct, a new vision-language model (VLM) built on their latest Qwen3 architecture. This release adds powerful multimodal capabilities to the recently introduced Qwen3 family of open-source models.

As an instruction-tuned VLM, Qwen3-VL-8B is designed to understand and process both images and text simultaneously. It can perform a wide range of tasks that require visual reasoning, such as answering detailed questions about an image, generating captions, and identifying specific objects within a scene.

With 8 billion parameters, the model occupies a practical middle ground, offering strong performance without the demanding hardware requirements of much larger, proprietary systems. Its release provides developers and researchers with a capable tool for building multimodal applications, from enhanced chatbots to sophisticated content analysis systems.

The model is available under the permissive Apache 2.0 license, encouraging both academic and commercial use. Interested users can find the model weights and documentation on the Qwen Hugging Face repository.

Sources

Qwen/Qwen3-VL-8B-Instruct
Hugging Face
Visit

0 comments

No comments yet. Be the first to weigh in.

Thinking Machines Debuts Inkling Small, a Compact Multimodal MoE

The Apache-2.0 model brings mixture-of-experts efficiency to image, audio, and text tasks in a smaller footprint.

Jul 27, 2026

Microsoft/Vision-Language

Microsoft's Mage-VL Streams Video Natively

A codec-native multimodal foundation model aims to understand live video and vision-language input in real time.

Jul 26, 2026

Swiss Ai/Text / LLM

Apertus v1.5 70B arrives with an Apache-2.0 license

Switzerland's open-model effort ships a 70-billion-parameter, multilingual and multimodal system that anyone can use, modify, and deploy.

Jul 24, 2026