The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.

Alibaba Releases Qwen3-VL, an 8B Open-Source Vision Model

The latest vision-language model from the popular Qwen series is instruction-tuned and available under an Apache 2.0 license.

Oct 11, 2025
Notable · Apache 2.0
Qwen · Alibaba · Vision-Language
Qwen3-VL-8B-Instruct

Alibaba's Qwen team has launched Qwen3-VL-8B-Instruct, a new vision-language model (VLM) built on its Qwen3 architecture. The release adds multimodal capabilities to the recently introduced Qwen3 family of open-source models.

As an instruction-tuned VLM, Qwen3-VL-8B is designed to understand and process both images and text simultaneously. It can perform a wide range of tasks that require visual reasoning, such as answering detailed questions about an image, generating captions, and identifying specific objects within a scene.

With 8 billion parameters, the model occupies a practical middle ground, offering strong performance without the demanding hardware requirements of much larger proprietary systems. Its release gives developers and researchers a capable tool for building multimodal applications, from enhanced chatbots to content analysis systems.
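To put the hardware point in rough numbers, the sketch below estimates the memory needed just to hold the weights, using parameter count times bytes per parameter. The 8 billion figure comes from the article; the listed precisions are common choices and an assumption here, not formats confirmed for this release. The helper names are hypothetical.

```python
# Back-of-the-envelope weight-memory estimate: parameters x bytes per parameter.
# PARAMS is the nominal 8B figure from the article; the precisions below are
# illustrative assumptions, not formats confirmed for this model.

PARAMS = 8_000_000_000

BYTES_PER_PARAM = {
    "float32": 4.0,
    "bfloat16": 2.0,
    "int8": 1.0,
    "int4": 0.5,
}


def weight_memory_gb(num_params: int, bytes_per_param: float) -> float:
    """Return memory for the weights alone, in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


def estimate_all(num_params: int = PARAMS) -> dict[str, float]:
    """Map each assumed precision to its estimated weight footprint in GB."""
    return {
        name: weight_memory_gb(num_params, size)
        for name, size in BYTES_PER_PARAM.items()
    }


if __name__ == "__main__":
    for name, gb in estimate_all().items():
        print(f"{name:>9}: ~{gb:.0f} GB")
```

These figures cover the weights only. Activations, the key-value cache, and image inputs add to the total, so actual requirements will be higher and depend on the workload.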

The model is available under the permissive Apache 2.0 license, encouraging both academic and commercial use. Interested users can find the model weights and documentation on the Qwen Hugging Face repository.

Sources

  • Qwen/Qwen3-VL-8B-Instruct (Hugging Face)


Specs

Parameters: 8B
Context window: not listed
License: Apache-2.0
Downloads: 5.7M

Modalities

Vision-Language

More in Vision-Language

Moonshot AI
Kimi-K2.7-Code
Moonshot AI · Code

Moonshot AI Releases Kimi, a Multimodal Coding Model

The new Mixture-of-Experts model from the Chinese AI company can generate code while also understanding visual inputs, a rare combination in open models.

Jun 11, 2026

Google DeepMind
DiffusionGemma 26B-A4B Instruct
Google DeepMind · Text / LLM

Google Releases Open-Source DiffusionGemma 26B Model

The new 26B parameter model from DeepMind uses a diffusion-based architecture, a technique more common in image generation, to produce text.

Jun 9, 2026

MiniMax
MiniMax-M3
MiniMax · Vision-Language

MiniMax Releases M3, a Multimodal MoE Model

The new open-weight model from MiniMax AI combines vision, coding, and reasoning using a Mixture-of-Experts architecture.

Jun 2, 2026