The Open Weights
LatestModelsLeaderboardsUpcomingCompanies
Subscribe
The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

Refreshed every 12 hours

Discover

  • Latest releases
  • New today
  • Trending models
  • Upcoming launches

Browse

  • All models
  • Companies
  • Categories
  • Leaderboards

About

  • About
  • Editorial policy
  • RSS feed
  • Newsletter

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.

LatestQwen · Alibaba3-VL
Qwen · AlibabaVision-Language

Qwen Releases 30B MoE Vision Model, Qwen3-VL

The new open-source model from Alibaba uses a Mixture-of-Experts architecture to make its powerful vision-language capabilities more efficient to run.

Sep 30, 2025
Major releaseApache 2.0
Qwen · Alibaba · Vision-Language
Qwen3-VL-8B-Instruct
Qwen3-VL-8B-Instruct

Alibaba's Qwen team has released Qwen3-VL, a new open-source vision-language model (VLM) that combines high performance with computational efficiency. This instruction-tuned model is designed to understand and process both text and images, making it suitable for a wide range of multimodal tasks.

The model's key innovation is its Mixture-of-Experts (MoE) architecture. While it contains a total of 30 billion parameters, only 3 billion are activated during inference for any given input. This design allows it to achieve the performance associated with a much larger model while maintaining the speed and lower resource requirements of a smaller one, a significant advantage for developers and researchers.

As an instruction-tuned model, Qwen3-VL is optimized for conversational and task-oriented applications. It can follow complex commands that involve analyzing visual content, such as answering detailed questions about an image or generating descriptive captions. This makes it a powerful tool for building more sophisticated AI assistants and applications.

The model is released under the permissive Apache 2.0 license, encouraging broad adoption for both academic and commercial projects. Full details and model weights are available on its Hugging Face repository.

Sources

  • Qwen/Qwen3-VL-30B-A3B-Instruct

    Hugging Face

    Visit

0 comments

Protected by Turnstile

No comments yet. Be the first to weigh in.

Get the model

Weights

Specs

Parameters30B · MoE
Context window—
LicenseAPACHE-2.0
Downloads5.7M

Modalities

Vision-LanguageAny-to-Any
2 versions — view changelog

More in Vision-Language

Moonshot AI
Kimi-K2.7-Code
Kimi-K2.7-Code
Moonshot AI/Code

Moonshot AI Releases Kimi, a Multimodal Coding Model

The new Mixture-of-Experts model from the Chinese AI company can generate code while also understanding visual inputs, a rare combination in open models.

Jun 11, 2026
Google DeepMind
DiffusionGemma 26B-A4B Instruct
DiffusionGemma 26B-A4B Instruct
Google DeepMind/Text / LLM

Google Releases Open-Source DiffusionGemma 26B Model

The new 26B parameter model from DeepMind uses a diffusion-based architecture, a technique more common in image generation, to produce text.

Jun 9, 2026
MiniMax
MiniMax-M3
MiniMax-M3
MiniMax/Vision-Language

MiniMax Releases M3, a Multimodal MoE Model

The new open-weight model from MiniMax AI combines vision, coding, and reasoning using a Mixture-of-Experts architecture.

Jun 2, 2026