The Open Weights
LatestModelsLeaderboardsUpcomingCompanies
Subscribe
The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

Refreshed every 12 hours

Discover

  • Latest releases
  • New today
  • Trending models
  • Upcoming launches

Browse

  • All models
  • Companies
  • Categories
  • Leaderboards

About

  • About
  • Editorial policy
  • RSS feed
  • Newsletter

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.

LatestQwen · AlibabaQwen3-Omni Thinking
Qwen · AlibabaAny-to-Any

Qwen Releases 'Thinking' Multimodal MoE Model

The new 30-billion-parameter Mixture-of-Experts model from Alibaba's Qwen team is designed to show its reasoning process for complex multimodal tasks.

Sep 15, 2025
NotableOther
Qwen · Alibaba · Any-to-Any
Qwen3-Omni-30B-A3B-Thinking
Qwen3-Omni-30B-A3B-Thinking

Alibaba's Qwen team has introduced a new model designed for transparent reasoning, called Qwen3-Omni-30B-A3B-Thinking. This release is a specialized variant of the broader Qwen3-Omni family, focusing on tasks that require complex, multi-step logic across various data types.

The model's key feature is its ability to output its "chain of thought," a step-by-step trace of its reasoning process. This transparency is a significant advantage for developers, allowing them to better understand, debug, and guide the model's decision-making. Instead of just providing a final answer, Qwen3-Omni shows its work, demystifying the path it took to reach a conclusion.

Efficient Architecture, Broad Capabilities

Under the hood, Qwen3-Omni is a Mixture-of-Experts (MoE) model. While it contains a total of 30 billion parameters, it only activates an average of 3 billion for any given task. This architecture aims to provide the knowledge scale of a large model with the inference efficiency closer to that of a much smaller one.

As an "omni-modal" model, its capabilities extend beyond text to a wide range of inputs:

  • Image and video understanding
  • Audio processing
  • Document analysis

This versatility makes it suitable for complex applications that need to synthesize information from multiple sources. The model and its weights are available for developers to explore on its Hugging Face repository. It's released under a custom license agreement, so users should review the terms before deployment.

Sources

  • Qwen/Qwen3-Omni-30B-A3B-Thinking

    Hugging Face

    Visit

0 comments

Protected by Turnstile

No comments yet. Be the first to weigh in.

Get the model

Weights

Specs

Parameters30B · MoE
Context window—
LicenseOTHER
Downloads227.6K

Modalities

Any-to-AnyReasoningVision-Language

More in Any-to-Any

MiniMax
MiniMax-M3
MiniMax-M3
MiniMax/Vision-Language

MiniMax Releases M3, a Multimodal MoE Model

The new open-weight model from MiniMax AI combines vision, coding, and reasoning using a Mixture-of-Experts architecture.

Jun 2, 2026
Google DeepMind
Gemma 4 12B
Gemma 4 12B
Google DeepMind/Any-to-Any

Google Releases Gemma 4 12B Multimodal Model

The new 12-billion-parameter open model from DeepMind introduces a unified 'any-to-any' architecture for advanced multimodal tasks.

May 23, 2026
Google DeepMind
Gemma 4 12B
Gemma 4 12B
Google DeepMind/Any-to-Any

Google Releases Gemma 4, a 12B 'Any-to-Any' Model

The new 12-billion-parameter model from Google DeepMind is designed to handle a flexible mix of data types, moving beyond traditional text and image inputs.

May 23, 2026