The Open Weights
LatestModelsLeaderboardsUpcomingCompanies
Subscribe
The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

Refreshed every 12 hours

Discover

  • Latest releases
  • New today
  • Trending models
  • Upcoming launches

Browse

  • All models
  • Companies
  • Categories
  • Leaderboards

About

  • About
  • Editorial policy
  • RSS feed
  • Newsletter

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.

LatestinclusionAI2.0
inclusionAIAny-to-Any

inclusionAI's Ming 2.0 Tackles Any-to-Any Multimodality

The new open-source Mixture-of-Experts model can process and generate content across text, images, and audio in any combination.

Feb 10, 2026
NotableMIT
inclusionAI · Any-to-Any
Ming-flash-omni 2.0
Ming-flash-omni 2.0

AI research group inclusionAI has released Ming-flash-omni 2.0, an ambitious open-source model designed to natively handle text, images, and audio. Released under a permissive MIT license, the model aims to provide a single, unified system for 'any-to-any' multimodal tasks.

Unlike many multimodal models that primarily link text and images, Ming 2.0 is built to process and generate content across all three modalities interchangeably. This could enable capabilities like generating an image from an audio clip, describing a picture with spoken words, or transcribing speech, all within one framework.

The model utilizes a Mixture-of-Experts (MoE) architecture, a design that can lead to more efficient computation by only activating relevant parts of the network for a given task. While specific details on its parameter count and training data are not yet public, the MoE approach suggests a focus on scalable performance.

This release represents another step forward for complex, open-source AI systems that can perceive and create in ways more analogous to human senses. Researchers and developers can explore the model's capabilities on its Hugging Face repository.

Sources

  • inclusionAI/Ming-flash-omni-2.0

    Hugging Face

    Visit

0 comments

Protected by Turnstile

No comments yet. Be the first to weigh in.

Get the model

Weights

Specs

Parameters— · MoE
Context window—
LicenseMIT
Downloads2.4K

Modalities

Any-to-Any

More in Any-to-Any

MiniMax
MiniMax-M3
MiniMax-M3
MiniMax/Vision-Language

MiniMax Releases M3, a Multimodal MoE Model

The new open-weight model from MiniMax AI combines vision, coding, and reasoning using a Mixture-of-Experts architecture.

Jun 2, 2026
Google DeepMind
Gemma 4 12B
Gemma 4 12B
Google DeepMind/Any-to-Any

Google Releases Gemma 4 12B Multimodal Model

The new 12-billion-parameter open model from DeepMind introduces a unified 'any-to-any' architecture for advanced multimodal tasks.

May 23, 2026
Google DeepMind
Gemma 4 12B
Gemma 4 12B
Google DeepMind/Any-to-Any

Google Releases Gemma 4, a 12B 'Any-to-Any' Model

The new 12-billion-parameter model from Google DeepMind is designed to handle a flexible mix of data types, moving beyond traditional text and image inputs.

May 23, 2026