The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

Refreshed every 12 hours

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.

Meituan Debuts LongCat-Flash-Omni, an Any-to-Any AI Model

The new open-source Mixture-of-Experts model can process and generate any combination of text, images, video, audio, and 3D data.

Oct 23, 2025
Notable · MIT
Meituan · Any-to-Any
LongCat-Flash-Omni

Chinese technology company Meituan has released LongCat-Flash-Omni, a new open-source model billed as an "any-to-any" omni-modal system. The model is meant to both accept and generate content in several data formats within one system, rather than being limited to text or to text plus images.

According to the release details on Hugging Face, the model's main selling point is that it handles multiple data modalities within a single framework. The modalities listed are:

  • Text
  • Images
  • Video
  • Audio
  • 3D data

The model is built on a Mixture-of-Experts (MoE) architecture, in which a router activates only a small subset of specialized expert sub-networks for each input instead of running the whole network. This lets the total parameter count grow without a proportional increase in compute per token, while accommodating the separate encoders that different data types require. Inputs from the various modalities are projected into a shared representational space so that the core language model can process them together.
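To make the two ideas in that paragraph concrete, here is a minimal sketch in plain Python of projecting per-modality features into a shared space and then routing each resulting token to its top-k experts. It is an illustration of the general technique only, not Meituan's implementation: the dimensions, the number of experts, the random weights, and every name here (`SHARED_DIM`, `route_top_k`, `projections`, and so on) are assumptions made up for this example.

```python
import math
import random

SHARED_DIM = 4  # hypothetical width of the shared representation space


def make_matrix(rows, cols, rng):
    """Random weight matrix as a list of rows (stand-in for trained weights)."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Multiply a (rows x cols) matrix by a length-cols vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(token, router, experts, k=2):
    """Send one token to its k highest-scoring experts and mix their outputs."""
    scores = matvec(router, token)  # one logit per expert
    chosen = sorted(range(len(experts)), key=lambda i: scores[i], reverse=True)[:k]
    weights = softmax([scores[i] for i in chosen])  # renormalise over chosen experts
    output = [0.0] * len(token)
    for weight, index in zip(weights, chosen):
        expert_out = matvec(experts[index], token)  # only chosen experts run
        output = [o + weight * e for o, e in zip(output, expert_out)]
    return output, chosen


rng = random.Random(0)
# Hypothetical raw feature sizes produced by each modality's encoder.
encoder_dims = {"text": 6, "image": 8, "audio": 5}
# One projection per modality maps encoder features into the shared space.
projections = {name: make_matrix(SHARED_DIM, dim, rng) for name, dim in encoder_dims.items()}
router = make_matrix(4, SHARED_DIM, rng)  # 4 experts, one router row each
experts = [make_matrix(SHARED_DIM, SHARED_DIM, rng) for _ in range(4)]

for name, dim in encoder_dims.items():
    features = [rng.uniform(-1.0, 1.0) for _ in range(dim)]
    token = matvec(projections[name], features)  # now in the shared space
    mixed, chosen = route_top_k(token, router, experts, k=2)
    print(name, "-> experts", chosen, "output width", len(mixed))
```

Because only `k` of the experts run for any given token, adding more experts increases capacity without increasing the work done per token. A real system would learn the projection, router, and expert weights, and would typically add load balancing across experts, none of which is shown here.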

LongCat-Flash-Omni is released under the permissive MIT license, which allows researchers and developers to use, modify, and redistribute it, including commercially. Its architecture is aimed at applications that need to understand and generate content across several formats at once.

Sources

  • meituan-longcat/LongCat-Flash-Omni (Hugging Face)

Specs

Parameters: not listed · MoE
Context window: not listed
License: MIT
Downloads: 44

Modalities

Any-to-Any

More in Any-to-Any

MiniMax · Vision-Language

MiniMax Releases M3, a Multimodal MoE Model

The new open-weight model from MiniMax AI combines vision, coding, and reasoning using a Mixture-of-Experts architecture.

Jun 2, 2026

Google DeepMind · Any-to-Any

Google Releases Gemma 4 12B Multimodal Model

The new 12-billion-parameter open model from DeepMind introduces a unified 'any-to-any' architecture for advanced multimodal tasks.

May 23, 2026

Google DeepMind · Any-to-Any

Google Releases Gemma 4, a 12B 'Any-to-Any' Model

The new 12-billion-parameter model from Google DeepMind is designed to handle a flexible mix of data types, moving beyond traditional text and image inputs.

May 23, 2026