The Open Weights
LatestModelsLeaderboardsUpcomingCompanies
Subscribe
The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

Refreshed every 12 hours

Discover

  • Latest releases
  • New today
  • Trending models
  • Upcoming launches

Browse

  • All models
  • Companies
  • Categories
  • Leaderboards

About

  • About
  • Editorial policy
  • RSS feed
  • Newsletter

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.

LatestMeituanNext
MeituanAny-to-Any

Meituan Releases LongCat-Next 'Any-to-Any' AI Model

The Chinese tech company has released the weights for a unified model that can process and generate combinations of text, images, audio, and video.

Mar 25, 2026
NotableMIT
Meituan · Any-to-Any
LongCat-Next
LongCat-Next

Chinese technology company Meituan has released the weights for LongCat-Next, an ambitious 'any-to-any' multimodal model. Published under a permissive MIT license, the model marks a significant step towards more flexible and generalized AI systems that can operate across a wide spectrum of data types.

Unlike most multimodal models that handle specific input-output pairs, such as text-to-image or image-to-text, LongCat-Next is designed for true combinatorial flexibility. It can accept any mix of text, images, audio, and video as input and generate any combination of those modalities as output. For example, it could take an image and an audio clip as prompts and produce a descriptive paragraph and a short video in response.

A Unified Architecture

The model achieves this versatility through a unified framework. Instead of stitching together separate, specialized encoders and decoders for each data type, LongCat-Next uses a single, end-to-end trained network. This architecture relies on a shared vocabulary to represent and process information from different sources, enabling it to generate coherent, multimodal content from complex prompts.

The release of LongCat-Next on the Hugging Face Hub provides researchers and developers with a powerful tool for exploring the frontiers of multimodal AI. Its open-ended capabilities and permissive license encourage experimentation in creative content generation, data synthesis, and complex reasoning tasks that span multiple domains.

Sources

  • meituan-longcat/LongCat-Next

    Hugging Face

    Visit

0 comments

Protected by Turnstile

No comments yet. Be the first to weigh in.

Get the model

Weights

Specs

Parameters—
Context window—
LicenseMIT
Downloads2K

Modalities

Any-to-AnyText / LLM

More in Any-to-Any

MiniMax
MiniMax-M3
MiniMax-M3
MiniMax/Vision-Language

MiniMax Releases M3, a Multimodal MoE Model

The new open-weight model from MiniMax AI combines vision, coding, and reasoning using a Mixture-of-Experts architecture.

Jun 2, 2026
Google DeepMind
Gemma 4 12B
Gemma 4 12B
Google DeepMind/Any-to-Any

Google Releases Gemma 4 12B Multimodal Model

The new 12-billion-parameter open model from DeepMind introduces a unified 'any-to-any' architecture for advanced multimodal tasks.

May 23, 2026
Google DeepMind
Gemma 4 12B
Gemma 4 12B
Google DeepMind/Any-to-Any

Google Releases Gemma 4, a 12B 'Any-to-Any' Model

The new 12-billion-parameter model from Google DeepMind is designed to handle a flexible mix of data types, moving beyond traditional text and image inputs.

May 23, 2026