The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.


LLaDA2.0-Uni: A Unified MoE for Vision Tasks

The new open-source model from inclusionAI uses a Mixture-of-Experts architecture to handle multiple vision tasks in a single, diffusion-based system.

Apr 22, 2026
Notable · Apache 2.0
inclusionAI · Any-to-Any

AI research group inclusionAI has released LLaDA2.0-Uni, a new open-source model that aims to unify a range of visual AI tasks. Released under the permissive Apache 2.0 license, the model handles image understanding, generation, and editing within a single framework.

At its core, LLaDA2.0-Uni uses a diffusion-based Mixture-of-Experts (MoE) architecture. The design matters because one model can cover different tasks without separate, specialized models having to be deployed. Instead of chaining together distinct systems for understanding, creating, and modifying images, LLaDA2.0-Uni integrates these functions into one coherent system.
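For readers unfamiliar with the MoE idea, here is a minimal sketch of generic top-k expert routing in plain Python. It is not inclusionAI's implementation: the toy experts, the gate scores, and the function names `route_top_k` and `moe_forward` are all hypothetical, and in a real model the gate scores would come from a learned router rather than being passed in by hand.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(gate_scores, k=2):
    """Pick the k highest-scoring experts.

    Returns a list of (expert_index, weight) pairs whose weights are
    renormalised to sum to 1.
    """
    probs = softmax(gate_scores)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]


def moe_forward(x, experts, gate_scores, k=2):
    """Weighted sum of the selected experts' outputs.

    Experts that are not selected are never called, which is where the
    compute saving of a sparse MoE layer comes from.
    """
    return sum(w * experts[i](x) for i, w in route_top_k(gate_scores, k))


# Hypothetical toy experts standing in for expert sub-networks.
experts = [
    lambda x: x + 1.0,
    lambda x: x * 2.0,
    lambda x: -x,
    lambda x: x ** 2,
]

# Hypothetical gate scores; a real router would compute these from the input.
gate_scores = [0.1, 2.0, -1.0, 2.0]

print(route_top_k(gate_scores, k=2))
print(moe_forward(3.0, experts, gate_scores, k=2))
```

Only two of the four toy experts run for this input. That sparsity is the general appeal of MoE designs: total capacity can grow with the number of experts while the work done per input stays roughly fixed.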

A Unified Approach

The model's key feature is its versatility. LLaDA2.0-Uni is designed to perform three primary categories of visual tasks:

  • Image Understanding: Analyzing and interpreting the content of an image.
  • Text-to-Image Generation: Creating new images from textual descriptions.
  • Image Editing: Modifying existing images based on instructions.

This unified capability is a step toward more consolidated and flexible multimodal systems. With understanding, generation, and editing combined in one model, developers can build leaner applications that mix generative and analytical vision features. The model is available on Hugging Face for researchers and developers to explore.
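For those who want to fetch the files, the sketch below uses the `huggingface_hub` library's `snapshot_download` function with the repository ID listed under Sources. The helper names `model_page_url` and `download_weights` are hypothetical, and the snippet assumes `huggingface_hub` is installed and that there is enough disk space for the full repository; it does not cover loading or running the model.

```python
# Repository ID as listed in the article's Sources section.
REPO_ID = "inclusionAI/LLaDA2.0-Uni"


def model_page_url(repo_id: str = REPO_ID) -> str:
    """Build the Hugging Face model page URL for a repository ID."""
    return f"https://huggingface.co/{repo_id}"


def download_weights(repo_id: str = REPO_ID, local_dir=None) -> str:
    """Download every file in the repository and return the local folder path.

    The import is deferred so the URL helper works even when the
    third-party package (pip install huggingface_hub) is not installed.
    """
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=repo_id, local_dir=local_dir)


if __name__ == "__main__":
    print(model_page_url())
    # Uncomment to download; this may transfer a large amount of data.
    # print(download_weights(local_dir="./llada2-uni"))
```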

Sources

  • inclusionAI/LLaDA2.0-Uni (Hugging Face)


Specs

  • Parameters: — · MoE
  • Context window: —
  • License: Apache 2.0
  • Downloads: 5.9K

Modalities

Any-to-Any · Text → Image · Image Editing
