The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

Refreshed every 12 hours

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.


BAAI Releases Emu3.5, an 'Any-to-Any' Multimodal Model

The new open-source model from the Beijing Academy of Artificial Intelligence (BAAI) unifies text and image understanding and generation in a single architecture.

Oct 31, 2025
Notable · Apache 2.0
BAAI · Any-to-Any
Emu3.5

The Beijing Academy of Artificial Intelligence (BAAI) has released Emu3.5, a new open-source multimodal model. Available under the permissive Apache 2.0 license, Emu3.5 is designed as a native "any-to-any" system, capable of both understanding and generating interleaved text and images within a single, unified framework.

A Unified Architecture

Unlike systems that chain separate, specialized models for different tasks (e.g., one for captioning, another for image generation), Emu3.5 aims to handle diverse combinations of inputs and outputs natively. The model can accept prompts containing both text and images to generate responses that are also a mix of text and new images. This approach moves beyond simple text-to-image or image-to-text capabilities toward more fluid, conversational interactions across modalities.
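To make "interleaved" concrete, here is a minimal sketch of how a mixed text-and-image exchange could be represented as one ordered sequence. The `Text` and `Image` classes, the `modality_pattern` helper, and the file names are hypothetical and chosen for illustration only; this is not Emu3.5's API or its internal format.

```python
from dataclasses import dataclass
from typing import List, Union


@dataclass(frozen=True)
class Text:
    content: str


@dataclass(frozen=True)
class Image:
    # Hypothetical placeholder: a real system would carry pixel data or tokens.
    label: str


# A prompt or a response is one ordered list that may mix both modalities.
Segment = Union[Text, Image]


def modality_pattern(segments: List[Segment]) -> str:
    """Summarize a sequence by modality, e.g. 'text,image,text'."""
    return ",".join(type(s).__name__.lower() for s in segments)


# Assumed example exchange: text plus an image in, text and a new image out.
prompt = [Text("Turn this sketch into a watercolor:"), Image("sketch.png")]
response = [
    Text("Here is the result:"),
    Image("watercolor.png"),
    Text("Want a different palette?"),
]

print(modality_pattern(prompt))    # text,image
print(modality_pattern(response))  # text,image,text
```

In a chained pipeline, each modality switch would hand off to a different model. The point of the any-to-any framing is that a single model consumes and produces sequences like these directly.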

This unified design is a step toward more integrated multimodal systems. By handling complex, multimodal instructions within one architecture, models like Emu3.5 could support more sophisticated applications in creative tools, data analysis, and robotics. Researchers and developers can explore the model in its official Hugging Face repository, BAAI/Emu3.5.

Sources

  • BAAI/Emu3.5 (Hugging Face)


Specs

Parameters: not listed
Context window: not listed
License: Apache-2.0
Downloads: 164

Modalities

Any-to-Any, Vision-Language, Text → Image
