The Open Weights
LatestModelsLeaderboardsUpcomingCompanies
Subscribe
The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

Refreshed every 12 hours

Discover

  • Latest releases
  • New today
  • Trending models
  • Upcoming launches

Browse

  • All models
  • Companies
  • Categories
  • Leaderboards

About

  • About
  • Editorial policy
  • RSS feed
  • Newsletter

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.

LatestGoogle DeepMind4
Google DeepMindAny-to-Any

Google Releases Gemma 4 12B Multimodal Model

The new 12-billion-parameter open model from DeepMind introduces a unified 'any-to-any' architecture for advanced multimodal tasks.

May 23, 2026
Major releaseApache 2.0
Google DeepMind · Any-to-Any
Gemma 4 12B
Gemma 4 12B

Google DeepMind has released Gemma 4 12B, a new generation of its open model family. This 12-billion-parameter model is available under a permissive Apache 2.0 license, continuing Google's commitment to providing powerful tools for the open-source AI community.

Unlike many existing vision-language models, Gemma 4 is built on what Google calls a "unified any-to-any" architecture. This design aims to natively handle a wide variety of data modalities for both input and output, moving beyond the common text-and-image limitations of previous systems.

Why 'Any-to-Any' Matters

This architectural approach is significant for developers. It simplifies the process of building complex applications that need to interpret and generate combinations of different data types, such as text, images, and potentially other formats in the future. Instead of chaining together multiple specialized models, developers can use a single, more integrated system, which could enable more fluid and capable AI assistants, creative tools, and analysis engines.

By releasing a model with this advanced multimodal design, Google provides a powerful new foundation for open-source development. Researchers and engineers can now experiment with and build upon this flexible architecture, pushing the boundaries of what's possible with open AI. The model is available now on Hugging Face.

Sources

  • google/gemma-4-12B

    Hugging Face

    Visit

0 comments

Protected by Turnstile

No comments yet. Be the first to weigh in.

Get the model

Weights

Specs

Parameters12B
Context window—
LicenseAPACHE-2.0
Downloads182K

Modalities

Any-to-AnyVision-LanguageText / LLM
5 versions — view changelog

More in Any-to-Any

MiniMax
MiniMax-M3
MiniMax-M3
MiniMax/Vision-Language

MiniMax Releases M3, a Multimodal MoE Model

The new open-weight model from MiniMax AI combines vision, coding, and reasoning using a Mixture-of-Experts architecture.

Jun 2, 2026
Google DeepMind
Gemma 4 12B
Gemma 4 12B
Google DeepMind/Any-to-Any

Google Releases Gemma 4, a 12B 'Any-to-Any' Model

The new 12-billion-parameter model from Google DeepMind is designed to handle a flexible mix of data types, moving beyond traditional text and image inputs.

May 23, 2026
ByteDance
Lance
Lance
ByteDance/Any-to-Any

ByteDance Releases Lance, a Unified Generative AI Model

The 3-billion-parameter model handles image and video generation, editing, and understanding from a single set of weights under a permissive license.

May 15, 2026