The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.

Google's Gemma 4 Debuts with Any-to-Any Multimodality

The new 4-billion-parameter model from Google DeepMind is designed for versatile input and output, handling text, images, and other data types.

Mar 2, 2026
Major release · Apache 2.0
Google DeepMind · Any-to-Any
Gemma 4 E4B

Google has introduced a new model series with the release of Gemma 4 E4B Instruct, a compact multimodal model. Developed by Google DeepMind, this 4-billion-parameter model extends the open-source Gemma family beyond text-only capabilities to a more versatile 'any-to-any' architecture.

This design lets the model accept a range of inputs, such as text and images, and generate various types of output in response. Where many vision-language models can only generate text, Gemma 4 is built for more flexible interactions involving mixed-media content.

Built for a Multimodal World

The Gemma 4 E4B Instruct model is specifically tuned for following instructions, making it well-suited for chat and task-oriented applications right out of the box. Its key features include:

  • Compact Size: At 4B parameters, it offers advanced capabilities in a relatively efficient package.
  • Permissive License: Released under the Apache 2.0 license, it allows for broad commercial and research use.
  • Flexible I/O: The any-to-any architecture is designed for handling and generating mixed-media content.

This release marks a strategic move by Google to bring more powerful, general-purpose multimodal systems to the open-source community. For developers and researchers, Gemma 4 provides an accessible tool for experimenting with and building the next generation of AI that can see, read, and create in multiple formats. You can find the model and further details on its Hugging Face repository.

Sources

  • google/gemma-4-E4B-it (Hugging Face)

Specs

  • Parameters: 4B
  • Context window: not listed
  • License: Apache 2.0
  • Downloads: 551.2K

Modalities

  • Any-to-Any
  • Vision-Language
  • Text / LLM
