The Open Weights
LatestModelsLeaderboardsUpcomingCompanies
Subscribe
The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

Refreshed every 12 hours

Discover

  • Latest releases
  • New today
  • Trending models
  • Upcoming launches

Browse

  • All models
  • Companies
  • Categories
  • Leaderboards

About

  • About
  • Editorial policy
  • RSS feed
  • Newsletter

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.

LatestGoogle DeepMind4
Google DeepMindAny-to-Any

Google Releases Gemma 4, a 26B Vision-Language Model

The new open-source model from DeepMind uses a Mixture-of-Experts architecture to handle both text and image inputs efficiently.

Mar 11, 2026
Major releaseApache 2.0
Google DeepMind · Any-to-Any
Gemma 4 12B
Gemma 4 12B

Google DeepMind has expanded its open-source offerings with the release of Gemma 4 26B Instruct, a new vision-language model. Published under a permissive Apache 2.0 license, this model is designed to understand and process both text and images, making it a versatile tool for multimodal applications.

An Efficient Multimodal Architecture

The key innovation in Gemma 4 is its Mixture-of-Experts (MoE) architecture. While the model contains a total of 26 billion parameters, it's designed for efficiency by activating only a fraction of them for any given task. The model's designation, "A4B," suggests that approximately 4 billion parameters are active at a time, offering potent performance without the full computational cost of a dense 26B model.

As an instruction-tuned model, Gemma 4 26B is optimized to follow user prompts and commands, making it suitable for a wide range of chat and assistant-style applications. Researchers and developers can access the model and its technical details on its Hugging Face repository.

This release signals Google's continued investment in the open-source AI ecosystem, providing a powerful, state-of-the-art multimodal model to the community. The efficient MoE design makes advanced vision-language capabilities more accessible, enabling new possibilities for applications that can see and reason about the world.

Sources

  • google/gemma-4-26B-A4B-it

    Hugging Face

    Visit

0 comments

Protected by Turnstile

No comments yet. Be the first to weigh in.

Get the model

Weights

Specs

Parameters26B · MoE
Context window—
LicenseAPACHE-2.0
Downloads182K

Modalities

Vision-LanguageText / LLM
5 versions — view changelog

More in Any-to-Any

MiniMax
MiniMax-M3
MiniMax-M3
MiniMax/Vision-Language

MiniMax Releases M3, a Multimodal MoE Model

The new open-weight model from MiniMax AI combines vision, coding, and reasoning using a Mixture-of-Experts architecture.

Jun 2, 2026
Google DeepMind
Gemma 4 12B
Gemma 4 12B
Google DeepMind/Any-to-Any

Google Releases Gemma 4 12B Multimodal Model

The new 12-billion-parameter open model from DeepMind introduces a unified 'any-to-any' architecture for advanced multimodal tasks.

May 23, 2026
Google DeepMind
Gemma 4 12B
Gemma 4 12B
Google DeepMind/Any-to-Any

Google Releases Gemma 4, a 12B 'Any-to-Any' Model

The new 12-billion-parameter model from Google DeepMind is designed to handle a flexible mix of data types, moving beyond traditional text and image inputs.

May 23, 2026