The Open Weights
LatestModelsLeaderboardsUpcomingCompanies
Subscribe
The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

Refreshed every 12 hours

Discover

  • Latest releases
  • New today
  • Trending models
  • Upcoming launches

Browse

  • All models
  • Companies
  • Categories
  • Leaderboards

About

  • About
  • Editorial policy
  • RSS feed
  • Newsletter

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.

LatestGoogle DeepMind4 31B
Google DeepMindAny-to-Any

Google Releases Multimodal Gemma 4 31B Model

The new 31-billion-parameter model is instruction-tuned and can process both text and images, marking a significant expansion for the Gemma family.

Mar 11, 2026
Major releaseGemma
Google DeepMind · Any-to-Any
Gemma 4 12B
Gemma 4 12B

Google DeepMind has expanded its open-weights lineup with the release of Gemma 4 31B IT, a powerful new multimodal model. This 31-billion-parameter release marks a significant milestone for the Gemma family, introducing vision-language capabilities at a substantial scale.

As a Vision Language Model (VLM), Gemma 4 31B can interpret and process information from both text and images simultaneously. This allows it to handle tasks that were previously out of reach for its text-only predecessors, such as describing the contents of a photograph or answering questions based on visual information.

The "IT" in the model's name signifies that it is instruction-tuned, meaning it has been specifically optimized to follow user prompts and perform conversational tasks. This fine-tuning makes it more reliable and useful for building interactive applications.

The introduction of a capable VLM under the Gemma name advances Google's open-source AI strategy, providing developers with a strong foundation for building sophisticated multimodal applications. The model is available under the Gemma license and can be accessed by researchers and developers on its Hugging Face repository.

Sources

  • google/gemma-4-31B-it

    Hugging Face

    Visit

0 comments

Protected by Turnstile

No comments yet. Be the first to weigh in.

Get the model

Weights

Specs

Parameters31B
Context window—
LicenseGEMMA
Downloads182K

Modalities

Vision-LanguageText / LLM
5 versions — view changelog

More in Any-to-Any

MiniMax
MiniMax-M3
MiniMax-M3
MiniMax/Vision-Language

MiniMax Releases M3, a Multimodal MoE Model

The new open-weight model from MiniMax AI combines vision, coding, and reasoning using a Mixture-of-Experts architecture.

Jun 2, 2026
Google DeepMind
Gemma 4 12B
Gemma 4 12B
Google DeepMind/Any-to-Any

Google Releases Gemma 4 12B Multimodal Model

The new 12-billion-parameter open model from DeepMind introduces a unified 'any-to-any' architecture for advanced multimodal tasks.

May 23, 2026
Google DeepMind
Gemma 4 12B
Gemma 4 12B
Google DeepMind/Any-to-Any

Google Releases Gemma 4, a 12B 'Any-to-Any' Model

The new 12-billion-parameter model from Google DeepMind is designed to handle a flexible mix of data types, moving beyond traditional text and image inputs.

May 23, 2026