Google Releases Gemma 4, a 12B 'Any-to-Any' Model
The new 12-billion-parameter model from Google DeepMind is designed to handle a flexible mix of data types, moving beyond traditional text and image inputs.
Google DeepMind has expanded its open-weights portfolio with the release of Gemma 4 12B Instruct, a new 12-billion-parameter model. The model's key innovation is its unified 'any-to-any' multimodal architecture, designed to handle diverse data inputs and outputs seamlessly.
Unlike traditional models that are often limited to specific input-output pairs like text-to-image, Gemma 4 is built for more flexible, generalized reasoning. According to the release details on Hugging Face, its 'any-to-any' design allows it to process combinations of modalities simultaneously, a significant step toward more capable AI systems.
Why It Matters
The arrival of Gemma 4 democratizes a sophisticated architecture previously seen in much larger, closed models. By packaging these capabilities into a relatively efficient 12B parameter model, Google enables a wider range of researchers and developers to experiment with advanced multimodal applications that require less computational overhead.
The model is available under the custom Gemma license, which includes specific terms for usage and distribution. As an instruction-tuned variant, Gemma 4 12B is optimized for direct use in conversational and task-oriented applications.
Sources
- Visit
google/gemma-4-12B-it
Hugging Face
0 comments
No comments yet. Be the first to weigh in.
More in Any-to-Any

MiniMax Releases M3, a Multimodal MoE Model
The new open-weight model from MiniMax AI combines vision, coding, and reasoning using a Mixture-of-Experts architecture.
Google Releases Gemma 4 12B Multimodal Model
The new 12-billion-parameter open model from DeepMind introduces a unified 'any-to-any' architecture for advanced multimodal tasks.

ByteDance Releases Lance, a Unified Generative AI Model
The 3-billion-parameter model handles image and video generation, editing, and understanding from a single set of weights under a permissive license.