Google Releases Open-Source DiffusionGemma 26B Model
The new 26B parameter model from DeepMind uses a diffusion-based architecture, a technique more common in image generation, to produce text.
Google DeepMind has released DiffusionGemma, a new family of open-source language models that explores a novel architecture for generating text. The first release is an instruction-tuned, 26-billion-parameter model, signaling Google's continued investment in both the Gemma family and alternative AI architectures.
A Different Approach to Generation
Unlike most large language models like GPT-4 or Llama 3, which are autoregressive and generate text one token at a time, DiffusionGemma employs a diffusion-based method. This technique, widely used in popular image generation models, starts with a complete, noisy sequence of text and iteratively refines it into a coherent output. This marks a notable departure from the standard Transformer architecture that has dominated natural language processing for years.
The newly released model is a Mixture-of-Experts (MoE) variant, which means that while it has 26 billion total parameters, only 4 billion are active for any given input. This design makes inference more computationally efficient than a dense model of the same size. According to its official model card, the DiffusionGemma 26B-A4B-it model is designed for a variety of image-to-text and text-to-text tasks.
By open-sourcing DiffusionGemma under a permissive Apache 2.0 license, Google is enabling the wider AI community to experiment with this less-common approach. The exploration of non-autoregressive models could lead to new discoveries in text generation, potentially offering different trade-offs in speed, coherence, and controllability compared to today's leading models.
Sources
- Visit
google/diffusiongemma-26B-A4B-it
Hugging Face
0 comments
No comments yet. Be the first to weigh in.
More in Text / LLM

Zhipu AI Releases MIT-Licensed GLM-5.2 MoE Model
The new bilingual model from the Chinese AI firm uses a Mixture of Experts architecture and sparse attention under a fully permissive license.

Weibo AI Releases VibeThinker-3B, a Compact Reasoning Model
The new 3-billion-parameter model from the Chinese tech giant focuses on challenging benchmarks in mathematics, coding, and graduate-level questions.
Moonshot AI Releases Kimi, a Multimodal Coding Model
The new Mixture-of-Experts model from the Chinese AI company can generate code while also understanding visual inputs, a rare combination in open models.