Alibaba's Qwen Team Releases Z-Image Diffusion Model
The makers of the popular Qwen language models have published their first open-source text-to-image generator under the permissive Apache 2.0 license.

Alibaba's Tongyi lab, the research group behind the well-regarded Qwen family of large language models, has released Z-Image, its first open-source text-to-image model. This release marks the team's expansion from open-source text generation into the competitive field of image synthesis.
The new model is a diffusion-based generator, a common architecture for creating high-fidelity images from text prompts. While the team has not disclosed specific architectural details or training data, the model is positioned as a new, capable foundation for developers and artists working with open-source tools.
Perhaps most significantly, Z-Image is available under the permissive Apache 2.0 license. The license allows broad use, including commercial applications, which could foster a new ecosystem of tools and services built on the model. The release also adds another major technology company to those contributing commercially usable models to the open-source community, a trend that has accelerated innovation in the space.
The model weights and usage instructions are now available on the Hugging Face Hub. As a new entrant from a team known for high-quality LLMs, Z-Image is a notable addition for anyone tracking the rapidly evolving landscape of generative AI.
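For readers who want to fetch the weights, the following is a minimal sketch rather than the team's official instructions. It assumes the repository ID `Tongyi-MAI/Z-Image` as given in the Sources list and that the `huggingface_hub` package is installed; the helper names `hub_url` and `download_weights` and the `z-image` target directory are hypothetical, chosen only for illustration.

```python
# Repository ID taken from the article's Sources list (an assumption that
# it matches the published Hugging Face repository exactly).
REPO_ID = "Tongyi-MAI/Z-Image"


def hub_url(repo_id: str = REPO_ID) -> str:
    """Return the model page URL on the Hugging Face Hub."""
    return f"https://huggingface.co/{repo_id}"


def download_weights(local_dir: str = "z-image") -> str:
    """Download every file in the repository and return the local path.

    The import sits inside the function so the URL helper above works
    even when huggingface_hub is not installed.
    """
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=REPO_ID, local_dir=local_dir)


if __name__ == "__main__":
    print("Model page:", hub_url())
    print("Files saved to:", download_weights())
```

Image-model checkpoints are often many gigabytes, so it is worth checking free disk space first. The model card on the Hub remains the authoritative source for how to load and run the model.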
Sources
- Tongyi-MAI/Z-Image (Hugging Face)