Zhipu AI/Text → Image

Zhipu AI Releases Open, Bilingual GLM-Image Model

The new text-to-image model handles prompts in both Chinese and English, builds on the CogView2 architecture, and ships under the permissive MIT license.

Jan 8, 2026

Notable · MIT

Zhipu AI, a Beijing-based AI research company, has released GLM-Image, an open-source model that generates images from text descriptions. Its main distinction is native bilingual support: it accepts prompts in both Chinese and English.

This release adds to an open-source multimodal landscape that has historically been dominated by English-centric models. By shipping a bilingual model under the permissive MIT license, Zhipu AI lowers the barrier for developers and researchers to build applications for a more linguistically diverse audience.

Technical Foundations

GLM-Image is a diffusion model built on the architecture of CogView2, an earlier text-to-image model from the same research lineage. It pairs a Transformer-based text encoder, which interprets the prompt, with a diffusion model that synthesizes the final image.
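
The pairing described above can be pictured with a toy sketch. It is not GLM-Image's code and not a real diffusion model: `encode_prompt`, `denoise_step`, and `generate` are hypothetical stand-ins that only illustrate the data flow, in which a prompt becomes a conditioning vector and an iterative loop refines random noise under that conditioning.

```python
import random
from typing import List


def encode_prompt(prompt: str, dim: int = 8) -> List[float]:
    """Toy stand-in for a text encoder: fold characters into a fixed-size vector."""
    vec = [0.0] * dim
    for i, ch in enumerate(prompt):
        # Works for any Unicode text, so Chinese and English prompts both map to vectors.
        vec[i % dim] += ord(ch) / 1000.0
    return vec


def denoise_step(latent: List[float], cond: List[float], strength: float = 0.2) -> List[float]:
    """Toy stand-in for one denoising step: nudge the latent toward the conditioning."""
    return [x + strength * (c - x) for x, c in zip(latent, cond)]


def generate(prompt: str, steps: int = 20, seed: int = 0) -> List[float]:
    """Start from seeded Gaussian noise and refine it step by step under the prompt."""
    rng = random.Random(seed)
    cond = encode_prompt(prompt)
    latent = [rng.gauss(0.0, 1.0) for _ in cond]
    for _ in range(steps):
        latent = denoise_step(latent, cond)
    return latent
```

In a real system the encoder is a learned Transformer and each step is a learned noise prediction, but the shape of the loop (encode once, then refine repeatedly) is the part this sketch is meant to convey.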

The model weights are available for download from the project's Hugging Face repository. The release continues Zhipu AI's pattern of publishing its models to the open-source community.
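
For readers who want to fetch the weights, the following is a minimal sketch using the `huggingface_hub` library's `snapshot_download` function. The repository ID comes from the article's source listing; the helper names and the `glm-image` target directory are assumptions made for illustration, and the repository's own model card should be checked for the recommended loading code.

```python
from pathlib import Path

# Repository ID as listed in the article's sources.
REPO_ID = "zai-org/GLM-Image"


def repo_url() -> str:
    """Return the Hugging Face page for the repository."""
    return f"https://huggingface.co/{REPO_ID}"


def download_weights(local_dir: str = "glm-image") -> Path:
    """Download every file in the repository into local_dir (hypothetical path)."""
    # Imported lazily so the module loads even when huggingface_hub is not installed.
    from huggingface_hub import snapshot_download

    return Path(snapshot_download(repo_id=REPO_ID, local_dir=local_dir))
```

Calling `download_weights()` requires network access and enough disk space for the full checkpoint; it returns the local folder that holds the downloaded files.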

Sources

zai-org/GLM-Image (Hugging Face)

