TencentText → Image

Tencent Debuts HunyuanImage 3.0 with MoE Design

The new text-to-image generator from the Chinese tech giant uses a Mixture-of-Experts architecture for more efficient and detailed image creation.

Sep 25, 2025

NotableOther

Tencent has released HunyuanImage 3.0 Instruct, a new text-to-image model that brings an architectural design more commonly seen in large language models to the world of image generation. The model is part of the company's Hunyuan series and is now available for researchers and developers to explore.

An Expert Approach to Pixels

The key innovation in HunyuanImage 3.0 is its use of a Mixture-of-Experts (MoE) framework. This allows the model to activate only the most relevant parts of its network for a given task, potentially leading to more efficient processing and higher-quality outputs. By combining this with an instruction-tuned approach, the model is designed to better understand and execute complex, multi-part prompts.

The model's architecture is built on a transformer backbone called Hunyuan-DiT. According to Tencent's release notes, this enables strong performance in areas like following detailed instructions and even engaging in multi-turn, dialogue-based image creation, making the generation process more conversational.

While the weights are publicly accessible on the Hugging Face Hub, they are released under a custom license. Users should review the specific terms of the "Hunyuan-Image-3.0-Instruct License Agreement" before using the model in their projects. This release marks another significant entry from a major tech firm into the open-weights AI landscape, pushing new architectures into different modalities.

Sources

tencent/HunyuanImage-3.0-Instruct
Hugging Face
Visit

0 comments

No comments yet. Be the first to weigh in.

Microsoft's Mage-Flow packs image editing into 4B

A compact model handles both text-to-image generation and instruction-based edits at native resolution, under a permissive MIT license.

Jul 21, 2026

Unknown/Any-to-Any

Boogu-Image-0.1 Brings Unified Multimodal to Open Source

A new Apache-licensed model family folds bilingual text-to-image generation and instruction editing into one system.

Jul 13, 2026

NVIDIA/Text → Image

NVIDIA distills Qwen-Image for few-step generation

A DMD2-distilled build of Qwen-Image trades sampling steps for speed while keeping the original model's output profile.

Jul 1, 2026

An Expert Approach to Pixels