TencentText → Image

Tencent Releases HunyuanImage 3.0 Text-to-Image Model

The new text-to-image generator from the Chinese tech giant uses a Mixture-of-Experts architecture for improved efficiency and output quality.

Sep 25, 2025

Major releaseOther

Tencent has released the weights for HunyuanImage 3.0, its latest generative model for creating images from text prompts. This release marks a significant entry into the open-weights image generation space from the multinational tech company, offering a sophisticated new tool for researchers and developers.

The model's core architecture is a Diffusion Transformer (DiT) that incorporates a Mixture-of-Experts (MoE) approach. Unlike monolithic models, an MoE architecture activates specialized sub-networks based on the input prompt. This allows HunyuanImage 3.0 to handle a diverse range of concepts more efficiently and can lead to higher-quality, more detailed outputs.

Enhanced Prompt Understanding

A key feature of HunyuanImage 3.0 is its use of a multimodal large language model to refine and rewrite user prompts before the image generation process begins. According to Tencent, this step significantly improves the model's ability to interpret and adhere to complex instructions. The model is also designed to be fully bilingual, with strong capabilities in both Chinese and English, and shows a particular strength in generating imagery with Asian cultural elements.

The complete model weights are available on the Hugging Face Hub for download. It's important to note that the model is released under a custom license that permits academic research and non-commercial use only. This makes it a valuable resource for experimentation but restricts its application in commercial products.

Sources

tencent/HunyuanImage-3.0
Hugging Face
Visit

0 comments

No comments yet. Be the first to weigh in.

Microsoft's Mage-Flow packs image editing into 4B

A compact model handles both text-to-image generation and instruction-based edits at native resolution, under a permissive MIT license.

Jul 21, 2026

Unknown/Any-to-Any

Boogu-Image-0.1 Brings Unified Multimodal to Open Source

A new Apache-licensed model family folds bilingual text-to-image generation and instruction editing into one system.

Jul 13, 2026

NVIDIA/Text → Image

NVIDIA distills Qwen-Image for few-step generation

A DMD2-distilled build of Qwen-Image trades sampling steps for speed while keeping the original model's output profile.

Jul 1, 2026

Enhanced Prompt Understanding