Tencent Releases HunyuanImage 2.1 for Bilingual AI Art

The new text-to-image model from the Chinese tech giant is designed to understand both Chinese and English prompts at high resolutions.

Sep 5, 2025

Tencent has released HunyuanImage 2.1, a text-to-image diffusion model built around bilingual prompting. The model is engineered to interpret prompts in both Chinese and English, with the aim of generating high-quality images that reflect the cultural and linguistic nuances of each language.

According to Tencent's model card, the model is built on a diffusion transformer (DiT) backbone that combines single- and dual-stream blocks, and it pairs a multimodal large language model text encoder with a multilingual ByT5 encoder to better understand user intent. It natively generates images at 2K (2048x2048) resolution, placing it at the high end of open-source generators.

Conversational Image Generation

A standout feature of HunyuanImage 2.1 is multi-turn dialogue. Users can refine an image iteratively through conversation, giving follow-up instructions that modify a previously generated picture. Carrying context across turns goes a step beyond the single-shot prompting that most image models rely on.

The model weights and code are available on Hugging Face and are designed for use with the diffusers library. HunyuanImage 2.1 is released under a custom Tencent Hunyuan Model License Agreement, which permits non-commercial research use and outlines a separate application process for commercial licensing.

Sources

tencent/HunyuanImage-2.1 (Hugging Face)
