Latest open-source Text → Image models

Microsoft/Text → Image

Microsoft's Mage-Flow packs image editing into 4B

A compact model handles both text-to-image generation and instruction-based edits at native resolution, under a permissive MIT license.

Jul 21, 2026

Image Editing Text → Image

Unknown/Any-to-Any

Boogu-Image-0.1 Brings Unified Multimodal to Open Source

A new Apache-licensed model family folds bilingual text-to-image generation and instruction editing into one system.

Jul 13, 2026

Image Editing Any-to-Any

NVIDIA/Text → Image

NVIDIA distills Qwen-Image for few-step generation

A DMD2-distilled build of Qwen-Image trades sampling steps for speed while keeping the original model's output profile.

Jul 1, 2026

Text → Image

Krea/Text → Image

Krea 2 Arrives as Open-Weights Text-to-Image Model

The image-generation startup releases its second-generation diffusion model in raw and turbo variants under open weights.

Jun 18, 2026

Text → Image

Krea/Text → ImageMajor release

Krea 2 Arrives as a 12B Open-Weights Image Model

A new text-to-image model ships with a faster Turbo variant and downloadable weights on Hugging Face.

Jun 18, 2026

Text → Image

Stability AI/Text → Image

Ideogram 4.0 arrives as an open-weight image model

A 9.3-billion-parameter text-to-image system lands with open weights and a public GitHub home.

Jun 3, 2026

Text → Image

Black Forest Labs/Text → Image

Ideogram 4.0 arrives as an open-weight image model

A 9.3-billion-parameter text-to-image model lands on GitHub with downloadable weights and code.

Jun 3, 2026

Text → Image

Ideogram Ai/Text → Image

Ideogram 4.0: A 9.3B Open-Weight Text-to-Image Model

The new 9.3 billion parameter model uses a Diffusion Transformer architecture and excels at rendering coherent text within generated images.

May 30, 2026

Text → Image

ByteDance/Any-to-AnyMajor release

ByteDance Releases Lance, a Unified Generative AI Model

The 3-billion-parameter model handles image and video generation, editing, and understanding from a single set of weights under a permissive license.

May 15, 2026

Image Editing Any-to-Any

SenseTime/Any-to-Any

SenseTime Releases 8B 'Any-to-Any' Infographic Model

The new 8B-parameter SenseNova U1 model from SenseTime is designed for complex multimodal tasks, including the in-conversation generation and editing of infographics.

May 14, 2026

Image Editing Any-to-Any

inclusionAI/Any-to-Any

LLaDA2.0-Uni: A Unified MoE for Vision Tasks

The new open-source model from inclusionAI uses a Mixture-of-Experts architecture to handle multiple vision tasks in a single, diffusion-based system.

Apr 22, 2026

Image Editing Any-to-Any

SenseTime/Any-to-Any

SenseTime Releases 8B Any-to-Any Multimodal Model

The new SenseNova-U1 model unifies image understanding, generation, and editing within a single 8-billion-parameter framework.

Apr 22, 2026

Image Editing Any-to-Any

Baidu/Text → Image

Baidu Releases 8B Text-to-Image Model ERNIE-Image

The large diffusion model from the Chinese tech giant is available under the commercially permissive Apache 2.0 license, a notable release for the community.

Apr 7, 2026

Text → Image

Black Forest Labs/Text → Image

Black Forest Labs Releases Open FLUX.2 Image Decoder

This new component is part of a novel transformer-based architecture for text-to-image generation, released under a permissive Apache 2.0 license.

Apr 6, 2026

Image Editing Text → Image

Black Forest Labs/Text → ImageMajor release

Black Forest Labs Releases 9B FLUX.2 klein Image Model

The new open-weight model offers a more compact, distilled version of the advanced FLUX architecture for text-to-image and editing tasks.

Mar 9, 2026

Image Editing Text → Image

Black Forest Labs/Text → Image

Black Forest Labs Releases FLUX.2 Klein 9B

The open-weight text-to-image model brings a 9-billion-parameter base release to the FLUX.2 Klein family.

Jan 28, 2026

Text → Image

Qwen · Alibaba/Text → Image

Alibaba's Qwen Team Releases Z-Image Diffusion Model

The makers of the popular Qwen language models have published their first open-source text-to-image generator with a permissive Apache 2.0 license.

Jan 23, 2026

Text → Image

Black Forest Labs/Text → Image

Black Forest Labs Releases 9B FLUX.2 Image Model

The new text-to-image model emphasizes speed and efficiency with a novel architecture and FP8 quantization.

Jan 14, 2026

Image Editing Text → Image

Black Forest Labs/Text → ImageMajor release

Black Forest Labs Releases Open-Source FLUX.2 Klein 4B

The new 4-billion-parameter model is a distilled version of the powerful FLUX.2 architecture, released under a commercially-friendly Apache 2.0 license.

Jan 14, 2026

Image Editing Text → Image

Black Forest Labs/Text → Image

FLUX.2 Klein: A Compact 4B Open-Source Image Model

The new 4-billion-parameter model from Black Forest Labs offers an efficient, transformer-based alternative to latent diffusion for image generation.

Jan 14, 2026

Image Editing Text → Image

Black Forest Labs/Text → ImageMajor release

Black Forest Labs Releases 9B FLUX.2 Image Model

The new 9-billion-parameter model uses a Diffusion Transformer architecture, promising higher performance than existing open-source alternatives.

Jan 14, 2026

Image Editing Text → Image

Black Forest Labs/Text → ImageMajor release

Black Forest Labs Releases New FLUX.2 Image Model

The new 9-billion-parameter text-to-image model uses a novel architecture that operates directly on pixels for faster, more efficient generation.

Jan 14, 2026

Image Editing Text → Image

Zhipu AI/Text → Image

Zhipu AI Releases Open, Bilingual GLM-Image Model

The new text-to-image model is fluent in both Chinese and English, built on the CogView2 architecture and released under a permissive MIT license.

Jan 8, 2026

Text → Image

Qwen · Alibaba/Text → Image

Qwen Releases Bilingual Open-Source Image Model

Alibaba's latest text-to-image generator, Qwen-Image 2512, is optimized for creating visuals from both English and Chinese prompts.

Dec 30, 2025

Text → Image

Qwen · Alibaba/Text → Image

Alibaba Releases Z-Image-Turbo, A Fast Open Image Model

The new text-to-image model from the team behind Qwen uses a diffusion transformer to generate high-resolution images in just a single step.

Nov 25, 2025

Text → Image

Black Forest Labs/Text → ImageMajor release

Black Forest Labs Releases Open-Source FLUX.2 Image Model

The developer preview of the next-generation text-to-image architecture promises significant architectural improvements over its predecessor.

Nov 22, 2025

Image Editing Text → Image

BAAI/Any-to-Any

BAAI Releases Emu3.5, an 'Any-to-Any' Multimodal Model

The new open-source model from the Allen Institute for AI unifies text and image understanding and generation into a single architecture.

Oct 31, 2025

Any-to-Any Text → Image

Tencent/Text → Image

Tencent Debuts HunyuanImage 3.0 with MoE Design

The new text-to-image generator from the Chinese tech giant uses a Mixture-of-Experts architecture for more efficient and detailed image creation.

Sep 25, 2025

Text → Image

Tencent/Text → ImageMajor release

Tencent Releases HunyuanImage 3.0 Text-to-Image Model

The new text-to-image generator from the Chinese tech giant uses a Mixture-of-Experts architecture for improved efficiency and output quality.

Sep 25, 2025

Text → Image

Alpha-VLLM/Any-to-Any

Lumina-DiMOO: A Diffusion Model for Any-to-Any AI

This new open-source model uses a diffusion architecture instead of a typical transformer to generate and understand a mix of media types.

Sep 9, 2025

Any-to-Any Text → Image

Tencent/Text → Image

Tencent SRPO Fine-Tunes SDXL with Preference Optimization

The new text-to-image model uses a novel rejection sampling technique to align Stable Diffusion XL more closely with human aesthetic preferences.

Sep 8, 2025

Text → Image

Tencent/Text → Image

Tencent Releases HunyuanImage 2.1 for Bilingual AI Art

The new text-to-image model from the Chinese tech giant is designed to understand both Chinese and English prompts at high resolutions.

Sep 5, 2025

Text → Image

Qwen · Alibaba/Text → ImageMajor release

Qwen releases open model for text-in-image generation

The new Apache 2.0 diffusion model from Alibaba's Qwen team focuses on accurately rendering both English and Chinese characters within generated images.

Aug 2, 2025

Text → Image