Black Forest Labs Releases 9B FLUX.2 Image Model
The new text-to-image model emphasizes speed and efficiency with a novel architecture and FP8 quantization.
German research company Black Forest Labs has released FLUX.2-klein-base, a new 9-billion-parameter model for text-to-image generation and editing. The release marks the debut of the FLUX.2 family, which introduces a new architectural approach designed for high-speed inference.
Unlike popular latent diffusion models such as Stable Diffusion that rely on a U-Net, FLUX.2 uses a multi-stage process built on transformers. This design, combined with its native FP8 precision, aims to deliver faster image generation on consumer-grade hardware. The "klein" (German for 'small') designation suggests this 9B model is an efficient entry point into a new class of powerful image generators.
How it Works
The model's architecture is composed of two main transformer components:
- A large, text-guided transformer that processes prompts and generates a base 128x128 image.
- A smaller, specialized upscaler transformer that refines the initial output into a final 1024x1024 image.
This two-part system is designed to create detailed images while maintaining performance. The model weights and usage examples are available on the official Hugging Face repository.
While the model's weights are publicly accessible, they are governed by a custom license that prohibits commercial use and places other restrictions on redistribution and training. Developers and researchers should review the terms carefully before integrating the model into their work.
Sources
- Visit
black-forest-labs/FLUX.2-klein-base-9b-fp8
Hugging Face
0 comments
No comments yet. Be the first to weigh in.
More in Text → Image

Ideogram 4.0: A 9.3B Open-Weight Text-to-Image Model
The new 9.3 billion parameter model uses a Diffusion Transformer architecture and excels at rendering coherent text within generated images.

ByteDance Releases Lance, a Unified Generative AI Model
The 3-billion-parameter model handles image and video generation, editing, and understanding from a single set of weights under a permissive license.

SenseTime Releases 8B 'Any-to-Any' Infographic Model
The new 8B-parameter SenseNova U1 model from SenseTime is designed for complex multimodal tasks, including the in-conversation generation and editing of infographics.