Meituan Releases Open, Bilingual Image Editing Model

The new LongCat-Image-Edit model follows natural language instructions to perform complex photo manipulations in both English and Chinese.

Dec 5, 2025

Chinese technology company Meituan has released LongCat-Image-Edit, an open-source model designed for instruction-based image editing. Released under a permissive Apache 2.0 license, the model provides a new tool for developers and creators working on generative AI applications.

Unlike traditional text-to-image models that generate images from scratch, LongCat-Image-Edit modifies existing images based on specific user commands. A key feature is its bilingual capability, allowing it to understand and execute instructions in both English and Chinese, making it accessible to a wider global audience.

Precise, Instruction-Based Control

The model interprets natural-language instructions to carry out a range of common editing tasks, an approach that can be more intuitive than writing complex prompts or using masking tools. According to the project's documentation, LongCat-Image-Edit can handle:

Local Editing: Changing the attributes of a specific object, like "change the color of the car to red."
Style Modification: Altering the overall aesthetic, such as applying a "watercolor style."
Global Replacement: Swapping out major elements, like changing the background from a city to a forest.
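As a rough illustration of the three task types above, the following sketch shows how a client application might organize edit requests before passing them to a model. The `EditKind` and `EditRequest` names, the image file names, and the Chinese phrasing of the first instruction are all hypothetical; none of them come from the LongCat-Image-Edit documentation.

```python
from dataclasses import dataclass
from enum import Enum


class EditKind(Enum):
    """The three task types named in the article (labels are illustrative)."""
    LOCAL = "local editing"
    STYLE = "style modification"
    GLOBAL = "global replacement"


@dataclass(frozen=True)
class EditRequest:
    """Hypothetical container pairing an input image with one instruction."""
    image_path: str
    instruction: str  # free-form text, in English or Chinese
    kind: EditKind

    def __post_init__(self) -> None:
        # Reject blank instructions early, before any model call is made.
        if not self.instruction.strip():
            raise ValueError("instruction must not be empty")


# Example requests; file names are placeholders.
EXAMPLES = [
    EditRequest("car.jpg", "change the color of the car to red", EditKind.LOCAL),
    # The same local edit phrased in Chinese (illustrative translation).
    EditRequest("car.jpg", "把汽车的颜色改成红色", EditKind.LOCAL),
    EditRequest("portrait.jpg", "apply a watercolor style", EditKind.STYLE),
    EditRequest("street.jpg", "change the background from a city to a forest", EditKind.GLOBAL),
]

for req in EXAMPLES:
    print(f"[{req.kind.value}] {req.image_path}: {req.instruction}")
```

The sketch only structures the inputs. How the image and instruction are actually passed to the model depends on the usage instructions published with the weights.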

The model weights and usage instructions are available for download from the Hugging Face Hub. The release adds another open tool to the growing ecosystem for AI-powered creative work, particularly for tasks that require precise, user-guided manipulation rather than pure generation.
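One possible way to fetch the weights is the `huggingface_hub` Python package, sketched below. The repository ID is the one listed under Sources. The local directory name and the `download_weights` helper are assumptions made for illustration, and the package has to be installed separately (for example with `pip install huggingface_hub`).

```python
# Repository ID as listed in the article's sources.
REPO_ID = "meituan-longcat/LongCat-Image-Edit"


def download_weights(local_dir: str = "longcat-image-edit") -> str:
    """Download a snapshot of the model repository and return its local path.

    Hypothetical helper: `local_dir` is an arbitrary folder name chosen here.
    """
    # Imported lazily so this module can be loaded without the third-party package.
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=REPO_ID, local_dir=local_dir)


# To run the download (requires network access and enough disk space):
# path = download_weights()
# print("Model files saved to", path)
```

This only retrieves the files. Loading the model and running an edit should follow the usage instructions in the repository itself.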

Sources

meituan-longcat/LongCat-Image-Edit (Hugging Face)
