Qwen · AlibabaText → Video

Qwen Unveils Wan2.2, a 14B Open Text-to-Video Model

The new Apache 2.0-licensed model from Alibaba's team uses a Mixture-of-Experts architecture for efficient, high-quality video generation.

Jul 24, 2025

NotableApache 2.0

The Qwen team at Alibaba has introduced a significant new open-source model for generating video from text prompts, called Wan2.2-T2V-A14B. This release expands the team's portfolio of powerful, openly accessible foundation models.

What sets this model apart is its use of a Mixture-of-Experts (MoE) architecture. It features 14 billion active parameters, meaning only a fraction of the model's total size is engaged for any given task. This design aims to deliver the performance of a much larger model while keeping the computational cost of inference more manageable.

The release of Wan2.2 is a notable event in the competitive landscape of generative video. By making the model available under the permissive Apache 2.0 license, the Qwen team provides researchers and developers with a powerful, unrestricted tool for building new applications and pushing the boundaries of video synthesis.

The model is now available for download and experimentation on the Hugging Face Hub, allowing the community to begin exploring its capabilities immediately.

Sources

Wan-AI/Wan2.2-T2V-A14B
Hugging Face
Visit

0 comments

No comments yet. Be the first to weigh in.

MiniMax Releases H3 Video Model on Hugging Face

The company's new diffusion model handles text-to-video and image-to-video, with support for joint audio-video generation.

Jul 28, 2026

robbyant/Text → Video

LingBot-Video puts a 30B MoE behind embodied AI video

A DiT-based mixture-of-experts model activates just 3B parameters per step and ships under an Apache 2.0 license.

Jul 8, 2026

NVIDIA/Text → Video

NVIDIA's Cosmos 3 Edge Brings World Models Closer

A new edge-optimized variant of NVIDIA's Cosmos world-model line aims to run generative video where the compute lives.

Jul 1, 2026