JDText → Video

JD.com Enters Open-Source AI Video with JoyAI-Echo

The Chinese e-commerce giant has released a new model capable of generating long-form, multi-shot videos with synchronized audio from text prompts.

Jun 2, 2026

UpdateOther

Chinese e-commerce company JD.com has released JoyAI-Echo, a new open-source model for generating video from text. The release marks the company's entry into the competitive field of open-source generative video, adding another major corporate player to the ecosystem.

Unlike many models that produce short, single clips, JoyAI-Echo is designed for creating "long-form, multi-shot" videos. This allows it to generate a sequence of related scenes that can form a more coherent narrative. Crucially, the model also generates synchronized audio to accompany the video, a feature still emerging in many open video tools.

The model is based on the LTX-Video research framework, which focuses on generating temporally consistent and longer video sequences. The full model, code, and weights are available on its Hugging Face repository under an Apache 2.0 license, which permits commercial use.

JoyAI-Echo’s release highlights the growing trend of moving beyond simple clip generation toward more practical, narrative-driven video creation. Its focus on multi-shot storytelling and integrated audio pushes the capabilities of what's available in open-source AI, offering a new tool for creators and researchers exploring long-form generative content.

Sources

jdopensource/JoyAI-Echo
Hugging Face
Visit

0 comments

No comments yet. Be the first to weigh in.

MiniMax Releases H3 Video Model on Hugging Face

The company's new diffusion model handles text-to-video and image-to-video, with support for joint audio-video generation.

Jul 28, 2026

robbyant/Text → Video

LingBot-Video puts a 30B MoE behind embodied AI video

A DiT-based mixture-of-experts model activates just 3B parameters per step and ships under an Apache 2.0 license.

Jul 8, 2026

NVIDIA/Text → Video

NVIDIA's Cosmos 3 Edge Brings World Models Closer

A new edge-optimized variant of NVIDIA's Cosmos world-model line aims to run generative video where the compute lives.

Jul 1, 2026