The Open Weights
LatestModelsLeaderboardsUpcomingCompanies
Subscribe
The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

Refreshed every 12 hours

Discover

  • Latest releases
  • New today
  • Trending models
  • Upcoming launches

Browse

  • All models
  • Companies
  • Categories
  • Leaderboards

About

  • About
  • Editorial policy
  • RSS feed
  • Newsletter

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.

LatestBaidu1.0
BaiduImage → Video

Baidu's Live-Avatar Animates Photos With Audio

The new 14-billion-parameter model uses audio input to generate realistic talking head videos from a single still image.

Dec 4, 2025
UpdateApache 2.0
Baidu · Image → Video
Live-Avatar
Live-Avatar

Baidu's Quark Vision team has released Live-Avatar, an open-source model that animates a still photograph into a talking head video using a separate audio track. The project aims to create realistic, audio-driven digital avatars from a single source image, a common but challenging task in generative AI.

The 14-billion-parameter model is built upon a foundation model called Wan2.2-S2V-14B, which specializes in still-image-to-video generation. Live-Avatar is fine-tuned for the specific task of synchronizing lip movements and generating natural head motions that correspond to the cadence and content of the provided audio input.

While audio-driven avatar technology has been explored extensively in commercial applications, the release of a powerful open-source model like Live-Avatar under a permissive Apache 2.0 license is significant. It provides researchers and developers with a strong baseline for creating virtual assistants, enhancing accessibility tools, or powering new forms of digital content creation.

The model, code, and usage instructions are now available for developers to explore on the Hugging Face Hub. The repository includes examples demonstrating the model's ability to generate coherent and expressive video from a variety of portrait images.

Sources

  • Quark-Vision/Live-Avatar

    Hugging Face

    Visit

0 comments

Protected by Turnstile

No comments yet. Be the first to weigh in.

Get the model

Weights

Specs

Parameters14B
Context window—
LicenseAPACHE-2.0
Downloads0

Modalities

Image → Video

More in Image → Video

Zhipu AI
SCAIL-2
SCAIL-2
Zhipu AI/Image → Video

Zhipu AI Releases SCAIL-2 for Character Animation

The new open-source diffusion model from the company's research arm generates video clips from a single character image and a sequence of poses.

Jun 9, 2026
NVIDIA
Cosmos3 Super Image2Video
Cosmos3 Super Image2Video
NVIDIA/Image → Video

NVIDIA Releases Cosmos3 Image-to-Video World Model

The latest release in NVIDIA's 'world model' research family aims to generate coherent and realistic video from a single static image.

May 21, 2026
NVIDIA
SANA-WM Bidirectional
SANA-WM Bidirectional
NVIDIA/Image → Video

NVIDIA Releases SANA, a Camera-Controllable Video Model

The new model, SANA-WM, uses a bidirectional diffusion process to give creators fine-grained control over camera movement and video editing.

May 18, 2026