The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.


GAIR Releases daVinci-MagiHuman for Video Generation

The new open-source model from the General Artificial Intelligence Research team can create video clips complete with audio from a variety of inputs.

Mar 21, 2026

The General Artificial Intelligence Research (GAIR) team has released daVinci-MagiHuman, an open-source model for multimodal video generation. The model is released under an OpenRAIL-M license and is aimed at developers and researchers building creative AI applications.

Unlike many video models that work from a text prompt alone, daVinci-MagiHuman is positioned as a “multimodal agent” for creating audio-visual content. It accepts a combination of text, images, and audio as input, which gives users more control over the result than a text prompt alone. For example, a user could provide a still image and a text prompt to animate it into a short clip.

Core Capabilities

The model is built to handle a range of generative tasks, allowing users to direct video creation with different types of input. Key functions highlighted in the project's official release include:

  • Text-to-Video: Generating video from a descriptive text prompt.
  • Image-to-Video: Animating a static source image based on text instructions.
  • Audio-Video Generation: Creating video that is influenced by an audio input.
  • Video Editing: Performing tasks like style transfer on existing video clips.
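
As a rough illustration of how these four modes relate to the inputs a user supplies, the sketch below routes a request to a task name. It is not the model's actual API: the `GenerationRequest` class, the `infer_task` function, the task labels, and the priority order (video, then audio, then image, then text) are all hypothetical names and assumptions made for this example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class GenerationRequest:
    """Hypothetical container for the inputs a caller might supply."""
    prompt: Optional[str] = None      # descriptive text or edit instruction
    image_path: Optional[str] = None  # still image to animate
    audio_path: Optional[str] = None  # audio track that influences the video
    video_path: Optional[str] = None  # existing clip to edit


def infer_task(req: GenerationRequest) -> str:
    """Map the supplied inputs to one of the four modes listed above.

    The priority order is an assumption for illustration only.
    """
    if req.video_path is not None:
        return "video-editing"
    if req.audio_path is not None:
        return "audio-video-generation"
    if req.image_path is not None:
        return "image-to-video"
    if req.prompt is not None:
        return "text-to-video"
    raise ValueError("at least one input (text, image, audio, or video) is required")
```

Under these assumptions, `infer_task(GenerationRequest(prompt="a dog running", image_path="dog.png"))` returns `"image-to-video"`, which matches the still-image-plus-prompt scenario described earlier in the article.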

This release adds another entry to the rapidly expanding field of open-source video generation. By handling multiple input types in a single model, daVinci-MagiHuman lets the community experiment with more complex forms of AI-driven media creation and offers an accessible alternative to the large proprietary systems developed in private labs.

Sources

  • GAIR/daVinci-MagiHuman (Hugging Face)


Specs

  • Parameters: not listed
  • Context window: not listed
  • License: Other
  • Downloads: 111
  • Modalities: Image → Video, Text → Video, Any-to-Any
