The Open Weights
LatestModelsLeaderboardsUpcomingCompanies
Subscribe
The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

Refreshed every 12 hours

Discover

  • Latest releases
  • New today
  • Trending models
  • Upcoming launches

Browse

  • All models
  • Companies
  • Categories
  • Leaderboards

About

  • About
  • Editorial policy
  • RSS feed
  • Newsletter

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.

LatestOpenMOSSMOVA
OpenMOSSAny-to-Any

OpenMOSS Releases MOVA, a 720p Multimodal Video Generator

The new open model can generate high-definition video with synchronized audio from a flexible combination of text and image prompts.

Jan 28, 2026
NotableOther
OpenMOSS · Any-to-Any
MOVA 720p
MOVA 720p

The OpenMOSS team has released MOVA 720p, a new model for generating video with synchronized audio. Unlike many generative tools that rely on a single input type, MOVA is designed for flexible, multimodal prompts, capable of creating high-definition video from text, images, or a combination of both.

This "any-to-any" architecture is the model's key feature. It allows a creator to provide an image as a starting point and then guide the animation with a text prompt, offering more direct control over the subject, action, and style of the final video. The model processes these varied inputs to produce a coherent visual sequence complete with a relevant soundtrack.

Why It Matters

Generating video at a 720p resolution (1280x720) marks a notable step for open multimodal models, which often operate at lower resolutions to manage computational demands. By integrating audio generation and supporting complex prompts, MOVA 720p pushes the capabilities of open-source video synthesis forward, narrowing the gap with proprietary systems from major tech labs.

The model and its components are available for developers to explore on the OpenMOSS team's Hugging Face repository. It is released under a custom license, so users should review the specific terms of use before integrating it into their work.

Sources

  • OpenMOSS-Team/MOVA-720p

    Hugging Face

    Visit

0 comments

Protected by Turnstile

No comments yet. Be the first to weigh in.

Get the model

Weights

Specs

Parameters—
Context window—
LicenseOTHER
Downloads175

Modalities

Any-to-AnyImage → Video

More in Any-to-Any

MiniMax
MiniMax-M3
MiniMax-M3
MiniMax/Vision-Language

MiniMax Releases M3, a Multimodal MoE Model

The new open-weight model from MiniMax AI combines vision, coding, and reasoning using a Mixture-of-Experts architecture.

Jun 2, 2026
Google DeepMind
Gemma 4 12B
Gemma 4 12B
Google DeepMind/Any-to-Any

Google Releases Gemma 4 12B Multimodal Model

The new 12-billion-parameter open model from DeepMind introduces a unified 'any-to-any' architecture for advanced multimodal tasks.

May 23, 2026
Google DeepMind
Gemma 4 12B
Gemma 4 12B
Google DeepMind/Any-to-Any

Google Releases Gemma 4, a 12B 'Any-to-Any' Model

The new 12-billion-parameter model from Google DeepMind is designed to handle a flexible mix of data types, moving beyond traditional text and image inputs.

May 23, 2026