The Open Weights
LatestModelsLeaderboardsUpcomingCompanies
Subscribe
The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

Refreshed every 12 hours

Discover

  • Latest releases
  • New today
  • Trending models
  • Upcoming launches

Browse

  • All models
  • Companies
  • Categories
  • Leaderboards

About

  • About
  • Editorial policy
  • RSS feed
  • Newsletter

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.

LatestOpenBMBMiniCPM-o 4.5
OpenBMBAny-to-Any

MiniCPM-o 4.5 Offers 'Any-to-Any' Multimodal AI

The new model from OpenBMB supports mixed-modality inputs and outputs, from text and images to audio and video, in a single efficient package.

Feb 2, 2026
NotableApache 2.0
OpenBMB · Any-to-Any
MiniCPM-o 4.5
MiniCPM-o 4.5

OpenBMB has released MiniCPM-o 4.5, a new multimodal model that pushes the boundaries of how AI can interact with different types of data. Unlike many vision-language models that primarily accept images and text to produce only text, MiniCPM-o is designed for 'any-to-any' communication.

This means the model can process a mix of inputs—such as text, images, and audio—and generate a combination of outputs in a single turn. The project describes this capability as 'full-duplex,' enabling more dynamic and complex interactions than traditional request-and-response models. This approach opens the door for more sophisticated conversational agents and creative tools.

The release is available in the popular GGUF format, which is significant for developers and hobbyists. GGUF allows large models to run efficiently on consumer-grade hardware, including CPUs and GPUs, lowering the barrier to entry for experimenting with advanced multimodal AI. You can find the model files and further details on the Hugging Face repository.

Released under the permissive Apache 2.0 license, MiniCPM-o 4.5 provides a powerful new building block for applications that require a deeper, more integrated understanding and generation of multiple media types.

Sources

  • openbmb/MiniCPM-o-4_5-gguf

    Hugging Face

    Visit

0 comments

Protected by Turnstile

No comments yet. Be the first to weigh in.

Get the model

Weights

Specs

Parameters—
Context window—
LicenseAPACHE-2.0
Downloads19.4K

Modalities

Any-to-AnyVision-Language
2 versions — view changelog

More in Any-to-Any

MiniMax
MiniMax-M3
MiniMax-M3
MiniMax/Vision-Language

MiniMax Releases M3, a Multimodal MoE Model

The new open-weight model from MiniMax AI combines vision, coding, and reasoning using a Mixture-of-Experts architecture.

Jun 2, 2026
Google DeepMind
Gemma 4 12B
Gemma 4 12B
Google DeepMind/Any-to-Any

Google Releases Gemma 4 12B Multimodal Model

The new 12-billion-parameter open model from DeepMind introduces a unified 'any-to-any' architecture for advanced multimodal tasks.

May 23, 2026
Google DeepMind
Gemma 4 12B
Gemma 4 12B
Google DeepMind/Any-to-Any

Google Releases Gemma 4, a 12B 'Any-to-Any' Model

The new 12-billion-parameter model from Google DeepMind is designed to handle a flexible mix of data types, moving beyond traditional text and image inputs.

May 23, 2026