The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.

Zhipu AI Releases Open Vision Model GLM-4.5V

The new Mixture-of-Experts model offers strong multimodal reasoning capabilities under a permissive MIT license.

Aug 10, 2025
Major release · MIT
Zhipu AI · Vision-Language

Chinese AI firm Zhipu AI has released GLM-4.5V, an open-source vision-language model (VLM). The model uses a Mixture-of-Experts (MoE) architecture and is designed for tasks that require reasoning over text and images together.

According to the release notes, GLM-4.5V is built on the company's GLM-4.5-Air-Base model. Its key advance is what Zhipu AI describes as strong multimodal reasoning, which suits it to applications such as detailed image analysis, visual question answering, and generating text grounded in visual information. The model weights and code are available now on Hugging Face.
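For readers who want to inspect the release before downloading the full weights, here is a minimal sketch that builds the URL of a repository's configuration file and reads it as JSON. It assumes the repository keeps a `config.json` on its `main` branch, which is common for Hugging Face model repositories but is not confirmed by the release notes. The helper names `config_url` and `load_config` are hypothetical.

```python
import json
from urllib.request import urlopen

# Repository name taken from the Sources list of this article.
REPO_ID = "zai-org/GLM-4.5V"


def config_url(repo_id: str, filename: str = "config.json", revision: str = "main") -> str:
    """Build the direct-download URL for one file in a Hugging Face repository.

    Assumption: the file exists at the given revision.
    """
    return f"https://huggingface.co/{repo_id}/resolve/{revision}/{filename}"


def load_config(repo_id: str) -> dict:
    """Fetch and parse the repository's config file (requires network access)."""
    with urlopen(config_url(repo_id), timeout=30) as response:
        return json.load(response)
```

Calling `load_config(REPO_ID)` returns a dictionary whose keys can be inspected for details the spec listing leaves blank. No particular key names are assumed here.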

Why it matters

The release matters for two main reasons. First, it adds a capable, openly accessible VLM to an ecosystem where proprietary models have often dominated. Second, the permissive MIT license lowers the barriers to both commercial and research use, allowing developers to build on and integrate the model freely.

The MoE architecture also points to an efficient design: the model activates only a subset of its expert sub-networks for each input during inference rather than the full network. This can mean faster inference and lower compute cost than a dense model of similar capability, making advanced multimodal AI accessible to a wider range of developers and organizations.
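The routing idea can be illustrated with a toy sketch of top-k expert selection. This is a generic illustration, not GLM-4.5V's actual router: the expert count, the value of `k`, the scalar "experts", and the fixed router scores are all assumptions made for the example. In a real model the router scores come from a learned layer applied to each token.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def top_k_route(router_logits, k):
    """Pick the k highest-scoring experts and normalize their weights."""
    ranked = sorted(range(len(router_logits)), key=lambda i: router_logits[i], reverse=True)
    chosen = ranked[:k]
    weights = softmax([router_logits[i] for i in chosen])
    return list(zip(chosen, weights))


def moe_forward(x, experts, router_logits, k=2):
    """Run only the selected experts and mix their outputs by router weight."""
    routes = top_k_route(router_logits, k)
    output = sum(weight * experts[i](x) for i, weight in routes)
    return output, [i for i, _ in routes]


calls = []  # records which experts actually ran


def make_expert(index, scale):
    """Hypothetical expert: multiplies its input by a fixed scale."""
    def expert(x):
        calls.append(index)
        return scale * x
    return expert


experts = [make_expert(i, s) for i, s in enumerate((1.0, 2.0, 3.0, 4.0))]
router_logits = [0.1, 2.0, -1.0, 1.0]  # illustrative fixed scores, one per expert
output, used = moe_forward(1.0, experts, router_logits, k=2)
print(f"experts used: {used}, experts skipped: {len(experts) - len(used)}")
```

Only two of the four experts run for this input, which is where the compute saving comes from. All four still have to be held in memory.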

Sources

  • zai-org/GLM-4.5V (Hugging Face)


Specs

  • Parameters: not listed (MoE)
  • Context window: not listed
  • License: MIT
  • Downloads: 85.9K

Modalities

  • Vision-Language
  • Reasoning
