The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.


Zhipu AI Open-Sources 9B Vision Model with 'Thinking' Mode

The new GLM-4.1V-9B-Thinking model makes its vision and chain-of-thought reasoning capabilities available under a permissive MIT license.

Jun 28, 2025
Notable · MIT
Zhipu AI · Vision-Language
GLM-4.1V-9B-Thinking

Zhipu AI has released GLM-4.1V-9B-Thinking, a new open-source vision-language model. The 9-billion-parameter model is designed to interpret and reason about visual inputs, and it joins an increasingly crowded field of open multimodal models.

The model's most distinctive feature is its explicit "thinking" mode, a chain-of-thought process in which the model generates intermediate reasoning steps before arriving at a final answer. For developers and researchers, seeing those steps can make it easier to understand and debug the model's conclusions on complex visual question-answering tasks.
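For readers who want to log or inspect the reasoning separately from the answer, here is a minimal Python sketch. It assumes the model wraps its reasoning in `<think>...</think>` tags and its final answer in `<answer>...</answer>` tags; the article does not specify the output format, so treat the tag names as an assumption and check the model card before relying on them. The `split_thinking` helper and `ThinkingOutput` type are hypothetical names introduced for illustration.

```python
import re
from typing import NamedTuple


class ThinkingOutput(NamedTuple):
    reasoning: str  # intermediate chain-of-thought text, may be empty
    answer: str     # final answer text


# Assumed tag names; the article does not specify the output format.
THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)
ANSWER_RE = re.compile(r"<answer>(.*?)</answer>", re.DOTALL)


def split_thinking(raw: str) -> ThinkingOutput:
    """Separate the reasoning trace from the final answer in raw model text."""
    think = THINK_RE.search(raw)
    reasoning = think.group(1).strip() if think else ""
    # Everything outside the first reasoning block is treated as the answer region.
    rest = THINK_RE.sub("", raw, count=1)
    tagged = ANSWER_RE.search(rest)
    answer = tagged.group(1).strip() if tagged else rest.strip()
    return ThinkingOutput(reasoning, answer)


if __name__ == "__main__":
    sample = (
        "<think>The chart has three bars; the tallest is labeled 2023.</think>"
        "<answer>2023</answer>"
    )
    out = split_thinking(sample)
    print("reasoning:", out.reasoning)
    print("answer:", out.answer)
```

If the output contains no tags at all, the sketch returns an empty reasoning string and treats the whole text as the answer, so downstream code does not need a special case.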

Permissive Licensing for Broader Use

For the open-source community, the license may matter most: Zhipu AI has released the model under the MIT license. MIT permits use, modification, and redistribution, including in commercial products, with the main condition being that the copyright and license notice are retained. That lowers the barrier to experimentation and integration compared with models released under more restrictive licenses.

The complete model weights and accompanying details are available for download on Hugging Face. The release gives developers an openly licensed model with inspectable reasoning for building applications that need to understand both language and imagery.
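One way to fetch the weights is sketched below, assuming the `huggingface_hub` Python package is installed. The repository ID comes from the Sources entry for this article; the `download_weights` helper and the target directory name are hypothetical choices for illustration.

```python
from pathlib import Path

# Repository ID as listed in the article's Sources section.
REPO_ID = "zai-org/GLM-4.1V-9B-Thinking"


def download_weights(target_dir: str = "glm-4.1v-9b-thinking") -> Path:
    """Download a snapshot of the model repository into target_dir."""
    # Imported lazily so this module loads even without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    local_path = snapshot_download(repo_id=REPO_ID, local_dir=target_dir)
    return Path(local_path)


if __name__ == "__main__":
    # This is a large download; make sure there is enough disk space first.
    print(download_weights())
```

Nothing is downloaded until `download_weights()` is called, so the snippet can be imported safely into a larger script.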

Sources

  • zai-org/GLM-4.1V-9B-Thinking (Hugging Face)


Specs

Parameters: 9B
Context window: not listed
License: MIT
Downloads: 364.2K

Modalities

Vision-Language, Reasoning
