The Open Weights
LatestModelsLeaderboardsUpcomingCompanies
Subscribe
The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

Refreshed every 12 hours

Discover

  • Latest releases
  • New today
  • Trending models
  • Upcoming launches

Browse

  • All models
  • Companies
  • Categories
  • Leaderboards

About

  • About
  • Editorial policy
  • RSS feed
  • Newsletter

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.

Category · text

Latest Text / LLM models

The newest open-source Text / LLM releases, from across the ecosystem.

Filter

36 releases

Zhipu AI/Text / LLMMajor release

Zhipu AI Releases MIT-Licensed GLM-5.2 MoE Model

The new bilingual model from the Chinese AI firm uses a Mixture of Experts architecture and sparse attention under a fully permissive license.

Jun 17, 2026
Text / LLMReasoning
GLM-5.2
GLM-5.2
Weibo AI/Reasoning

Weibo AI Releases VibeThinker-3B, a Compact Reasoning Model

The new 3-billion-parameter model from the Chinese tech giant focuses on challenging benchmarks in mathematics, coding, and graduate-level questions.

Jun 12, 2026
CodeText / LLM
VibeThinker-3B
VibeThinker-3B
Moonshot AI/CodeMajor release

Moonshot AI Releases Kimi, a Multimodal Coding Model

The new Mixture-of-Experts model from the Chinese AI company can generate code while also understanding visual inputs, a rare combination in open models.

Jun 11, 2026
CodeText / LLM
Kimi-K2.7-Code
Kimi-K2.7-Code
Google DeepMind/Text / LLM

Google Releases Open-Source DiffusionGemma 26B Model

The new 26B parameter model from DeepMind uses a diffusion-based architecture, a technique more common in image generation, to produce text.

Jun 9, 2026
Text / LLMVision-Language
DiffusionGemma 26B-A4B Instruct
DiffusionGemma 26B-A4B Instruct
Cohere/Code

Cohere Releases North-Mini-Code, an Open MoE Model

The new Apache 2.0-licensed model is designed for code generation and agentic chat applications, using a Mixture-of-Experts architecture for efficiency.

Jun 5, 2026
CodeText / LLM
North-Mini-Code 1.0
North-Mini-Code 1.0
Google DeepMind/Any-to-AnyMajor release

Google Releases Gemma 4, a 12B 'Any-to-Any' Model

The new 12-billion-parameter model from Google DeepMind is designed to handle a flexible mix of data types, moving beyond traditional text and image inputs.

May 23, 2026
Text / LLMAny-to-Any
Gemma 4 12B
Gemma 4 12B
Tencent/Text / LLM

Tencent Releases 1.8B Model for Multilingual Translation

The 1.8 billion-parameter model from the Chinese tech giant is designed for high-quality translation across a wide range of language pairs.

May 11, 2026
Text / LLM
Hunyuan-MT2 1.8B
Hunyuan-MT2 1.8B
Google DeepMind/Any-to-Any

Google Releases Gemma 4 Multimodal Open Model

The new 26-billion-parameter model from DeepMind uses a mixture-of-experts design for greater efficiency and is tuned for assistant-style tasks.

Apr 23, 2026
Text / LLMAny-to-Any
Gemma 4 26B-A4B Instruct (MoE)
Gemma 4 26B-A4B Instruct (MoE)
Google DeepMind/Any-to-AnyMajor release

Google Releases Multimodal Gemma 4 31B Model

The new 31-billion-parameter model is an instruction-tuned, 'any-to-any' powerhouse released under a permissive Apache 2.0 license.

Apr 23, 2026
Text / LLMAny-to-Any
Gemma 4 12B
Gemma 4 12B
Google DeepMind/Any-to-Any

Google Releases 4B Multimodal Gemma 4 Assistant

The new 4-billion-parameter model is instruction-tuned for 'any-to-any' tasks, handling a flexible mix of data types.

Apr 23, 2026
Text / LLMAny-to-Any
Gemma 4 E4B-it Assistant
Gemma 4 E4B-it Assistant
Google DeepMind/Any-to-Any

Google Releases 2B Multimodal Gemma 4 Assistant Model

The new compact model from DeepMind is instruction-tuned for "any-to-any" tasks, capable of processing and generating mixed data types.

Apr 23, 2026
Text / LLMAny-to-Any
Gemma 4 E2B-it Assistant
Gemma 4 E2B-it Assistant
DeepSeek/Text / LLMMajor release

DeepSeek Releases V4-Pro, an Open MoE Contender

The new flagship model combines a Mixture-of-Experts architecture with a permissive MIT license, positioning it for wide commercial adoption.

Apr 22, 2026
CodeText / LLM
DeepSeek-V4-Pro
DeepSeek-V4-Pro
DeepSeek/Text / LLMMajor release

DeepSeek Releases V4-Flash, a Fast MIT-Licensed MoE Model

The new Mixture of Experts model from the Beijing-based AI lab is optimized for fast, efficient conversational AI and carries a fully permissive license.

Apr 22, 2026
Text / LLMReasoning
DeepSeek-V4-Flash
DeepSeek-V4-Flash
Qwen · Alibaba/Vision-Language

Alibaba's Qwen Releases Open 27B Vision Model

The new dense model, licensed under Apache 2.0, brings both text and image understanding to the midrange parameter space.

Apr 21, 2026
Text / LLMVision-Language
Qwen3.6-27B
Qwen3.6-27B
Qwen · Alibaba/Vision-LanguageMajor release

Qwen Releases 35B Multimodal Mixture-of-Experts Model

The new Qwen3.6-35B-A3B from Alibaba's Qwen team combines vision and language capabilities using an efficient sparse architecture.

Apr 15, 2026
Text / LLMReasoning
Qwen3.6-27B
Qwen3.6-27B
Moonshot AI/Vision-LanguageMajor release

Moonshot AI Releases Kimi-K2.6 Multimodal Model

The Chinese AI lab has published weights for its new vision-language model, though a restrictive license limits its use to research applications.

Apr 14, 2026
Text / LLMVision-Language
Kimi-K2.6
Kimi-K2.6
MiniMax/Text / LLM

MiniMax Releases M2.7, an MoE Model with FP8 Weights

The new conversational language model from the Chinese AI company uses a Mixture-of-Experts architecture and 8-bit weights, but is released under a restrictive custom license.

Apr 9, 2026
Text / LLMReasoning
MiniMax-M2.7
MiniMax-M2.7
Google DeepMind/Any-to-AnyMajor release

Google Releases Gemma 4, a 26B Vision-Language Model

The new open-source model from DeepMind uses a Mixture-of-Experts architecture to handle both text and image inputs efficiently.

Mar 11, 2026
Text / LLMVision-Language
Gemma 4 12B
Gemma 4 12B
Google DeepMind/Any-to-AnyMajor release

Google Releases Multimodal Gemma 4 31B Model

The new 31-billion-parameter model is instruction-tuned and can process both text and images, marking a significant expansion for the Gemma family.

Mar 11, 2026
Text / LLMVision-Language
Gemma 4 12B
Gemma 4 12B
Google DeepMind/Any-to-AnyMajor release

Google's Gemma 4 Debuts with Any-to-Any Multimodality

The new 4-billion parameter model from Google DeepMind is designed for versatile input and output, handling text, images, and other data types.

Mar 2, 2026
Text / LLMAny-to-Any
Gemma 4 E4B
Gemma 4 E4B
Zhipu AI/Text / LLMMajor release

Zhipu AI Releases Open-Source GLM-5 MoE Model

The new Mixture-of-Experts model from the Chinese AI company combines an advanced architecture with a fully permissive MIT license for commercial use.

Feb 11, 2026
Text / LLMReasoning
GLM-5
GLM-5
Nanbeige/Text / LLM

Nanbeige Releases 3B Chinese-Enhanced Language Model

The new Llama-based model was trained from scratch on 3.5 trillion tokens of Chinese and English data to enhance its bilingual capabilities.

Feb 10, 2026
Text / LLM
Nanbeige4.1-3B
Nanbeige4.1-3B
Zhipu AI/Text / LLM

Zhipu AI Releases GLM-4.7-Flash MoE Model

The new Mixture-of-Experts model from the Beijing-based AI company is optimized for speed and released under the permissive MIT license.

Jan 19, 2026
Text / LLM
GLM-4.7-Flash
GLM-4.7-Flash
Moonshot AI/Vision-LanguageMajor release

Moonshot AI Releases Kimi K2.5 Multimodal Model

The new vision-language model from the Chinese AI firm uses a Mixture-of-Experts architecture and is now available on Hugging Face.

Jan 1, 2026
Text / LLMReasoning
Kimi K2.5
Kimi K2.5
MiniMax/Text / LLMMajor release

MiniMax Releases M2, an Open-Weight MoE for Agents

The Shanghai-based AI startup has released a new Mixture-of-Experts model focused on complex reasoning, coding, and agentic tasks.

Oct 22, 2025
CodeText / LLM
MiniMax-M2
MiniMax-M2
Google DeepMind/Text / LLM

Google Releases Compact FunctionGemma Model

The new 270-million-parameter model from Google DeepMind is fine-tuned specifically for reliable function calling and tool use.

Oct 8, 2025
Text / LLM
FunctionGemma 270M IT
FunctionGemma 270M IT
Zhipu AI/Text / LLMMajor release

Zhipu AI Releases Open-Weight MoE Model GLM-4.6

The new Mixture-of-Experts model is available under a permissive MIT license and is optimized for complex reasoning and coding tasks.

Sep 29, 2025
Text / LLMReasoning
GLM-4.6
GLM-4.6
Qwen · Alibaba/Text / LLMMajor release

Qwen Releases 80B Mixture-of-Experts Model

The new Qwen3-Next model from Alibaba combines a large parameter count with an efficient MoE architecture to balance performance and computational cost.

Sep 9, 2025
Text / LLM
Qwen3-Next-80B-A3B-Instruct
Qwen3-Next-80B-A3B-Instruct
DeepSeek/Text / LLMMajor release

DeepSeek Releases 671B MoE Model Under MIT License

The new DeepSeek-V3.1-Base is a massive 671-billion-parameter Mixture-of-Experts model designed for efficient, large-scale research and development.

Aug 19, 2025
Text / LLMReasoning
DeepSeek-V3.1-Base
DeepSeek-V3.1-Base
Google DeepMind/Text / LLM

Google Releases Gemma 3 270M for On-Device AI

The new ultra-compact model from DeepMind is designed for efficient performance in resource-constrained environments like mobile and web.

Aug 5, 2025
Text / LLM
Gemma 3 270M
Gemma 3 270M
OpenAI/ReasoningMajor release

OpenAI Releases 21B Open-Weight MoE Model

The new `gpt-oss-20b` is an Apache 2.0-licensed Mixture-of-Experts model designed to run efficiently on consumer-grade hardware.

Aug 4, 2025
Text / LLMReasoning
gpt-oss-20b
gpt-oss-20b
OpenAI/ReasoningMajor release

OpenAI Releases Its First Open-Source MoE Model

The new 117-billion-parameter `gpt-oss-120b` is a Mixture-of-Experts model focused on reasoning, released under a permissive Apache 2.0 license.

Aug 4, 2025
Text / LLMReasoning
gpt-oss-20b
gpt-oss-20b
Qwen · Alibaba/Code

Qwen Releases Compact 30B MoE for Coding Agents

The new Apache 2.0 model from Alibaba's Qwen team uses a Mixture-of-Experts architecture to deliver strong performance with only 3B active parameters.

Jul 31, 2025
CodeText / LLM
Qwen3-Coder-30B-A3B-Instruct
Qwen3-Coder-30B-A3B-Instruct
Qwen · Alibaba/CodeMajor release

Qwen Releases 480B Open-Source Model for Code Agents

The new flagship coding model from Alibaba's Qwen team uses a massive Mixture-of-Experts architecture and is released under a permissive Apache-2.0 license.

Jul 22, 2025
CodeText / LLM
Qwen3-Coder-30B-A3B-Instruct
Qwen3-Coder-30B-A3B-Instruct
Zhipu AI/ReasoningMajor release

Z.ai Releases 355B Parameter GLM-4.5 Under MIT License

The new Mixture-of-Experts model combines massive scale with a fully permissive license, targeting complex reasoning and agentic applications.

Jul 20, 2025
CodeText / LLM
GLM-4.5
GLM-4.5
Moonshot AI/Text / LLMMajor release

Moonshot AI Releases Trillion-Parameter Kimi-K2 Model

The new Mixture-of-Experts model brings massive scale to the open-weights community, focusing on complex reasoning and coding tasks with a 128K context window.

Jul 11, 2025
Text / LLMReasoning
Kimi-K2-Instruct
Kimi-K2-Instruct