Latest open-source Text / LLM models

Meituan/Text / LLM

Meituan Ships a Lighter, Sparser LongCat-Flash

The food-delivery giant's newest open model trims its mixture-of-experts design for more efficient inference under an MIT license.

Jul 31, 2026

Text / LLM

DeepSeek/Text / LLM

DeepSeek Ships V4-Flash, a 304B MoE Tuned for Agents

The latest checkpoint in DeepSeek's V4 line leans into agentic workflows while keeping the permissive MIT license.

Jul 31, 2026

Text / LLM Reasoning

LGAI EXAONE/Text / LLM

LG AI Research debuts K-EXAONE 2.0, a 750B MoE model

The new mixture-of-experts model activates 37B parameters per token and targets English, Korean, and Spanish reasoning tasks.

Jul 29, 2026

Text / LLM Reasoning

Skt/Text / LLM

SK Telecom Releases A.X-K2 Multilingual LLM

The Korean telecom carrier's latest open language model targets English, Korean, Chinese, Japanese, and Spanish under a permissive license.

Jul 28, 2026

Text / LLM

Thinkingmachines/Vision-Language

Thinking Machines Debuts Inkling Small, a Compact Multimodal MoE

The Apache-2.0 model brings mixture-of-experts efficiency to image, audio, and text tasks in a smaller footprint.

Jul 27, 2026

Vision-Language Text / LLM

Swiss Ai/Text / LLM

Apertus v1.5 70B arrives with an Apache-2.0 license

Switzerland's open-model effort ships a 70-billion-parameter, multilingual and multimodal system that anyone can use, modify, and deploy.

Jul 24, 2026

Text / LLM Vision-Language

Amd/Reasoning

AMD's Instella-MoE Brings Reasoning to ROCm Hardware

A new open mixture-of-experts model with 16B total parameters and just 3B active is tuned to run on AMD's own accelerator stack.

Jul 23, 2026

Reasoning Text / LLM

Kwaipilot/Code

Kwaipilot Releases KAT-Coder V2.5 Dev, an Agentic MoE Coder

Kuaishou's coding team ships an open mixture-of-experts model built on the Qwen3.5 MoE architecture and tuned for agentic development work.

Jul 23, 2026

Code Text / LLM

Upstage/Text / LLM

Upstage's Solar Open2 arrives as a 250B MoE model

The Korean AI firm's latest open release scales to 250 billion parameters with a mixture-of-experts design tuned for English and Korean.

Jul 22, 2026

Text / LLM

Motif Technologies/Text / LLM

Motif Technologies debuts Motif 3 Beta, an MoE model

The Korean AI lab's preview release is a mixture-of-experts language model built for long-context, multilingual work.

Jul 20, 2026

Text / LLM

Unknown/Text / LLM

German consortium releases open 30B model Soofi S

A collaborative European effort ships a dense 30-billion-parameter model that claims top marks on both English and German benchmarks.

Jul 16, 2026

Text / LLM

inclusionAI/Text / LLM

inclusionAI ships LLaDA2.2-flash diffusion LLM

A new Apache-2.0 mixture-of-experts model that generates text through diffusion rather than left-to-right decoding.

Jul 16, 2026

Text / LLM

Thinkingmachines/Any-to-AnyMajor release

Thinking Machines Lab debuts Inkling, its first open model

The lab's inaugural open-weights release is a mixture-of-experts system that takes image and audio inputs, shipped under a permissive Apache 2.0 license.

Jul 15, 2026

Any-to-Any Vision-Language

inclusionAI/Reasoning

inclusionAI's Ring-Zero Scales Zero-RL to a Trillion Parameters

A new mixture-of-experts model learns to reason through reinforcement learning alone, without human-annotated chains of thought.

Jul 13, 2026

Reasoning Text / LLM

Poolside/Code

Poolside releases Laguna-S-2.1 coding model

The AI coding startup puts a version of its Laguna family on Hugging Face under the permissive OpenMDW license.

Jul 13, 2026

Code Text / LLM

Ai Sage/Text / LLM

GigaChat 3.5 arrives as a 432B mixture-of-experts model

The multilingual instruct model activates 28B parameters per token and leans on hybrid attention for efficiency at scale.

Jul 5, 2026

Text / LLM

Prism Ml/Text / LLM

Bonsai-27B Brings 1-Bit Quantization to Local Inference

A ternary-weight 27B model with hybrid attention aims to run large-model reasoning on everyday hardware.

Jul 4, 2026

Text / LLM

Tencent/Text / LLM

Tencent releases Hunyuan Hy3 under Apache 2.0

The company's latest mixture-of-experts model arrives as an openly licensed conversational LLM on Hugging Face.

Jul 2, 2026

Text / LLM

Google DeepMind/Any-to-AnyMajor release

Google DeepMind's Gemma 4 Goes Multimodal and MoE

The new open-weights family adds a mixture-of-experts design, encoder-free multimodal inputs, and an optional thinking mode.

Jul 1, 2026

Any-to-Any Text / LLM

Mistral AI/Text / LLM

Mistral's Leanstral 1.5 puts 119B in a lean MoE

The new Apache-2.0 mixture-of-experts model activates just 6B parameters per token, trading raw density for cheaper inference.

Jul 1, 2026

Text / LLM Reasoning

Soofi Project/Text / LLM

German Consortium Debuts Soofi S, an Open 30B MoE Model

A Mamba-2 mixture-of-experts model claims top marks in both English and German benchmarks.

Jul 1, 2026

Text / LLM

IBM/Text / LLM

Liquid AI's LFM2.5 230M targets phones and robots

A 230-million-parameter language model built to run locally on constrained hardware like the Raspberry Pi.

Jul 1, 2026

Text / LLM

Mistral AI/Text / LLM

Liquid AI's LFM2.5-230M targets phones and robots

A 230-million-parameter model built to run on constrained hardware like Raspberry Pi and edge robotics.

Jul 1, 2026

Text / LLM

NVIDIA/Text / LLM

Liquid AI's LFM2.5-230M targets phones and robots

A 230-million-parameter language model built to run on hardware as modest as a Raspberry Pi.

Jul 1, 2026

Text / LLM

Meituan/Text / LLM

Meituan releases LongCat-2.0 language model

The Chinese delivery giant continues its push into open AI with a new text model on Hugging Face.

Jun 30, 2026

Text / LLM

DeepSeek/Text / LLMMajor release

DeepSeek Releases V4-Pro, an MIT-Licensed MoE Model

The new flagship arrives as a mixture-of-experts system with FP8 weights and open reasoning capabilities under a permissive license.

Jun 27, 2026

Text / LLM Reasoning

DeepSeek/Text / LLM

DeepSeek Releases V4-Flash for Low-Latency Inference

A lighter, faster member of DeepSeek's V4 line arrives on Hugging Face under a permissive MIT license.

Jun 27, 2026

Text / LLM

Deepreinforce Ai/Text / LLM

DeepReinforce Releases Ornith 1.0, a 35B Reasoning Model

The new dense model ships in GGUF format under a permissive MIT license, aimed at local and self-hosted deployment.

Jun 25, 2026

Text / LLM Reasoning

LiquidAI/Text / LLM

Liquid AI's LFM2.5-230M targets on-device language tasks

A 230-million-parameter multilingual model built to run efficiently at the edge rather than in the cloud.

Jun 24, 2026

Text / LLM

NVIDIA/Text / LLM

NVIDIA's Nemotron 3 Puzzle Runs Big on a Lean Budget

A 75-billion-parameter mixture-of-experts reasoning model that activates just 9 billion parameters per token.

Jun 24, 2026

Text / LLM Reasoning

NVIDIA/Text / LLM

NVIDIA's Nemotron-3 Puzzle Brings a Lean MoE to Reasoning

The 75B-parameter model activates just 9B per token and ships in NVIDIA's NVFP4 format for efficient inference.

Jun 24, 2026

Text / LLM Reasoning

Deepreinforce Ai/Text / LLM

DeepReinforce debuts Ornith-1.0, a 397B MoE model

The flagship of a new open model family arrives under a permissive MIT license, with reasoning among its stated strengths.

Jun 23, 2026

Text / LLM Reasoning

Qwen · Alibaba/Text / LLM

Qwen's AgentWorld Simulates Worlds for AI Agents

Alibaba's new MoE model acts as a language world model, generating the environments that agents act within.

Jun 22, 2026

Text / LLM Reasoning

InternScience/Reasoning

Agents-A1: A 35B MoE Built for Agentic Scaling

InclusionAI's new mixture-of-experts model bets that agent-horizon scaling can rival far larger systems on long-running tasks.

Jun 22, 2026

Reasoning Text / LLM

Deepreinforce Ai/Text / LLM

DeepReinforce's Ornith-1.0-9B Targets Agentic Coding

A compact, MIT-licensed 9B model built for autonomous coding tasks arrives on Hugging Face.

Jun 21, 2026

Code Text / LLM

Deepreinforce Ai/Text / LLM

Ornith-1.0-35B brings a mid-size MoE to agentic coding

An MIT-licensed mixture-of-experts model targets self-scaffolding code tasks without the footprint of a frontier system.

Jun 21, 2026

Code Text / LLM

Poolside/Code

Poolside releases Laguna XS 2.1 code model

The compact, code-focused language model arrives on Hugging Face under an open model license.

Jun 20, 2026

Code Text / LLM

Zhipu AI/Text / LLMMajor release

Zhipu AI Releases MIT-Licensed GLM-5.2 MoE Model

The new bilingual model from the Chinese AI firm uses a Mixture of Experts architecture and sparse attention under a fully permissive license.

Jun 17, 2026

Text / LLM Reasoning

Poolside/Text / LLM

Poolside Releases Laguna-M.1, an Open MoE Model

The AI coding startup steps into open weights with an Apache-2.0 mixture-of-experts model built for text and code.

Jun 15, 2026

Text / LLM Code

Microsoft/Text / LLM

Microsoft's FastContext is a 4B sub-agent for code

A compact Qwen3-derived model built to explore repositories, released under a permissive MIT license.

Jun 14, 2026

Text / LLM Code

Moonshot AI/Text / LLMMajor release

Moonshot AI releases Kimi K3, a 2.8T-parameter MoE model

The open-weights multimodal model leans into coding and agentic tasks, extending Moonshot's Kimi line into a new scale bracket.

Jun 13, 2026

Text / LLM Vision-Language

WeiboAI/Reasoning

Weibo AI Releases VibeThinker-3B, a Compact Reasoning Model

The new 3-billion-parameter model from the Chinese tech giant focuses on challenging benchmarks in mathematics, coding, and graduate-level questions.

Jun 12, 2026

Reasoning Text / LLM

Moonshot AI/CodeMajor release

Moonshot AI Releases Kimi, a Multimodal Coding Model

The new Mixture-of-Experts model from the Chinese AI company can generate code while also understanding visual inputs, a rare combination in open models.

Jun 11, 2026

Code Vision-Language

Google DeepMind/Text / LLM

Google Releases Open-Source DiffusionGemma 26B Model

The new 26B parameter model from DeepMind uses a diffusion-based architecture, a technique more common in image generation, to produce text.

Jun 9, 2026

Text / LLM Vision-Language

Cohere/Code

Cohere Releases North-Mini-Code, an Open MoE Model

The new Apache 2.0-licensed model is designed for code generation and agentic chat applications, using a Mixture-of-Experts architecture for efficiency.

Jun 5, 2026

Code Text / LLM

Google DeepMind/Any-to-AnyMajor release

Google Releases Gemma 4 12B Multimodal Model

The new 12-billion-parameter open model from DeepMind introduces a unified 'any-to-any' architecture for advanced multimodal tasks.

May 23, 2026

Any-to-Any Vision-Language

Google DeepMind/Any-to-AnyMajor release

Google Releases Gemma 4, a 12B 'Any-to-Any' Model

The new 12-billion-parameter model from Google DeepMind is designed to handle a flexible mix of data types, moving beyond traditional text and image inputs.

May 23, 2026

Any-to-Any Vision-Language

OpenBMB/Text / LLM

OpenBMB's MiniCPM5-1B targets on-device AI

The compact 1B-parameter model brings long-context handling and tool-calling to phones and laptops.

May 21, 2026

Text / LLM

OpenBMB/Text / LLM

GLiGuard: A Sub-1B Model for Faster LLM Guardrails

The team behind GLiNER releases an open-source small language model aimed at making safety moderation cheaper and quicker to run.

May 12, 2026

Text / LLM

OpenBMB/Code

Needle: A 26M-Parameter Model Built for Tool Calling

Cactus Compute distilled Gemini's tool-calling behavior into a tiny model meant to run locally.

May 12, 2026

Code Text / LLM

Tencent/Text / LLM

Tencent Releases 1.8B Model for Multilingual Translation

The 1.8 billion-parameter model from the Chinese tech giant is designed for high-quality translation across a wide range of language pairs.

May 11, 2026

Text / LLM

Moonshot AI/Vision-LanguageMajor release

Kimi K2.6 tops closed models in coding test

Moonshot AI's open-weights mixture-of-experts model reportedly outperformed Claude, GPT-5.5, and Gemini on a programming challenge.

May 3, 2026

Code Text / LLM

Google DeepMind/Any-to-Any

Google Releases Gemma 4 Multimodal Open Model

The new 26-billion-parameter model from DeepMind uses a mixture-of-experts design for greater efficiency and is tuned for assistant-style tasks.

Apr 23, 2026

Any-to-Any Text / LLM

Google DeepMind/Any-to-AnyMajor release

Google Releases Multimodal Gemma 4 31B Model

The new 31-billion-parameter model is an instruction-tuned, 'any-to-any' powerhouse released under a permissive Apache 2.0 license.

Apr 23, 2026

Any-to-Any Text / LLM

Google DeepMind/Any-to-Any

Google Releases 4B Multimodal Gemma 4 Assistant

The new 4-billion-parameter model is instruction-tuned for 'any-to-any' tasks, handling a flexible mix of data types.

Apr 23, 2026

Any-to-Any Text / LLM

Google DeepMind/Any-to-Any

Google Releases 2B Multimodal Gemma 4 Assistant Model

The new compact model from DeepMind is instruction-tuned for "any-to-any" tasks, capable of processing and generating mixed data types.

Apr 23, 2026

Any-to-Any Text / LLM

DeepSeek/Text / LLMMajor release

DeepSeek Releases V4-Pro, an Open MoE Contender

The new flagship model combines a Mixture-of-Experts architecture with a permissive MIT license, positioning it for wide commercial adoption.

Apr 22, 2026

Text / LLM Reasoning

DeepSeek/Text / LLMMajor release

DeepSeek Releases V4-Flash, a Fast MIT-Licensed MoE Model

The new Mixture of Experts model from the Beijing-based AI lab is optimized for fast, efficient conversational AI and carries a fully permissive license.

Apr 22, 2026

Text / LLM Reasoning

Qwen · Alibaba/Vision-Language

Alibaba's Qwen Releases Open 27B Vision Model

The new dense model, licensed under Apache 2.0, brings both text and image understanding to the midrange parameter space.

Apr 21, 2026

Vision-Language Text / LLM

Qwen · Alibaba/Vision-LanguageMajor release

Qwen Releases 35B Multimodal Mixture-of-Experts Model

The new Qwen3.6-35B-A3B from Alibaba's Qwen team combines vision and language capabilities using an efficient sparse architecture.

Apr 15, 2026

Vision-Language Text / LLM

Moonshot AI/Vision-LanguageMajor release

Moonshot AI Releases Kimi-K2.6 Multimodal Model

The Chinese AI lab has published weights for its new vision-language model, though a restrictive license limits its use to research applications.

Apr 14, 2026

Vision-Language Text / LLM

NVIDIA/Text / LLM

NVIDIA's Nemotron TwoTower is a MoE experiment

An experimental 30B mixture-of-experts base model blends diffusion and Mamba ideas under a two-tower design.

Apr 11, 2026

Text / LLM

NVIDIA/Text / LLM

NVIDIA's Nemotron TwoTower mixes diffusion and Mamba

A new 30B mixture-of-experts base model activates just 3B parameters per token and pairs a hybrid diffusion/Mamba design.

Apr 11, 2026

Text / LLM

MiniMax/Text / LLM

MiniMax Releases M2.7, an MoE Model with FP8 Weights

The new conversational language model from the Chinese AI company uses a Mixture-of-Experts architecture and 8-bit weights, but is released under a restrictive custom license.

Apr 9, 2026

Text / LLM Reasoning

Zhipu AI/Text / LLMMajor release

Zhipu AI Releases Open-Source GLM-5.1 MoE Model

The new bilingual model from the Chinese AI firm features an efficient Mixture-of-Experts architecture and a fully permissive MIT license.

Apr 3, 2026

Text / LLM Reasoning

Meituan/Any-to-Any

Meituan Releases LongCat-Next 'Any-to-Any' AI Model

The Chinese tech company has released the weights for a unified model that can process and generate combinations of text, images, audio, and video.

Mar 25, 2026

Any-to-Any Text / LLM

Cactus Compute/Code

Needle: A 26M-Param Model Built for On-Device Tool Calls

Cactus Compute's tiny encoder-decoder is distilled specifically for function calling at the edge, trading general chat for a narrow, useful job.

Mar 16, 2026

Text / LLM Code

Google DeepMind/Any-to-AnyMajor release

Google Releases Gemma 4, a 26B Vision-Language Model

The new open-source model from DeepMind uses a Mixture-of-Experts architecture to handle both text and image inputs efficiently.

Mar 11, 2026

Vision-Language Text / LLM

Google DeepMind/Any-to-AnyMajor release

Google Releases Multimodal Gemma 4 31B Model

The new 31-billion-parameter model is instruction-tuned and can process both text and images, marking a significant expansion for the Gemma family.

Mar 11, 2026

Vision-Language Text / LLM

Google DeepMind/Any-to-AnyMajor release

Google Releases Compact Gemma 4 E2B Multimodal Model

The new 2-billion-parameter model from Google DeepMind brings efficient image-and-text understanding to the open-source Gemma family.

Mar 2, 2026

Any-to-Any Vision-Language

Google DeepMind/Any-to-AnyMajor release

Google's Gemma 4 Arrives with Any-to-Any Multimodal Skills

The new 2-billion-parameter model from DeepMind can process text, vision, and audio, making it a versatile and efficient foundation for developers.

Mar 2, 2026

Any-to-Any Vision-Language

Google DeepMind/Any-to-Any

Google Releases Gemma 4 E4B, a 4B Multimodal Model

The new 4-billion-parameter vision-language model brings image and text understanding to Google's popular open-source family.

Mar 2, 2026

Any-to-Any Vision-Language

Google DeepMind/Any-to-AnyMajor release

Google's Gemma 4 Debuts with Any-to-Any Multimodality

The new 4-billion parameter model from Google DeepMind is designed for versatile input and output, handling text, images, and other data types.

Mar 2, 2026

Any-to-Any Vision-Language

Qwen · Alibaba/Vision-Language

Alibaba's Qwen Releases Compact 0.8B Vision Model

The new 800-million-parameter model is the smallest in the Qwen3.5 family, designed for efficient multimodal tasks on consumer-grade hardware.

Feb 28, 2026

Vision-Language Text / LLM

Qwen · Alibaba/Vision-Language

Alibaba's Qwen team releases 4B vision-language model

The new Qwen3.5-4B model combines text and image understanding in a compact, permissively licensed package for developers.

Feb 27, 2026

Vision-Language Text / LLM

Qwen · Alibaba/Vision-Language

Qwen Releases 9B Multimodal Model in New 3.5 Series

The new open-source vision-language model from Alibaba's Qwen team offers strong performance in a compact, Apache 2.0-licensed package.

Feb 27, 2026

Vision-Language Text / LLM

Qwen · Alibaba/Vision-LanguageMajor release

Qwen Releases Flagship 122B Multimodal MoE Model

The new Qwen3.5-122B-A10B combines a massive parameter count with an efficient Mixture-of-Experts architecture for advanced vision and language tasks.

Feb 24, 2026

Vision-Language Text / LLM

Qwen · Alibaba/Vision-LanguageMajor release

Qwen Releases 27B Vision Model with Long Context

The new model from Alibaba's Qwen team combines multimodal understanding with a 131K token context window under a permissive Apache 2.0 license.

Feb 24, 2026

Vision-Language Text / LLM

Qwen · Alibaba/Vision-Language

Qwen Releases Efficient 35B Multimodal MoE Model

The new Qwen3.5-35B-A3B model from Alibaba combines vision and language capabilities with a resource-friendly Mixture of Experts design.

Feb 24, 2026

Vision-Language Text / LLM

Qwen · Alibaba/Vision-LanguageMajor release

Qwen releases flagship 397B multimodal MoE

The new open-source model from Alibaba uses a Mixture-of-Experts architecture to balance massive scale with efficient inference.

Feb 16, 2026

Vision-Language Text / LLM

MiniMax/Text / LLM

MiniMax Releases M2.5 Mixture-of-Experts Model

The Chinese AI company's first open-weight release uses an efficient FP8 data type but comes with a restrictive, non-commercial license.

Feb 12, 2026

Text / LLM

Zhipu AI/Text / LLMMajor release

Zhipu AI Releases Open-Source GLM-5 MoE Model

The new Mixture-of-Experts model from the Chinese AI company combines an advanced architecture with a fully permissive MIT license for commercial use.

Feb 11, 2026

Text / LLM Reasoning

Nanbeige/Text / LLM

Nanbeige Releases 3B Chinese-Enhanced Language Model

The new Llama-based model was trained from scratch on 3.5 trillion tokens of Chinese and English data to enhance its bilingual capabilities.

Feb 10, 2026

Text / LLM

Qwen · Alibaba/Code

Qwen Releases Coder-Next, A New Open MoE Coding Model

The new model from Alibaba's Qwen team uses a Mixture-of-Experts architecture and is released under the commercially-friendly Apache 2.0 license.

Jan 30, 2026

Code Text / LLM

Zhipu AI/Text / LLM

Zhipu AI Releases GLM-4.7-Flash MoE Model

The new Mixture-of-Experts model from the Beijing-based AI company is optimized for speed and released under the permissive MIT license.

Jan 19, 2026

Text / LLM

Google DeepMind/Text / LLM

Google Releases TranslateGemma for Open Translation

The new 4B-parameter model is an instruction-tuned variant of Gemma, designed specifically for high-quality multilingual translation tasks.

Jan 12, 2026

Text / LLM

Moonshot AI/Vision-LanguageMajor release

Moonshot AI Releases Kimi K2.5 Multimodal Model

The new vision-language model from the Chinese AI firm uses a Mixture-of-Experts architecture and is now available on Hugging Face.

Jan 1, 2026

Vision-Language Text / LLM

MiniMax/Text / LLM

MiniMax Debuts M2.1, an MoE Model Optimized with FP8

The new Mixture of Experts model from the Chinese AI firm uses 8-bit floating-point precision for a smaller memory footprint and faster inference.

Dec 20, 2025

Text / LLM

DeepSeek/Text / LLM

DeepSeek-V3.2 Arrives With FP8 Weights, MIT License

The new Mixture-of-Experts model from DeepSeek AI combines an efficient FP8 architecture with a fully permissive license for commercial use.

Dec 1, 2025

Text / LLM

Moonshot AI/ReasoningMajor release

Moonshot AI Releases Kimi-K2 Reasoning Model

The new Mixture-of-Experts model is designed for complex tasks but arrives in a custom compressed format with a restrictive license.

Nov 4, 2025

Reasoning Text / LLM

MiniMax/Text / LLMMajor release

MiniMax Releases M2, an Open-Weight MoE for Agents

The Shanghai-based AI startup has released a new Mixture-of-Experts model focused on complex reasoning, coding, and agentic tasks.

Oct 22, 2025

Text / LLM Reasoning

Google DeepMind/Text / LLM

Google Releases Compact FunctionGemma Model

The new 270-million-parameter model from Google DeepMind is fine-tuned specifically for reliable function calling and tool use.

Oct 8, 2025

Text / LLM

Zhipu AI/Text / LLMMajor release

Zhipu AI Releases Open-Weight MoE Model GLM-4.6

The new Mixture-of-Experts model is available under a permissive MIT license and is optimized for complex reasoning and coding tasks.

Sep 29, 2025

Text / LLM Reasoning

Qwen · Alibaba/Text / LLMMajor release

Qwen Releases 80B Mixture-of-Experts Model

The new Qwen3-Next model from Alibaba combines a large parameter count with an efficient MoE architecture to balance performance and computational cost.

Sep 9, 2025

Text / LLM

DeepSeek/Text / LLMMajor release

DeepSeek Releases 671B MoE Model Under MIT License

The new DeepSeek-V3.1-Base is a massive 671-billion-parameter Mixture-of-Experts model designed for efficient, large-scale research and development.

Aug 19, 2025

Text / LLM Reasoning

Google DeepMind/Text / LLM

Google Releases Gemma 3 270M for On-Device AI

The new ultra-compact model from DeepMind is designed for efficient performance in resource-constrained environments like mobile and web.

Aug 5, 2025

Text / LLM

OpenAI/ReasoningMajor release

OpenAI Releases 21B Open-Weight MoE Model

The new `gpt-oss-20b` is an Apache 2.0-licensed Mixture-of-Experts model designed to run efficiently on consumer-grade hardware.

Aug 4, 2025

Reasoning Text / LLM

OpenAI/ReasoningMajor release

OpenAI Releases Its First Open-Source MoE Model

The new 117-billion-parameter `gpt-oss-120b` is a Mixture-of-Experts model focused on reasoning, released under a permissive Apache 2.0 license.

Aug 4, 2025

Reasoning Text / LLM

Qwen · Alibaba/Code

Qwen Releases Compact 30B MoE for Coding Agents

The new Apache 2.0 model from Alibaba's Qwen team uses a Mixture-of-Experts architecture to deliver strong performance with only 3B active parameters.

Jul 31, 2025

Code Text / LLM

Qwen · Alibaba/CodeMajor release

Qwen Releases 480B Open-Source Model for Code Agents

The new flagship coding model from Alibaba's Qwen team uses a massive Mixture-of-Experts architecture and is released under a permissive Apache-2.0 license.

Jul 22, 2025

Code Text / LLM

Zhipu AI/Text / LLMMajor release

Z.ai Releases 355B Parameter GLM-4.5 Under MIT License

The new Mixture-of-Experts model combines massive scale with a fully permissive license, targeting complex reasoning and agentic applications.

Jul 20, 2025

Reasoning Text / LLM

Moonshot AI/Vision-LanguageMajor release

Moonshot AI Releases Trillion-Parameter Kimi-K2 Model

The new Mixture-of-Experts model brings massive scale to the open-weights community, focusing on complex reasoning and coding tasks with a 128K context window.

Jul 11, 2025

Text / LLM Reasoning

Latest Text / LLM models