Latest open-source Reasoning models

DeepSeek/Text / LLM

DeepSeek Ships V4-Flash, a 304B MoE Tuned for Agents

The latest checkpoint in DeepSeek's V4 line leans into agentic workflows while keeping the permissive MIT license.

Jul 31, 2026

Text / LLM Reasoning

DeepSeek/Text / LLMMajor release

DeepSeek Refreshes V4-Flash With New 0731 Checkpoint

The MIT-licensed mixture-of-experts model returns in an updated build shipping with FP8 weights for cheaper inference.

Jul 31, 2026

Text / LLM Reasoning

LGAI EXAONE/Text / LLM

LG AI Research debuts K-EXAONE 2.0, a 750B MoE model

The new mixture-of-experts model activates 37B parameters per token and targets English, Korean, and Spanish reasoning tasks.

Jul 29, 2026

Text / LLM Reasoning

Amd/Reasoning

AMD's Instella-MoE Brings Reasoning to ROCm Hardware

A new open mixture-of-experts model with 16B total parameters and just 3B active is tuned to run on AMD's own accelerator stack.

Jul 23, 2026

Text / LLM Reasoning

inclusionAI/Reasoning

inclusionAI's Ring-Zero Scales Zero-RL to a Trillion Parameters

A new mixture-of-experts model learns to reason through reinforcement learning alone, without human-annotated chains of thought.

Jul 13, 2026

Text / LLM Reasoning

NVIDIA/Any-to-Any

NVIDIA's Audex Unifies Audio Understanding and Speech

A new 30B mixture-of-experts model from NVIDIA handles both listening and speaking within a single audio-text architecture.

Jul 6, 2026

Any-to-Any Reasoning

Google DeepMind/Any-to-AnyMajor release

Google DeepMind's Gemma 4 Goes Multimodal and MoE

The new open-weights family adds a mixture-of-experts design, encoder-free multimodal inputs, and an optional thinking mode.

Jul 1, 2026

Text / LLM Any-to-Any

Mistral AI/Text / LLM

Mistral's Leanstral 1.5 puts 119B in a lean MoE

The new Apache-2.0 mixture-of-experts model activates just 6B parameters per token, trading raw density for cheaper inference.

Jul 1, 2026

Text / LLM Reasoning

DeepSeek/Text / LLMMajor release

DeepSeek Releases V4-Pro, an MIT-Licensed MoE Model

The new flagship arrives as a mixture-of-experts system with FP8 weights and open reasoning capabilities under a permissive license.

Jun 27, 2026

Text / LLM Reasoning

Deepreinforce Ai/Text / LLM

DeepReinforce Releases Ornith 1.0, a 35B Reasoning Model

The new dense model ships in GGUF format under a permissive MIT license, aimed at local and self-hosted deployment.

Jun 25, 2026

Text / LLM Reasoning

NVIDIA/Text / LLM

NVIDIA's Nemotron 3 Puzzle Runs Big on a Lean Budget

A 75-billion-parameter mixture-of-experts reasoning model that activates just 9 billion parameters per token.

Jun 24, 2026

Text / LLM Reasoning

NVIDIA/Text / LLM

NVIDIA's Nemotron-3 Puzzle Brings a Lean MoE to Reasoning

The 75B-parameter model activates just 9B per token and ships in NVIDIA's NVFP4 format for efficient inference.

Jun 24, 2026

Text / LLM Reasoning

Deepreinforce Ai/Text / LLM

DeepReinforce debuts Ornith-1.0, a 397B MoE model

The flagship of a new open model family arrives under a permissive MIT license, with reasoning among its stated strengths.

Jun 23, 2026

Text / LLM Reasoning

Qwen · Alibaba/Text / LLM

Qwen's AgentWorld Simulates Worlds for AI Agents

Alibaba's new MoE model acts as a language world model, generating the environments that agents act within.

Jun 22, 2026

Text / LLM Reasoning

InternScience/Reasoning

Agents-A1: A 35B MoE Built for Agentic Scaling

InclusionAI's new mixture-of-experts model bets that agent-horizon scaling can rival far larger systems on long-running tasks.

Jun 22, 2026

Text / LLM Reasoning

Zhipu AI/Text / LLMMajor release

Zhipu AI Releases MIT-Licensed GLM-5.2 MoE Model

The new bilingual model from the Chinese AI firm uses a Mixture of Experts architecture and sparse attention under a fully permissive license.

Jun 17, 2026

Text / LLM Reasoning

Moonshot AI/Text / LLMMajor release

Moonshot AI releases Kimi K3, a 2.8T-parameter MoE model

The open-weights multimodal model leans into coding and agentic tasks, extending Moonshot's Kimi line into a new scale bracket.

Jun 13, 2026

Text / LLM Reasoning

WeiboAI/Reasoning

Weibo AI Releases VibeThinker-3B, a Compact Reasoning Model

The new 3-billion-parameter model from the Chinese tech giant focuses on challenging benchmarks in mathematics, coding, and graduate-level questions.

Jun 12, 2026

Code Text / LLM

MiniMax/Vision-LanguageMajor release

MiniMax Releases M3, a Multimodal MoE Model

The new open-weight model from MiniMax AI combines vision, coding, and reasoning using a Mixture-of-Experts architecture.

Jun 2, 2026

Code Any-to-Any

Moonshot AI/Vision-LanguageMajor release

Kimi K2.6 tops closed models in coding test

Moonshot AI's open-weights mixture-of-experts model reportedly outperformed Claude, GPT-5.5, and Gemini on a programming challenge.

May 3, 2026

Code Text / LLM

NVIDIA/Any-to-Any

NVIDIA Releases Efficient Nemotron-3 Multimodal MoE

The new 30-billion parameter Mixture-of-Experts model handles text and images while using only 3 billion active parameters for inference.

Apr 24, 2026

Any-to-Any Reasoning

DeepSeek/Text / LLMMajor release

DeepSeek Releases V4-Pro, an Open MoE Contender

The new flagship model combines a Mixture-of-Experts architecture with a permissive MIT license, positioning it for wide commercial adoption.

Apr 22, 2026

Code Text / LLM

DeepSeek/Text / LLMMajor release

DeepSeek Releases V4-Flash, a Fast MIT-Licensed MoE Model

The new Mixture of Experts model from the Beijing-based AI lab is optimized for fast, efficient conversational AI and carries a fully permissive license.

Apr 22, 2026

Text / LLM Reasoning

NVIDIA/Any-to-Any

NVIDIA Releases Nemotron-3-Nano Omni-Modal MoE

The new 30-billion-parameter Mixture-of-Experts model handles any combination of modalities with just 3 billion active parameters.

Apr 20, 2026

Any-to-Any Reasoning

Qwen · Alibaba/Vision-LanguageMajor release

Qwen Releases 35B Multimodal Mixture-of-Experts Model

The new Qwen3.6-35B-A3B from Alibaba's Qwen team combines vision and language capabilities using an efficient sparse architecture.

Apr 15, 2026

Text / LLM Reasoning

MiniMax/Text / LLM

MiniMax Releases M2.7, an MoE Model with FP8 Weights

The new conversational language model from the Chinese AI company uses a Mixture-of-Experts architecture and 8-bit weights, but is released under a restrictive custom license.

Apr 9, 2026

Text / LLM Reasoning

Zhipu AI/Text / LLMMajor release

Zhipu AI Releases Open-Source GLM-5.1 MoE Model

The new bilingual model from the Chinese AI firm features an efficient Mixture-of-Experts architecture and a fully permissive MIT license.

Apr 3, 2026

Text / LLM Reasoning

Zhipu AI/Text / LLMMajor release

Zhipu AI Releases Open-Source GLM-5 MoE Model

The new Mixture-of-Experts model from the Chinese AI company combines an advanced architecture with a fully permissive MIT license for commercial use.

Feb 11, 2026

Text / LLM Reasoning

Google DeepMind/Vision-Language

Google's MedGemma brings open vision AI to medicine

The new 4-billion-parameter vision-language model is specialized for tasks in radiology, pathology, and complex clinical reasoning.

Jan 7, 2026

Reasoning Vision-Language

Moonshot AI/Vision-LanguageMajor release

Moonshot AI Releases Kimi K2.5 Multimodal Model

The new vision-language model from the Chinese AI firm uses a Mixture-of-Experts architecture and is now available on Hugging Face.

Jan 1, 2026

Text / LLM Reasoning

Baidu/Vision-Language

Baidu Releases Open Vision-Language MoE Model

The new ERNIE 4.5 VL model brings advanced multimodal reasoning to the open-source community with an efficient Mixture-of-Experts architecture.

Nov 7, 2025

Reasoning Vision-Language

Moonshot AI/ReasoningMajor release

Moonshot AI Releases Kimi-K2 Reasoning Model

The new Mixture-of-Experts model is designed for complex tasks but arrives in a custom compressed format with a restrictive license.

Nov 4, 2025

Text / LLM Reasoning

MiniMax/Text / LLMMajor release

MiniMax Releases M2, an Open-Weight MoE for Agents

The Shanghai-based AI startup has released a new Mixture-of-Experts model focused on complex reasoning, coding, and agentic tasks.

Oct 22, 2025

Code Text / LLM

Zhipu AI/Text / LLMMajor release

Zhipu AI Releases Open-Weight MoE Model GLM-4.6

The new Mixture-of-Experts model is available under a permissive MIT license and is optimized for complex reasoning and coding tasks.

Sep 29, 2025

Text / LLM Reasoning

Qwen · Alibaba/Any-to-Any

Qwen Releases 'Thinking' Multimodal MoE Model

The new 30-billion-parameter Mixture-of-Experts model from Alibaba's Qwen team is designed to show its reasoning process for complex multimodal tasks.

Sep 15, 2025

Any-to-Any Reasoning

DeepSeek/Text / LLMMajor release

DeepSeek Releases 671B MoE Model Under MIT License

The new DeepSeek-V3.1-Base is a massive 671-billion-parameter Mixture-of-Experts model designed for efficient, large-scale research and development.

Aug 19, 2025

Text / LLM Reasoning

Zhipu AI/Vision-LanguageMajor release

Zhipu AI Releases Open Vision Model GLM-4.5V

The new Mixture-of-Experts model offers strong multimodal reasoning capabilities under a permissive MIT license.

Aug 10, 2025

Reasoning Vision-Language

OpenAI/ReasoningMajor release

OpenAI Releases 21B Open-Weight MoE Model

The new `gpt-oss-20b` is an Apache 2.0-licensed Mixture-of-Experts model designed to run efficiently on consumer-grade hardware.

Aug 4, 2025

Text / LLM Reasoning

OpenAI/ReasoningMajor release

OpenAI Releases Its First Open-Source MoE Model

The new 117-billion-parameter `gpt-oss-120b` is a Mixture-of-Experts model focused on reasoning, released under a permissive Apache 2.0 license.

Aug 4, 2025

Text / LLM Reasoning

Zhipu AI/Text / LLMMajor release

Z.ai Releases 355B Parameter GLM-4.5 Under MIT License

The new Mixture-of-Experts model combines massive scale with a fully permissive license, targeting complex reasoning and agentic applications.

Jul 20, 2025

Code Text / LLM

Moonshot AI/Vision-LanguageMajor release

Moonshot AI Releases Trillion-Parameter Kimi-K2 Model

The new Mixture-of-Experts model brings massive scale to the open-weights community, focusing on complex reasoning and coding tasks with a 128K context window.

Jul 11, 2025

Text / LLM Reasoning

Zhipu AI/Vision-Language

Zhipu AI Open-Sources 9B Vision Model with 'Thinking' Mode

The new GLM-4.1V-9B-Thinking model makes its vision and chain-of-thought reasoning capabilities available under a permissive MIT license.

Jun 28, 2025

Reasoning Vision-Language

Latest Reasoning models