OpenAIReasoning

OpenAI Releases 21B Open-Weight MoE Model

The new `gpt-oss-20b` is an Apache 2.0-licensed Mixture-of-Experts model designed to run efficiently on consumer-grade hardware.

Aug 4, 2025

Major releaseApache 2.0

OpenAI has released gpt-oss-20b, a new open-weight model that brings a powerful architecture to more accessible hardware. The model is a Mixture-of-Experts (MoE) with approximately 21 billion total parameters and is available under the permissive Apache 2.0 license, encouraging broad community use and adaptation.

The most significant aspect of this release is its focus on efficiency. MoE models achieve high performance by selectively activating only a fraction of their total parameters for any given input, which can drastically lower computational requirements compared to dense models of a similar size. This design makes gpt-oss-20b particularly relevant for developers and researchers working with consumer-grade GPUs.

Capabilities and Specifications

Beyond its architecture, gpt-oss-20b features a large 131,072-token context window, enabling it to process and reason over very long documents or complex conversations. Its primary focus on reasoning suggests it is well-suited for tasks that require logical deduction and problem-solving.

This release provides a powerful new tool for building sophisticated AI applications on more widely available hardware. Developers can access the model weights, technical details, and usage instructions on the official Hugging Face repository.

Sources

openai/gpt-oss-20b
Hugging Face
Visit

0 comments

No comments yet. Be the first to weigh in.

DeepSeek Ships V4-Flash, a 304B MoE Tuned for Agents

The latest checkpoint in DeepSeek's V4 line leans into agentic workflows while keeping the permissive MIT license.

Jul 31, 2026

DeepSeek/Text / LLM

DeepSeek Refreshes V4-Flash With New 0731 Checkpoint

The MIT-licensed mixture-of-experts model returns in an updated build shipping with FP8 weights for cheaper inference.

Jul 31, 2026

LGAI EXAONE/Text / LLM

LG AI Research debuts K-EXAONE 2.0, a 750B MoE model

The new mixture-of-experts model activates 37B parameters per token and targets English, Korean, and Spanish reasoning tasks.

Jul 29, 2026