The Open Weights
© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.

DeepSeek Releases 671B MoE Model Under MIT License

The new DeepSeek-V3.1-Base is a 671-billion-parameter Mixture-of-Experts base model, released with open weights for research and further development.

Aug 19, 2025
Major release · MIT
DeepSeek · Text / LLM

AI research firm DeepSeek has released DeepSeek-V3.1-Base, a new foundation model with 671 billion total parameters. That makes it one of the largest base models available to the public under a permissive license.

The model's architecture is a Mixture-of-Experts (MoE), a design that allows for very large parameter counts while keeping computational costs manageable. Instead of activating all 671 billion parameters for every input, an MoE model uses a learned router to send each token to a small subset of specialized "expert" subnetworks, so the compute per token depends on the active parameters rather than the total. This makes training and inference cheaper than for a dense model of equivalent size, although every expert still has to be held in memory. The official model card also notes the use of FP8 weights, a lower-precision format that reduces memory requirements and can speed up inference on hardware that supports it.
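The routing idea described above can be illustrated with a toy example. This is a minimal sketch of generic top-k expert routing, not DeepSeek's actual router: the softmax gating, the choice of k=2, the four toy experts, and the helper names (`route_top_k`, `moe_layer`, `weight_memory_gb`) are all assumptions made for illustration. The memory helper only counts weight storage and assumes every parameter is stored at the given precision, which may not hold for the real checkpoint.

```python
import math


def softmax(scores):
    """Convert raw router scores into probabilities."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(router_scores, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts.

    Weights are renormalized so that the selected experts sum to 1.
    """
    probs = softmax(router_scores)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]


def moe_layer(x, experts, router_scores, k=2):
    """Run only the k selected experts and blend their outputs."""
    return sum(w * experts[i](x) for i, w in route_top_k(router_scores, k))


def weight_memory_gb(num_params, bytes_per_param):
    """Rough weight-only storage estimate in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


# Toy setup (hypothetical): four "experts" that just scale their input.
experts = [lambda x, s=s: s * x for s in (1.0, 2.0, 3.0, 4.0)]
scores = [0.1, 2.0, 0.3, 2.0]  # pretend router output for one token

selected = route_top_k(scores, k=2)      # only 2 of the 4 experts run
output = moe_layer(1.0, experts, scores)

# Weight storage for 671B parameters at 1 byte (FP8) vs 2 bytes (16-bit).
fp8_gb = weight_memory_gb(671e9, 1)
bf16_gb = weight_memory_gb(671e9, 2)
```

In this toy case only two of the four experts are evaluated for the token, which is where the compute saving comes from. The storage estimate shows the other half of the trade-off: all 671 billion parameters still have to be stored, and under the stated assumptions one byte per weight takes about half the space of a 16-bit format.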

Why it matters

The release of a model of this scale under the highly permissive MIT license is a major contribution to the open-source community. It provides researchers and developers with a powerful, commercially viable foundation for building specialized applications without the restrictive licensing often attached to state-of-the-art models. This gives organizations a new, high-quality starting point for fine-tuning on proprietary data for complex reasoning and generation tasks.

As a "base" model, DeepSeek-V3.1-Base is not intended for direct use as a chatbot; it is a pretrained starting point for further training and adaptation. Developers can access the model and its components directly from its Hugging Face repository. Its release continues a trend of top-tier AI capabilities becoming more accessible, which should support broader innovation in the field.
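For readers who want to inspect the repository before committing to a download of the full weights, the following is a minimal sketch using the `huggingface_hub` client library. The repository ID comes from the Sources list; the presence of a `config.json` file in that repository is an assumption based on common Hugging Face conventions, and `load_model_config` is a hypothetical helper name. Calling the function requires network access and the `huggingface_hub` package.

```python
import json

# Repository ID as listed in the article's Sources section.
REPO_ID = "deepseek-ai/DeepSeek-V3.1-Base"


def load_model_config(repo_id=REPO_ID, filename="config.json"):
    """Download one small file from the repository and parse it as JSON.

    Assumption: the repository contains a `config.json`, as most
    Hugging Face model repositories do.
    """
    # Imported here so the module loads even without the package installed.
    from huggingface_hub import hf_hub_download

    local_path = hf_hub_download(repo_id=repo_id, filename=filename)
    with open(local_path, "r", encoding="utf-8") as handle:
        return json.load(handle)
```

Fetching only the configuration file is a cheap way to check architecture details before pulling weights that, at this parameter count, run to hundreds of gigabytes.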

Sources

  • deepseek-ai/DeepSeek-V3.1-Base (Hugging Face)


Specs

  • Parameters: 671B · MoE
  • Context window: not listed
  • License: MIT
  • Downloads: 3.8K
  • Modalities: Text / LLM, Reasoning
