The Open Weights
LatestModelsLeaderboardsUpcomingCompanies
Subscribe
The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

Refreshed every 12 hours

Discover

  • Latest releases
  • New today
  • Trending models
  • Upcoming launches

Browse

  • All models
  • Companies
  • Categories
  • Leaderboards

About

  • About
  • Editorial policy
  • RSS feed
  • Newsletter

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.

LatestQwen · Alibaba3.5
Qwen · AlibabaVision-Language

Alibaba's Qwen Releases Compact 0.8B Vision Model

The new 800-million-parameter model is the smallest in the Qwen3.5 family, designed for efficient multimodal tasks on consumer-grade hardware.

Feb 28, 2026
UpdateApache 2.0
Qwen · Alibaba · Vision-Language
Qwen3.5-0.8B
Qwen3.5-0.8B

The Qwen team at Alibaba has released a new, notably compact model in its latest series: Qwen3.5-0.8B. As a vision-language model (VLM) with just 800 million parameters, it represents one of the smallest multimodal offerings from a major AI lab.

This instruction-tuned model is designed to understand and respond to prompts that combine both text and images. Its capabilities include tasks like describing what's in a photo, answering questions about visual content, and engaging in simple, visually-grounded dialogue.

The primary advantage of Qwen3.5-0.8B is its efficiency. The sub-billion parameter size makes it a practical choice for developers and researchers working with limited computational resources, such as consumer-grade GPUs or edge devices. It lowers the barrier to entry for experimenting with multimodal AI.

Released under a permissive Apache 2.0 license, the model is available for both academic and commercial use. It joins a growing Qwen3.5 family, providing a lightweight option for applications where a larger, more resource-intensive model would be impractical.

Sources

  • Qwen/Qwen3.5-0.8B

    Hugging Face

    Visit

0 comments

Protected by Turnstile

No comments yet. Be the first to weigh in.

Get the model

Weights

Specs

Parameters800M
Context window—
LicenseAPACHE-2.0
Downloads1.9M

Modalities

Vision-LanguageText / LLM

More in Vision-Language

Moonshot AI
Kimi-K2.7-Code
Kimi-K2.7-Code
Moonshot AI/Code

Moonshot AI Releases Kimi, a Multimodal Coding Model

The new Mixture-of-Experts model from the Chinese AI company can generate code while also understanding visual inputs, a rare combination in open models.

Jun 11, 2026
Google DeepMind
DiffusionGemma 26B-A4B Instruct
DiffusionGemma 26B-A4B Instruct
Google DeepMind/Text / LLM

Google Releases Open-Source DiffusionGemma 26B Model

The new 26B parameter model from DeepMind uses a diffusion-based architecture, a technique more common in image generation, to produce text.

Jun 9, 2026
MiniMax
MiniMax-M3
MiniMax-M3
MiniMax/Vision-Language

MiniMax Releases M3, a Multimodal MoE Model

The new open-weight model from MiniMax AI combines vision, coding, and reasoning using a Mixture-of-Experts architecture.

Jun 2, 2026