moondreamVision-Language

Moondream 3 Arrives in Preview Release

The next generation of the efficient, open-source vision-language model is now available for early testing and feedback.

Sep 11, 2025

NotableOther

A preview version of Moondream 3, the next iteration of the compact and efficient vision-language model, has been released. Continuing the series' focus on performance in a small footprint, this new model is designed for a variety of image understanding tasks where resource constraints are a key consideration.

A New Architecture

Moondream 3 represents a significant architectural update. The model, which has around 4 billion parameters, is built on two powerful open components: a SigLIP vision encoder for image processing and Microsoft's recently released Phi-3-mini for its language understanding and generation capabilities. According to the release notes, the model was trained from scratch on a new dataset.

The project's goal is to provide a capable but lightweight alternative to the massive vision models released by major labs. By combining best-in-class open components, Moondream 3 aims to deliver strong performance without requiring extensive computational resources, making it suitable for on-device or edge applications.

This release is explicitly a preview intended to gather community feedback for future improvements. Developers can explore the model's capabilities on its Hugging Face repository. It is available for use under a custom 'Moondream license,' which users should review before implementation.

Sources

moondream/moondream3-preview
Hugging Face
Visit

0 comments

No comments yet. Be the first to weigh in.

Thinking Machines Debuts Inkling Small, a Compact Multimodal MoE

The Apache-2.0 model brings mixture-of-experts efficiency to image, audio, and text tasks in a smaller footprint.

Jul 27, 2026

Microsoft/Vision-Language

Microsoft's Mage-VL Streams Video Natively

A codec-native multimodal foundation model aims to understand live video and vision-language input in real time.

Jul 26, 2026

Swiss Ai/Text / LLM

Apertus v1.5 70B arrives with an Apache-2.0 license

Switzerland's open-model effort ships a 70-billion-parameter, multilingual and multimodal system that anyone can use, modify, and deploy.

Jul 24, 2026

A New Architecture