Qwen Releases 9B Multimodal Model in New 3.5 Series
The new open-source vision-language model from Alibaba's Qwen team offers strong performance in a compact, Apache 2.0-licensed package.
The Qwen team at Alibaba has introduced Qwen3.5-9B, the first release in its new generation of open-source models. This 9-billion parameter model is primarily a vision-language model (VLM), designed to understand and process both text and images to generate text-based outputs.
As a dense model, Qwen3.5-9B offers a powerful and efficient architecture in a popular size class. It operates with a context window of 8,192 tokens, making it suitable for a range of multimodal tasks. Importantly, the model is released under the permissive Apache 2.0 license, allowing for broad commercial and research applications without restrictive usage policies.
The release provides developers with a capable, mid-sized multimodal model that balances performance with computational efficiency. Its open license and manageable size make it an attractive option for teams building applications that require visual understanding, such as advanced chatbots, content generation tools, and accessibility software, without needing the resources to run massive, proprietary models.
Developers can explore the model's capabilities and access the weights directly from its Hugging Face repository. The release continues the Qwen team's consistent contribution of high-quality, open models to the AI community.
Sources
- Visit
Qwen/Qwen3.5-9B
Hugging Face
0 comments
No comments yet. Be the first to weigh in.
More in Vision-Language
Moonshot AI Releases Kimi, a Multimodal Coding Model
The new Mixture-of-Experts model from the Chinese AI company can generate code while also understanding visual inputs, a rare combination in open models.
Google Releases Open-Source DiffusionGemma 26B Model
The new 26B parameter model from DeepMind uses a diffusion-based architecture, a technique more common in image generation, to produce text.

MiniMax Releases M3, a Multimodal MoE Model
The new open-weight model from MiniMax AI combines vision, coding, and reasoning using a Mixture-of-Experts architecture.