Qwen · Alibaba

Qwen · Alibaba/Text / LLM

Qwen's AgentWorld Simulates Worlds for AI Agents

Alibaba's new MoE model acts as a language world model, generating the environments that agents act within.

Jun 22, 2026

Text / LLM Reasoning

Qwen · Alibaba/Vision-Language

Alibaba's Qwen Releases Open 27B Vision Model

The new dense model, licensed under Apache 2.0, brings both text and image understanding to the midrange parameter space.

Apr 21, 2026

Qwen · Alibaba/Vision-LanguageMajor release

Qwen Releases 35B Multimodal Mixture-of-Experts Model

The new Qwen3.6-35B-A3B from Alibaba's Qwen team combines vision and language capabilities using an efficient sparse architecture.

Apr 15, 2026

Text / LLM Reasoning

Qwen · Alibaba/Vision-Language

Alibaba's Qwen Releases Compact 0.8B Vision Model

The new 800-million-parameter model is the smallest in the Qwen3.5 family, designed for efficient multimodal tasks on consumer-grade hardware.

Feb 28, 2026

Qwen · Alibaba/Vision-Language

Alibaba's Qwen team releases 4B vision-language model

The new Qwen3.5-4B model combines text and image understanding in a compact, permissively licensed package for developers.

Feb 27, 2026

Qwen · Alibaba/Vision-Language

Qwen Releases 9B Multimodal Model in New 3.5 Series

The new open-source vision-language model from Alibaba's Qwen team offers strong performance in a compact, Apache 2.0-licensed package.

Feb 27, 2026

Qwen · Alibaba/Vision-LanguageMajor release

Qwen Releases Flagship 122B Multimodal MoE Model

The new Qwen3.5-122B-A10B combines a massive parameter count with an efficient Mixture-of-Experts architecture for advanced vision and language tasks.

Feb 24, 2026

Qwen · Alibaba/Vision-LanguageMajor release

Qwen Releases 27B Vision Model with Long Context

The new model from Alibaba's Qwen team combines multimodal understanding with a 131K token context window under a permissive Apache 2.0 license.

Feb 24, 2026

Qwen · Alibaba/Vision-Language

Qwen Releases Efficient 35B Multimodal MoE Model

The new Qwen3.5-35B-A3B model from Alibaba combines vision and language capabilities with a resource-friendly Mixture of Experts design.

Feb 24, 2026

Qwen · Alibaba/Vision-LanguageMajor release

Qwen releases flagship 397B multimodal MoE

The new open-source model from Alibaba uses a Mixture-of-Experts architecture to balance massive scale with efficient inference.

Feb 16, 2026

Qwen · Alibaba/Code

Qwen Releases Coder-Next, A New Open MoE Coding Model

The new model from Alibaba's Qwen team uses a Mixture-of-Experts architecture and is released under the commercially-friendly Apache 2.0 license.

Jan 30, 2026

Code Text / LLM

Qwen · Alibaba/Speech → Text

Qwen Releases 0.6B Model for Audio-Text Alignment

The new open-source tool, based on the Qwen3 architecture, precisely synchronizes audio recordings with their corresponding text transcripts.

Jan 28, 2026

Qwen · Alibaba/Speech → Text

Qwen3 Family Expands into Speech Recognition

Alibaba's Qwen team has released a new 1.7-billion-parameter model designed specifically for automatic speech recognition.

Jan 28, 2026

Qwen · Alibaba/Speech → Text

Qwen open-sources compact model for speech recognition

The new 600-million-parameter Qwen3-ASR model is designed for efficient, high-quality audio transcription under a permissive license.

Jan 28, 2026

Qwen · Alibaba/Text → Image

Alibaba's Qwen Team Releases Z-Image Diffusion Model

The makers of the popular Qwen language models have published their first open-source text-to-image generator with a permissive Apache 2.0 license.

Jan 23, 2026

Qwen · Alibaba/Text → Speech

Qwen Releases Open-Source Voice Cloning Model

The new 600-million-parameter Qwen3-TTS model can generate speech in multiple languages and clone voices from short audio clips.

Jan 21, 2026

Qwen · Alibaba/Text → Speech

Qwen Releases a Compact Custom-Voice TTS Model

The new 600-million-parameter model from Alibaba's Qwen team can clone voices from short audio clips for multilingual speech synthesis.

Jan 21, 2026

Qwen · Alibaba/Text → Speech

Qwen Releases Open 1.7B Custom Voice Synthesis Model

Alibaba's Qwen team has released a new text-to-speech model capable of cloning voices from just a few seconds of audio.

Jan 21, 2026

Qwen · Alibaba/Text → Speech

Qwen Unveils Open Model for Custom Voice Synthesis

The new 1.7-billion-parameter text-to-speech model from Alibaba's Qwen team can generate novel voices from short audio prompts.

Jan 21, 2026

Qwen · Alibaba/Text → Image

Qwen Releases Bilingual Open-Source Image Model

Alibaba's latest text-to-image generator, Qwen-Image 2512, is optimized for creating visuals from both English and Chinese prompts.

Dec 30, 2025

Qwen · Alibaba/Any-to-Any

Qwen's Fun-Audio-Chat: An Open Speech-to-Speech LLM

The 8-billion-parameter model from Alibaba's Qwen team understands and generates spoken responses, enabling more natural audio-first applications.

Dec 23, 2025

Speech → Text Any-to-Any

Qwen · Alibaba/Image Editing

Qwen Releases Open, Bilingual Image Editing Model

The new diffusion model from Alibaba's team allows for precise, instruction-based image modifications in both English and Chinese.

Dec 17, 2025

Image Editing

Qwen · Alibaba/Speech → Text

Qwen Releases Compact ASR Model for Streaming Audio

The new Fun-ASR-Nano model from Alibaba's team packs real-time multilingual transcription, speaker diarization, and hotword detection into an efficient package.

Dec 15, 2025

Qwen · Alibaba/Text → Speech

Alibaba Releases CosyVoice 3 for Expressive TTS

The new 500-million-parameter text-to-speech model from the Qwen team offers multilingual voice cloning and emotional control.

Dec 11, 2025

Qwen · Alibaba/Text → Image

Alibaba Releases Z-Image-Turbo, A Fast Open Image Model

The new text-to-image model from the team behind Qwen uses a diffusion transformer to generate high-resolution images in just a single step.

Nov 25, 2025

Any-to-Any Vision-Language

Qwen · Alibaba/Vision-Language

Alibaba Releases Qwen3-VL, an 8B Open-Source Vision Model

The latest vision-language model from the popular Qwen series is instruction-tuned and available under an Apache 2.0 license.

Oct 11, 2025

Vision-Language

Qwen · Alibaba/Vision-LanguageMajor release

Qwen Releases 30B MoE Vision Model, Qwen3-VL

The new open-source model from Alibaba uses a Mixture-of-Experts architecture to make its powerful vision-language capabilities more efficient to run.

Sep 30, 2025

Qwen · Alibaba/Image Editing

Qwen Releases Open-Source Instruction-Based Image Editor

The new model from Alibaba's Qwen team allows users to modify images using natural language prompts instead of complex tools or masks.

Sep 22, 2025

Image Editing

Qwen · Alibaba/Any-to-AnyMajor release

Qwen3-Omni Arrives With Any-to-Any Multimodality

The new 30B Mixture-of-Experts model from Alibaba's Qwen team can process and generate content across text, image, and audio formats.

Sep 20, 2025

Speech → Text Any-to-Any

Qwen · Alibaba/Any-to-Any

Qwen Releases 'Thinking' Multimodal MoE Model

The new 30-billion-parameter Mixture-of-Experts model from Alibaba's Qwen team is designed to show its reasoning process for complex multimodal tasks.

Sep 15, 2025

Any-to-Any Reasoning

Qwen · Alibaba/Any-to-Any

Qwen Releases 30B Model for Audio Captioning

The new Mixture-of-Experts model from Alibaba is fine-tuned to generate detailed, multilingual descriptions for complex audio content.

Sep 15, 2025

Any-to-Any Text → Speech

Qwen · Alibaba/Text → Video

Alibaba's Wan2.2 Adds Control to Open Video

The new 14-billion-parameter model from Alibaba's PAI team offers fine-grained control over video generation using inputs like sketches and depth maps.

Sep 10, 2025

Text → Video

Qwen · Alibaba/Text / LLMMajor release

Qwen Releases 80B Mixture-of-Experts Model

The new Qwen3-Next model from Alibaba combines a large parameter count with an efficient MoE architecture to balance performance and computational cost.

Sep 9, 2025

Text / LLM

Qwen · Alibaba/Image → Video

Alibaba Releases 14B Model for Audio-Driven Video

The new Wan2.2-S2V model takes a still image and a speech track to generate a realistic talking-head animation, available under a permissive license.

Aug 25, 2025

Qwen · Alibaba/Image EditingMajor release

Qwen Releases Open Model for Image Editing

The new open-source model from Alibaba lets users edit images with simple text commands in both English and Chinese.

Aug 17, 2025

Image Editing

Qwen · Alibaba/Text → ImageMajor release

Qwen releases open model for text-in-image generation

The new Apache 2.0 diffusion model from Alibaba's Qwen team focuses on accurately rendering both English and Chinese characters within generated images.

Aug 2, 2025

Qwen · Alibaba/Code

Qwen Releases Compact 30B MoE for Coding Agents

The new Apache 2.0 model from Alibaba's Qwen team uses a Mixture-of-Experts architecture to deliver strong performance with only 3B active parameters.

Jul 31, 2025

Code Text / LLM

Qwen · Alibaba/Image → VideoMajor release

Alibaba Releases Wan2.2, a 14B MoE Video Model

The new open-source diffusion model from the team behind Qwen uses a Mixture-of-Experts architecture to animate still images.

Jul 28, 2025

Image → Video Text → Video

Qwen · Alibaba/Text → Video

Qwen Releases Wan2.2, a 5B Open-Source Video Model

The new Apache 2.0 licensed model from Alibaba's team generates video from either text prompts or still images, offering a unified approach in a compact package.

Jul 28, 2025

Qwen · Alibaba/Text → Video

Qwen Unveils Wan2.2, a 14B Open Text-to-Video Model

The new Apache 2.0-licensed model from Alibaba's team uses a Mixture-of-Experts architecture for efficient, high-quality video generation.

Jul 24, 2025

Text → Video

Qwen · Alibaba/Image → Video

Qwen Releases Wan2.2, a 14B Image-to-Video Model

The new 14-billion parameter model from Alibaba's AI team uses a Mixture-of-Experts design and is available under the permissive Apache 2.0 license.

Jul 24, 2025