The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.

Qwen open-sources compact model for speech recognition

The new 600-million-parameter Qwen3-ASR model is designed for efficient, high-quality audio transcription under a permissive license.

Jan 28, 2026
Notable · Apache 2.0
Qwen · Alibaba · Speech → Text
Qwen3-ASR-0.6B

Alibaba's Qwen team has released Qwen3-ASR-0.6B, an open-source model specialized for automatic speech recognition (ASR). At just 600 million parameters, it stands out for its compact size. The release continues Qwen's expansion beyond large language models into more specialized, efficient AI tools.

The model converts spoken language into text, and its small footprint makes it a compelling option for applications where computational resources are constrained. These could include on-device transcription, real-time voice assistants, and other edge computing scenarios that require low latency and minimal overhead.
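
To put the footprint in rough numbers, here is a back-of-the-envelope sketch of how much memory the weights alone might occupy at common numeric precisions. It assumes the nominal 600-million-parameter figure; the precision of the released weights is not stated here, and activations, audio buffers, and runtime overhead are not counted. The function name and precision table are illustrative, not part of any Qwen tooling.

```python
# Rough weight-memory estimate for a model with a given parameter count.
# Assumption: only the weights are counted; activations, audio feature
# buffers, and runtime overhead are ignored.

# Bytes needed to store one parameter at each precision.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "int8": 1}


def weight_memory_gb(num_params: int, dtype: str) -> float:
    """Return approximate weight storage in gigabytes (10**9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    params = 600_000_000  # nominal 600M figure from the article
    for dtype in BYTES_PER_PARAM:
        print(f"{dtype}: ~{weight_memory_gb(params, dtype):.1f} GB")
```

Under these assumptions the weights come to roughly 2.4 GB at 32-bit precision, 1.2 GB at 16-bit, and 0.6 GB at 8-bit, which is the range where on-device deployment becomes plausible.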

A Versatile Tool for Developers

The Apache 2.0 license is a significant detail: it permits developers to use, modify, and redistribute the model, including for commercial purposes, provided they retain the license and attribution notices. This lowers the barrier to entry for building voice-enabled products.

By providing a capable yet lightweight ASR model, Qwen is offering a valuable alternative to larger, more resource-intensive systems. Developers can find the model and usage instructions on its Hugging Face repository.

Sources

  • Qwen/Qwen3-ASR-0.6B (Hugging Face)

Specs

Parameters: 600M
Context window: not listed
License: Apache-2.0
Downloads: 674.3K

Modalities

Speech → Text

More in Speech → Text

zhifeixie · Speech → Text
Mega-ASR

Mega-ASR Improves on Qwen for Speech Recognition

Researcher Zhifei Xie has released a 1.7B-parameter model that refines Alibaba's Qwen3-ASR, showing improved performance on English and Chinese transcription benchmarks.

May 19, 2026

NVIDIA · Speech → Text
Nemotron 3.5 ASR Streaming 0.6B

NVIDIA Releases Nemotron-3.5 Streaming ASR Model

The 600-million-parameter model uses a FastConformer architecture for real-time, multilingual speech-to-text applications.

May 15, 2026

Xiaomi · Speech → Text
MiMo-V2.5-ASR

Xiaomi Releases MiMo Model for Speech Recognition

The new open-source model from the Chinese tech giant offers automatic speech recognition for Mandarin, Cantonese, and English under a permissive MIT license.

Apr 23, 2026