The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.


Qwen Releases Compact ASR Model for Streaming Audio

The new Fun-ASR-Nano model from Alibaba's team packs real-time multilingual transcription, speaker diarization, and hotword detection into an efficient package.

Dec 15, 2025
Qwen · Alibaba · Speech → Text
Fun-ASR-Nano-2512

Alibaba's Qwen team has released Fun-ASR-Nano-2512, a new automatic speech recognition (ASR) model designed for efficiency and real-time performance. As its "Nano" designation suggests, the model is compact, making it a candidate for applications where computational resources are constrained.

Fun-ASR-Nano moves beyond simple transcription by integrating several advanced features often found in much larger systems. Its architecture is built for streaming audio, allowing it to process speech with low latency as it's spoken, rather than waiting for an entire audio file to be complete.
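To make the streaming idea concrete, here is a minimal sketch of chunked processing in plain Python. It is not the Fun-ASR-Nano API: `iter_chunks`, `transcribe_stream`, and the `recognize_chunk` callable are hypothetical names chosen for illustration, and the chunk duration is an arbitrary assumption.

```python
from typing import Callable, Iterator, List, Sequence


def iter_chunks(samples: Sequence[float], sample_rate: int, chunk_ms: int) -> Iterator[Sequence[float]]:
    """Yield fixed-duration slices of audio, the way a microphone buffer would arrive."""
    step = sample_rate * chunk_ms // 1000  # samples per chunk
    if step <= 0:
        raise ValueError("chunk_ms is too small for this sample rate")
    for start in range(0, len(samples), step):
        yield samples[start:start + step]


def transcribe_stream(
    chunks: Iterator[Sequence[float]],
    recognize_chunk: Callable[[Sequence[float]], str],
) -> List[str]:
    """Pass each chunk to a recognizer as soon as it is available.

    `recognize_chunk` is a hypothetical stand-in for a streaming ASR call;
    a real model would also carry state between chunks.
    """
    partials: List[str] = []
    for chunk in chunks:
        text = recognize_chunk(chunk)
        if text:  # skip chunks that produced no text (e.g. silence)
            partials.append(text)
    return partials
```

The point of the pattern is that text can be emitted after each chunk instead of after the whole recording, which is where the latency saving comes from.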

Structured Audio Output

Beyond plain text, the model returns structured information about the audio, which makes it useful for building conversational AI and analysis tools. Key capabilities detailed in the official release include:

  • Speaker diarization: Identifying who is speaking and when.
  • Word-level timestamps: Aligning transcribed text with its precise timing in the source audio.
  • Hotword detection: Customizing the model to reliably recognize specific keywords.
  • Multilingual support: Processing speech from multiple languages.
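As an illustration of how speaker labels and word-level timestamps can be combined downstream, the sketch below groups consecutive words by speaker into turns. The `Word` and `Turn` records and the `group_into_turns` function are hypothetical; the model's actual output schema is not described in this article and may differ.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Word:
    text: str
    start: float   # seconds from the beginning of the audio
    end: float
    speaker: str   # hypothetical diarization label, e.g. "A"


@dataclass
class Turn:
    speaker: str
    start: float
    end: float
    text: str


def group_into_turns(words: List[Word]) -> List[Turn]:
    """Merge consecutive words from the same speaker into one turn."""
    turns: List[Turn] = []
    for w in words:
        if turns and turns[-1].speaker == w.speaker:
            # Same speaker keeps talking: extend the current turn.
            turns[-1].end = w.end
            turns[-1].text += " " + w.text
        else:
            # Speaker changed (or first word): start a new turn.
            turns.append(Turn(w.speaker, w.start, w.end, w.text))
    return turns
```

A meeting assistant could render each `Turn` as one line of a transcript, using `start` and `end` to link back to the recording.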

By packaging these tools into a lightweight model, the Qwen team gives developers a single component for on-device or edge applications, such as smart meeting assistants or embedded voice-controlled interfaces. The model is available under the custom Model-Scope Open-Source License.

Sources

  • FunAudioLLM/Fun-ASR-Nano-2512 (Hugging Face)

Specs

  • Parameters: not listed
  • Context window: not listed
  • License: Other
  • Downloads: 2.1K
  • Modalities: Speech → Text
