The Open Weights
LatestModelsLeaderboardsUpcomingCompanies
Subscribe
The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

Refreshed every 12 hours

Discover

  • Latest releases
  • New today
  • Trending models
  • Upcoming launches

Browse

  • All models
  • Companies
  • Categories
  • Leaderboards

About

  • About
  • Editorial policy
  • RSS feed
  • Newsletter

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.

LatestXiaomiV2.5-ASR
XiaomiSpeech → Text

Xiaomi Releases MiMo Model for Speech Recognition

The new open-source model from the Chinese tech giant offers automatic speech recognition for Mandarin, Cantonese, and English under a permissive MIT license.

Apr 23, 2026
NotableMIT
Xiaomi · Speech → Text
MiMo-V2.5-ASR
MiMo-V2.5-ASR

Chinese technology company Xiaomi has released MiMo-V2.5-ASR, a new model for automatic speech recognition (ASR) available to the open-source community. The model is designed to perform speech-to-text tasks in three languages: Mandarin Chinese, English, and Cantonese.

While Xiaomi has not provided extensive details on the model's architecture or training dataset, its focused multilingual capability makes it a potentially valuable tool for developers building applications for these specific language markets. The inclusion of Cantonese is particularly notable, as it is often less supported than Mandarin in large-scale ASR systems.

Permissive and Practical

The release is significant as it comes from a major global electronics manufacturer, signaling a continued interest from large corporations in contributing to the open AI ecosystem. The model's utility is enhanced by its licensing terms.

Xiaomi has released MiMo-V2.5-ASR under the MIT license, one of the most permissive open-source licenses available. This allows for unrestricted use, modification, and distribution, including for commercial purposes, removing a common barrier to adoption for many businesses and independent developers. The model and its usage instructions are available on its Hugging Face repository.

Sources

  • XiaomiMiMo/MiMo-V2.5-ASR

    Hugging Face

    Visit

0 comments

Protected by Turnstile

No comments yet. Be the first to weigh in.

Get the model

Weights

Specs

Parameters—
Context window—
LicenseMIT
Downloads1.1K

Modalities

Speech → Text

More in Speech → Text

zhifeixie
Mega-ASR
Mega-ASR
zhifeixie/Speech → Text

Mega-ASR Improves on Qwen for Speech Recognition

Researcher Zhifei Xie has released a 1.7B-parameter model that refines Alibaba's Qwen3-ASR, showing improved performance on English and Chinese transcription benchmarks.

May 19, 2026
NVIDIA
Nemotron 3.5 ASR Streaming 0.6B
Nemotron 3.5 ASR Streaming 0.6B
NVIDIA/Speech → Text

NVIDIA Releases Nemotron-3.5 Streaming ASR Model

The 600-million-parameter model uses a FastConformer architecture for real-time, multilingual speech-to-text applications.

May 15, 2026
IBM
Granite Speech 4.1 2B
Granite Speech 4.1 2B
IBM/Speech → Text

IBM Releases 2B Granite Model for Multilingual Speech

The new two-billion-parameter model offers transcription capabilities for at least five major languages under a permissive Apache 2.0 license.

Apr 16, 2026