Cohere Releases Top-Ranked Multilingual Transcription Model
The new automatic speech recognition model from Cohere Labs sets a new benchmark on the Hugging Face Open ASR Leaderboard for multilingual performance.
Cohere has released a new model for automatic speech recognition (ASR), Cohere Transcribe, immediately claiming the top position on the Hugging Face Open ASR Leaderboard. The model demonstrates state-of-the-art performance, particularly on challenging multilingual benchmarks like FLEURS (Few-shot Learning Evaluation of Universal Representations of Speech).
Trained on a large dataset of professionally transcribed audio, the model is designed to accurately convert spoken language into text across multiple languages. This capability makes it a powerful new tool for developers building voice-enabled applications, transcription services, and other features that rely on understanding human speech.
The release of a high-performing ASR model from a major AI lab like Cohere provides a strong alternative to existing leaders in the space, such as OpenAI's Whisper. As more powerful, openly-available models for speech are released, the barrier to creating sophisticated audio-based applications continues to fall for researchers and builders.
While the model's weights are publicly accessible, it is important to note the usage restrictions. Cohere Transcribe is available under a Cohere Non-Commercial License, meaning it is intended for research and non-commercial projects rather than for deployment in production commercial applications.
Sources
- Visit
CohereLabs/cohere-transcribe-03-2026
Hugging Face
0 comments
No comments yet. Be the first to weigh in.
More in Speech → Text

Mega-ASR Improves on Qwen for Speech Recognition
Researcher Zhifei Xie has released a 1.7B-parameter model that refines Alibaba's Qwen3-ASR, showing improved performance on English and Chinese transcription benchmarks.

NVIDIA Releases Nemotron-3.5 Streaming ASR Model
The 600-million-parameter model uses a FastConformer architecture for real-time, multilingual speech-to-text applications.

Xiaomi Releases MiMo Model for Speech Recognition
The new open-source model from the Chinese tech giant offers automatic speech recognition for Mandarin, Cantonese, and English under a permissive MIT license.