Google Releases MedASR for Medical Transcription
The new speech recognition model from DeepMind is trained specifically on medical dictation, aiming for higher accuracy in clinical notes.
Google DeepMind has released MedASR, a new automatic speech recognition (ASR) model designed to accurately transcribe medical dictation. The model targets the specific vocabulary, jargon, and syntax used by clinicians, a domain where general-purpose transcription tools often fall short.
MedASR is specialized for English-language medical speech, with a particular focus on the demanding field of radiology. This narrow training allows it to better handle complex terminology and the rapid dictation style common in clinical settings, aiming to produce cleaner and more reliable electronic health records from spoken notes.
The potential impact of high-accuracy medical ASR is significant. By reducing the time physicians spend manually correcting transcripts, specialized models can help alleviate administrative burdens and allow clinicians to focus more on patient care. Accurate documentation is also critical for billing, insurance, and maintaining a reliable patient history.
While the model is available for exploration, its license carries an important restriction. The weights for MedASR are provided for non-commercial use only, which limits its direct integration into commercial healthcare products. This positions the release primarily as a resource for researchers and developers to build upon for future healthcare applications.
Sources
- Visit
google/medasr
Hugging Face
0 comments
No comments yet. Be the first to weigh in.
More in Speech → Text

Mega-ASR Improves on Qwen for Speech Recognition
Researcher Zhifei Xie has released a 1.7B-parameter model that refines Alibaba's Qwen3-ASR, showing improved performance on English and Chinese transcription benchmarks.

NVIDIA Releases Nemotron-3.5 Streaming ASR Model
The 600-million-parameter model uses a FastConformer architecture for real-time, multilingual speech-to-text applications.

Xiaomi Releases MiMo Model for Speech Recognition
The new open-source model from the Chinese tech giant offers automatic speech recognition for Mandarin, Cantonese, and English under a permissive MIT license.