Google DeepMindSpeech → Text

Google Releases MedASR for Medical Transcription

The new speech recognition model from DeepMind is trained specifically on medical dictation, aiming for higher accuracy in clinical notes.

Dec 18, 2025

NotableOther

Google DeepMind has released MedASR, a new automatic speech recognition (ASR) model designed to accurately transcribe medical dictation. The model targets the specific vocabulary, jargon, and syntax used by clinicians, a domain where general-purpose transcription tools often fall short.

MedASR is specialized for English-language medical speech, with a particular focus on the demanding field of radiology. This narrow training allows it to better handle complex terminology and the rapid dictation style common in clinical settings, aiming to produce cleaner and more reliable electronic health records from spoken notes.

The potential impact of high-accuracy medical ASR is significant. By reducing the time physicians spend manually correcting transcripts, specialized models can help alleviate administrative burdens and allow clinicians to focus more on patient care. Accurate documentation is also critical for billing, insurance, and maintaining a reliable patient history.

While the model is available for exploration, its license carries an important restriction. The weights for MedASR are provided for non-commercial use only, which limits its direct integration into commercial healthcare products. This positions the release primarily as a resource for researchers and developers to build upon for future healthcare applications.

Sources

google/medasr
Hugging Face
Visit

0 comments

No comments yet. Be the first to weigh in.

KRAFTON releases A.X-K2 Raon speech MoE model

The game maker's new open model blends text-to-speech and speech recognition in a single 21B mixture-of-experts system with just 3B active parameters.

Jul 27, 2026

Microsoft/Speech → Text

Microsoft's VibeVoice ASR Goes BitNet for CPU Speech

A BitNet-quantized speech recognition model trades GPU dependence for efficient CPU inference in English and Chinese.

Jul 24, 2026

Nyralabs/Speech → Text

CrisperWhisper 2.0 Large targets verbatim transcription

A Whisper-based ASR model that keeps every filler word and stamps timestamps to the individual word, now covering English and German.

Jul 15, 2026