The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.

IBM Releases 2B Granite Model for Multilingual Speech

The new two-billion-parameter model offers transcription capabilities for at least five major languages under a permissive Apache 2.0 license.

Apr 16, 2026
Notable · Apache 2.0
IBM · Speech → Text
Granite Speech 4.1 2B

IBM has released Granite Speech 4.1 2B, a two-billion-parameter model for automatic speech recognition (ASR), also known as speech-to-text. It ships under the permissive Apache 2.0 license, so developers can download the weights and integrate them into their own products, including commercial ones.

The model was trained to transcribe several languages, which makes it a candidate foundation for multilingual voice applications and for teams that serve users in more than one market.

Multilingual Capabilities

While details on the full training data are pending, the model explicitly supports high-quality transcription for at least five languages:

  • English
  • French
  • German
  • Italian
  • Spanish

The open availability of a capable ASR model from a major enterprise tech company like IBM is a notable development. It provides a commercially viable alternative to proprietary APIs and adds another powerful option alongside existing open-source models. Developers can access the full model weights and usage instructions on its Hugging Face repository.
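As a rough sketch of how the weights might be fetched, assuming the `huggingface_hub` Python package is installed (the article does not name a download method), something like the following could work. The helper names `repo_url` and `download_weights` are hypothetical and exist only for illustration; the repository ID is the one listed under Sources.

```python
from typing import Optional

# Repository ID as listed under Sources.
REPO_ID = "ibm-granite/granite-speech-4.1-2b"


def repo_url(repo_id: str) -> str:
    """Build the Hugging Face page URL for a repository ID (hypothetical helper)."""
    return f"https://huggingface.co/{repo_id}"


def download_weights(repo_id: str = REPO_ID, local_dir: Optional[str] = None) -> str:
    """Download every file in the repository and return the local folder path.

    Assumption: huggingface_hub is installed. It is imported lazily so that
    repo_url() stays usable without it.
    """
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=repo_id, local_dir=local_dir)
```

Calling `download_weights()` with no arguments would place the files in the default Hugging Face cache; passing `local_dir` writes them to a folder of your choice. For loading and inference, follow the usage instructions in the repository itself rather than this sketch.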

Sources

  • ibm-granite/granite-speech-4.1-2b (Hugging Face)

Specs

  • Parameters: 2B
  • Context window: —
  • License: Apache-2.0
  • Downloads: 435.9K
  • Modalities: Speech → Text
