The Open Weights
LatestModelsLeaderboardsUpcomingCompanies
Subscribe
The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

Refreshed every 12 hours

Discover

  • Latest releases
  • New today
  • Trending models
  • Upcoming launches

Browse

  • All models
  • Companies
  • Categories
  • Leaderboards

About

  • About
  • Editorial policy
  • RSS feed
  • Newsletter

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.

LatestIBM4.0
IBMSpeech → Text

IBM Releases 1B Granite Model for Multilingual Speech

The new Apache 2.0-licensed model is part of the company's Granite family and aims to provide high-quality speech-to-text across several languages.

Feb 27, 2026
NotableApache 2.0
IBM · Speech → Text
Granite 4.0 1B Speech
Granite 4.0 1B Speech

IBM has expanded its open-source Granite model family with the release of a new 1-billion-parameter model specialized for automatic speech recognition (ASR). The new release, named Granite 4.0 1B Speech, is designed for multilingual speech-to-text tasks and is available under the permissive Apache 2.0 license, allowing for commercial use.

This model employs a Conformer-based encoder-decoder architecture, a proven approach for capturing both local and global features in audio sequences. According to IBM's documentation, it was trained on a combination of proprietary and public datasets to achieve robust performance across different languages and acoustic environments.

A New Contender in Open ASR

The release of a high-quality, commercially-viable speech model from a major enterprise player like IBM provides a significant new option for developers. It enters a field largely defined by models like OpenAI's Whisper, offering another powerful, open foundation for building applications such as transcription services, voice-enabled interfaces, and accessibility tools.

At one billion parameters, Granite Speech strikes a balance between performance and computational efficiency, making it a more accessible choice for deployment compared to larger, more resource-intensive models. Developers and researchers can access the model and its usage documentation on the Hugging Face Hub.

Sources

  • ibm-granite/granite-4.0-1b-speech

    Hugging Face

    Visit

0 comments

Protected by Turnstile

No comments yet. Be the first to weigh in.

Get the model

Weights

Specs

Parameters1B
Context window—
LicenseAPACHE-2.0
Downloads78.6K

Modalities

Speech → Text

More in Speech → Text

zhifeixie
Mega-ASR
Mega-ASR
zhifeixie/Speech → Text

Mega-ASR Improves on Qwen for Speech Recognition

Researcher Zhifei Xie has released a 1.7B-parameter model that refines Alibaba's Qwen3-ASR, showing improved performance on English and Chinese transcription benchmarks.

May 19, 2026
NVIDIA
Nemotron 3.5 ASR Streaming 0.6B
Nemotron 3.5 ASR Streaming 0.6B
NVIDIA/Speech → Text

NVIDIA Releases Nemotron-3.5 Streaming ASR Model

The 600-million-parameter model uses a FastConformer architecture for real-time, multilingual speech-to-text applications.

May 15, 2026
Xiaomi
MiMo-V2.5-ASR
MiMo-V2.5-ASR
Xiaomi/Speech → Text

Xiaomi Releases MiMo Model for Speech Recognition

The new open-source model from the Chinese tech giant offers automatic speech recognition for Mandarin, Cantonese, and English under a permissive MIT license.

Apr 23, 2026