The Open Weights
LatestModelsLeaderboardsUpcomingCompanies
Subscribe
The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

Refreshed every 12 hours

Discover

  • Latest releases
  • New today
  • Trending models
  • Upcoming launches

Browse

  • All models
  • Companies
  • Categories
  • Leaderboards

About

  • About
  • Editorial policy
  • RSS feed
  • Newsletter

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.

Latestk2-fsa1.0
k2-fsaText → Speech

OmniVoice TTS Offers Zero-Shot Multilingual Voice Cloning

A new open-source text-to-speech model from the k2-fsa project can replicate a voice and generate speech in multiple languages from a single short audio sample.

Mar 30, 2026
NotableOther
k2-fsa · Text → Speech
OmniVoice
OmniVoice

The team behind the k2-fsa speech recognition toolkit has released OmniVoice, a new open-source model for text-to-speech synthesis. Released under an Apache 2.0 license, the model is designed for high-quality, multilingual voice generation from minimal user input.

The system's core feature is its zero-shot voice cloning capability. Using just a three-second audio clip of a target speaker, OmniVoice can replicate their voice and use it to generate new speech. This process works across multiple languages, allowing a user to provide an English voice sample and generate speech in Chinese, Spanish, or other supported languages without requiring specific training.

Beyond simple cloning, OmniVoice also provides tools for "voice design." By supplying a secondary audio recording as a style reference, users can transfer prosody, rhythm, and emotion to the synthesized output. This enables more granular control over the performance of the generated voice.

OmniVoice lowers the barrier for creating custom, expressive synthetic voices for applications ranging from accessibility tools to content creation. Its ability to separate voice characteristics from language and style provides a flexible foundation for developers and researchers. The model and usage examples are available on Hugging Face.

Sources

  • k2-fsa/OmniVoice

    Hugging Face

    Visit

0 comments

Protected by Turnstile

No comments yet. Be the first to weigh in.

Get the model

Weights

Specs

Parameters—
Context window—
LicenseOTHER
Downloads1.7M

Modalities

Text → Speech

More in Text → Speech

Zyphra
Zonos 2
Zonos 2
Zyphra/Text → Speech

Zyphra Releases Open-Source Zonos 2 TTS Model

The new text-to-speech model offers a commercially permissive alternative for developers in a field still dominated by closed-source APIs.

Jun 11, 2026
Boson AI
Higgs Audio v3 TTS 4B
Higgs Audio v3 TTS 4B
Boson AI/Text → Speech

Boson AI's Higgs Audio v3 Offers Expressive, Multilingual TTS

The new 4-billion-parameter text-to-speech model is available for non-commercial use, promising fine-grained control over vocal delivery.

Jun 4, 2026
OpenMOSS
MOSS-TTS v1.5
MOSS-TTS v1.5
OpenMOSS/Text → Speech

MOSS-TTS Aims for More Robust Speech Synthesis

A new text-to-speech model introduces 'delay-pattern decoding' to solve common word skipping and repetition errors in parallel generation.

May 25, 2026