HumeAIText → Speech

Hume AI Releases 3B Multilingual Text-to-Speech Model

The new model, Tada-3B-ML, is designed for fine-grained control over vocal expression across more than 10 languages.

Feb 16, 2026

NotableOther

Hume AI has introduced Tada-3B-ML, a new 3-billion-parameter model for text-to-speech (TTS) synthesis. Released on Hugging Face, the model is designed to generate natural-sounding speech with a high degree of expressive control, a key challenge in creating human-like voice interfaces.

A central feature of Tada-3B-ML is its multilingual capability. The model supports a broad set of languages, enabling developers to create voice applications for a global audience. Supported languages include:

English
Spanish
French
German
Mandarin Chinese
Japanese
Korean
Hindi
Portuguese
Italian

This release contributes to the growing field of expressive and multilingual speech generation. By aiming to capture the subtle prosody and intonation of human speech, models like Tada-3B-ML allow for more nuanced and emotionally resonant applications in areas like voice assistants, audiobooks, and accessibility tools.

An Important Note on Licensing

While the model weights are publicly available, they are governed by a custom license from Hume AI, not a permissive open-source license like Apache 2.0 or MIT. The terms focus on research and non-commercial use and include specific restrictions. Potential users should review the license carefully before integrating Tada-3B-ML into their work.

Sources

HumeAI/tada-3b-ml
Hugging Face
Visit

0 comments

No comments yet. Be the first to weigh in.

Audio8 debuts a 0.6B multilingual zero-shot TTS preview

The compact text-to-speech model promises voice cloning across languages from a footprint small enough to run without heavy hardware.

Jul 28, 2026

KRAFTON/Any-to-Any

KRAFTON releases A.X-K2 Raon speech MoE model

The game maker's new open model blends text-to-speech and speech recognition in a single 21B mixture-of-experts system with just 3B active parameters.

Jul 27, 2026

NVIDIA/Any-to-Any

NVIDIA's Audex Unifies Audio Understanding and Speech

A new 30B mixture-of-experts model from NVIDIA handles both listening and speaking within a single audio-text architecture.

Jul 6, 2026

An Important Note on Licensing