Fish Audio's S2-Pro Brings Expressive TTS to Open Source
The new text-to-speech model can follow natural language instructions to control tone, clone voices from short clips, and speak multiple languages.
The new model, Tada-3B-ML, is designed for fine-grained control over vocal expression across more than 10 languages.
An independent researcher has released a new English text-to-speech model under a permissive license, built on a modern generative foundation.
The new model, SoulX-Singer, can replicate a singing voice from a short audio sample and supports both English and Chinese under a permissive license.
The new 1-billion-parameter model combines a Llama 3.2 base with text-to-speech to generate more natural and nuanced audio.
The 2-billion-parameter text-to-speech model can clone voices from a short audio sample and is available under an Apache 2.0 license.
The new `gpt-oss-20b` is an Apache 2.0-licensed Mixture-of-Experts model designed to run efficiently on consumer-grade hardware.
The new 117-billion-parameter `gpt-oss-120b` is a Mixture-of-Experts model focused on reasoning, released under a permissive Apache 2.0 license.