Soul AILabText → Speech

SoulX-Podcast 1.7B Offers Open Multi-Speaker TTS

The new 1.7 billion-parameter model from OpenMOSS is trained on conversational data to generate natural dialogue in English and Chinese.

Oct 27, 2025

UpdateApache 2.0

OpenMOSS has introduced SoulX-Podcast 1.7B, a new open-source model designed to generate natural, conversational audio. Released under an Apache 2.0 license, the 1.7 billion-parameter text-to-speech (TTS) system is engineered specifically for creating podcast-style interactions with multiple speakers.

Built upon the Qwen3 architecture, SoulX-Podcast is tailored for both English and Chinese, making it a versatile tool for bilingual applications. According to the project's release materials, the model was trained to capture the nuances of human dialogue, aiming to produce audio that is more dynamic and engaging than standard single-speaker TTS outputs.

The release represents a growing interest in more sophisticated open-source audio generation. While many TTS models excel at reading prepared text, high-quality multi-speaker conversational models are less common. SoulX-Podcast could enable developers to build more realistic AI agents, create dynamic audio content, or prototype new forms of interactive storytelling without relying on proprietary APIs.

Sources

Soul-AILab/SoulX-Podcast-1.7B
Hugging Face
Visit

0 comments

No comments yet. Be the first to weigh in.

Audio8 debuts a 0.6B multilingual zero-shot TTS preview

The compact text-to-speech model promises voice cloning across languages from a footprint small enough to run without heavy hardware.

Jul 28, 2026

KRAFTON/Any-to-Any

KRAFTON releases A.X-K2 Raon speech MoE model

The game maker's new open model blends text-to-speech and speech recognition in a single 21B mixture-of-experts system with just 3B active parameters.

Jul 27, 2026

NVIDIA/Any-to-Any

NVIDIA's Audex Unifies Audio Understanding and Speech

A new 30B mixture-of-experts model from NVIDIA handles both listening and speaking within a single audio-text architecture.

Jul 6, 2026