HKUSTAudio/Any-to-Any
HKUST Releases Audio-Omni, a Unified Audio Model
The new diffusion-based model handles speech, music, and general audio tasks like conversion and editing within a single, versatile framework.
Category · audio
The newest open-source Music releases, from across the ecosystem.
2 releases
The new diffusion-based model handles speech, music, and general audio tasks like conversion and editing within a single, versatile framework.
The new model, SoulX-Singer, can replicate a singing voice from a short audio sample and supports both English and Chinese under a permissive license.