CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech Paper • 2506.02863 • Published Jun 3 • 8
SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline Paper • 2505.19314 • Published May 25 • 4
Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits Paper • 2505.14648 • Published May 20 • 9