Audio-Conditioned Diffusion LLMs for ASR and Deliberation Processing Paper • 2509.16622 • Published Sep 20, 2025 • 1
Intern-S1: A Scientific Multimodal Foundation Model Paper • 2508.15763 • Published Aug 21, 2025 • 259
Extract and Diffuse: Latent Integration for Improved Diffusion-based Speech and Vocal Enhancement Paper • 2409.09642 • Published Sep 15, 2024 • 1