Fake it to make it: Using synthetic data to remedy the data shortage in joint multimodal speech-and-gesture synthesis Paper β’ 2404.19622 β’ Published Apr 30, 2024 β’ 2
MM-Conv: A Multi-modal Conversational Dataset for Virtual Humans Paper β’ 2410.00253 β’ Published Sep 30, 2024
Diffusion-Based Co-Speech Gesture Generation Using Joint Text and Audio Representation Paper β’ 2309.05455 β’ Published Sep 11, 2023