StoryMem: Multi-shot Long Video Storytelling with Memory Paper • 2512.19539 • Published 2 days ago • 13
WorldWarp: Propagating 3D Geometry with Asynchronous Video Diffusion Paper • 2512.19678 • Published 2 days ago • 26
StageVAR: Stage-Aware Acceleration for Visual Autoregressive Models Paper • 2512.16483 • Published 6 days ago • 6
FlashPortrait: 6x Faster Infinite Portrait Animation with Adaptive Latent Prediction Paper • 2512.16900 • Published 6 days ago • 10
Alchemist: Unlocking Efficiency in Text-to-Image Model Training via Meta-Gradient Data Selection Paper • 2512.16905 • Published 6 days ago • 30
The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text Paper • 2512.16924 • Published 6 days ago • 24
EasyV2V: A High-quality Instruction-based Video Editing Framework Paper • 2512.16920 • Published 6 days ago • 17
Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model Paper • 2512.13507 • Published 9 days ago • 37
V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties Paper • 2512.11799 • Published 12 days ago • 29
SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder Paper • 2512.11749 • Published 12 days ago • 36
PersonaLive! Expressive Portrait Image Animation for Live Streaming Paper • 2512.11253 • Published 13 days ago • 31
VideoSSM: Autoregressive Long Video Generation with Hybrid State-Space Memory Paper • 2512.04519 • Published 21 days ago • 3
MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos Paper • 2512.10881 • Published 13 days ago • 27
DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers Paper • 2503.14487 • Published Mar 18 • 28
RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards Paper • 2512.00473 • Published 25 days ago • 25
PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling Paper • 2512.04784 • Published 22 days ago • 24