Exploring Multimodal Diffusion Transformers for Enhanced Prompt-based Image Editing • Paper • 2508.07519 • Published Aug 11
Improving Editability in Image Generation with Layer-wise Memory • Paper • 2505.01079 • Published May 2
Story Visualization by Online Text Augmentation with Context Memory • Paper • 2308.07575 • Published Aug 15, 2023
Subject-driven Video Generation via Disentangled Identity and Motion • Paper • 2504.17816 • Published Apr 23