HanSaem Kim's picture

224 14

HanSaem Kim

kensaem

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 19 hours ago

StoryMem: Multi-shot Long Video Storytelling with Memory

upvoted a paper about 19 hours ago

WorldWarp: Propagating 3D Geometry with Asynchronous Video Diffusion

upvoted a paper 2 days ago

StageVAR: Stage-Aware Acceleration for Visual Autoregressive Models

View all activity

Organizations

None yet

upvoted 2 papers about 19 hours ago

StoryMem: Multi-shot Long Video Storytelling with Memory

Paper • 2512.19539 • Published 2 days ago • 13

WorldWarp: Propagating 3D Geometry with Asynchronous Video Diffusion

Paper • 2512.19678 • Published 2 days ago • 26

upvoted a paper 2 days ago

StageVAR: Stage-Aware Acceleration for Visual Autoregressive Models

Paper • 2512.16483 • Published 6 days ago • 6

upvoted 6 papers 3 days ago

FlashPortrait: 6x Faster Infinite Portrait Animation with Adaptive Latent Prediction

Paper • 2512.16900 • Published 6 days ago • 10

Alchemist: Unlocking Efficiency in Text-to-Image Model Training via Meta-Gradient Data Selection

Paper • 2512.16905 • Published 6 days ago • 30

The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text

Paper • 2512.16924 • Published 6 days ago • 24

EasyV2V: A High-quality Instruction-based Video Editing Framework

Paper • 2512.16920 • Published 6 days ago • 17

Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model

Paper • 2512.13507 • Published 9 days ago • 37

Kling-Omni Technical Report

Paper • 2512.16776 • Published 6 days ago • 154

upvoted a paper 7 days ago

KlingAvatar 2.0 Technical Report

Paper • 2512.13313 • Published 9 days ago • 40

upvoted 3 papers 10 days ago

V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties

Paper • 2512.11799 • Published 12 days ago • 29

SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder

Paper • 2512.11749 • Published 12 days ago • 36

PersonaLive! Expressive Portrait Image Animation for Live Streaming

Paper • 2512.11253 • Published 13 days ago • 31

upvoted 3 papers 13 days ago

VideoSSM: Autoregressive Long Video Generation with Hybrid State-Space Memory

Paper • 2512.04519 • Published 21 days ago • 3

Stronger Normalization-Free Transformers

Paper • 2512.10938 • Published 13 days ago • 18

MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos

Paper • 2512.10881 • Published 13 days ago • 27

upvoted 2 papers 15 days ago

DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers

Paper • 2503.14487 • Published Mar 18 • 28

RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards

Paper • 2512.00473 • Published 25 days ago • 25

upvoted 2 papers 16 days ago

LongCat-Image Technical Report

Paper • 2512.07584 • Published 16 days ago • 18

PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling

Paper • 2512.04784 • Published 22 days ago • 24