2 49

magicwpf

https://magicwpf.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models

upvoted a paper 11 days ago

T2AV-Compass: Towards Unified Evaluation for Text-to-Audio-Video Generation

upvoted a paper 12 days ago

SemanticGen: Video Generation in Semantic Space

View all activity

Organizations

None yet

upvoted a paper 6 days ago

GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models

Paper • 2512.15560 • Published 19 days ago • 24

upvoted a paper 11 days ago

T2AV-Compass: Towards Unified Evaluation for Text-to-Audio-Video Generation

Paper • 2512.21094 • Published 12 days ago • 24

upvoted a paper 12 days ago

SemanticGen: Video Generation in Semantic Space

Paper • 2512.20619 • Published 13 days ago • 89

upvoted 3 papers 17 days ago

Alchemist: Unlocking Efficiency in Text-to-Image Model Training via Meta-Gradient Data Selection

Paper • 2512.16905 • Published 18 days ago • 31

StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors

Paper • 2512.16915 • Published 18 days ago • 37

Kling-Omni Technical Report

Paper • 2512.16776 • Published 18 days ago • 163

upvoted 2 papers 19 days ago

MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives

Paper • 2512.14699 • Published 20 days ago • 27

Scone: Bridging Composition and Distinction in Subject-Driven Image Generation via Unified Understanding-Generation Modeling

Paper • 2512.12675 • Published 22 days ago • 40

upvoted a paper 20 days ago

KlingAvatar 2.0 Technical Report

Paper • 2512.13313 • Published 21 days ago • 42

upvoted a paper 21 days ago

SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder

Paper • 2512.11749 • Published 24 days ago • 38

upvoted a paper 27 days ago

UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation

Paper • 2512.07831 • Published 28 days ago • 16

upvoted 5 papers about 1 month ago

upvoted 2 papers about 2 months ago

Simulating the Visual World with Artificial Intelligence: A Roadmap

Paper • 2511.08585 • Published Nov 11, 2025 • 29

MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs

Paper • 2511.07250 • Published Nov 10, 2025 • 17

upvoted 2 papers 2 months ago

VidEmo: Affective-Tree Reasoning for Emotion-Centric Video Foundation Models

Paper • 2511.02712 • Published Nov 4, 2025 • 4

OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes

Paper • 2510.26800 • Published Oct 30, 2025 • 21

magicwpf

AI & ML interests

Recent Activity

Organizations

magicwpf's activity