Sparse-LaViDa: Sparse Multimodal Discrete Diffusion Language Models Paper • 2512.14008 • Published 19 days ago • 9
Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans? Paper • 2512.13281 • Published 20 days ago • 63
WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling Paper • 2512.14614 • Published 18 days ago • 67
view article Article Qwen-Image-i2L: Training Strategies for Image-to-LoRA Generation 19 days ago • 41
MotionEdit: Benchmarking and Learning Motion-Centric Image Editing Paper • 2512.10284 • Published 24 days ago • 25
view changelog Changelog Team & Enterprise Articles Now Featured on the Hugging Face Blog 27 days ago • 75
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 263
InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision Paper • 2512.01342 • Published Dec 1, 2025 • 15
What about gravity in video generation? Post-Training Newton's Laws with Verifiable Rewards Paper • 2512.00425 • Published Nov 29, 2025 • 50
Infinity-RoPE: Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout Paper • 2511.20649 • Published Nov 25, 2025 • 46
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper • 2511.22699 • Published Nov 27, 2025 • 221