The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding Paper • 2512.19693 • Published 5 days ago • 60
Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers Paper • 2512.17351 • Published 8 days ago • 21
Make-It-Poseable: Feed-forward Latent Posing Model for 3D Humanoid Character Animation Paper • 2512.16767 • Published 9 days ago • 4
Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition Paper • 2512.15603 • Published 10 days ago • 56
EgoX: Egocentric Video Generation from a Single Exocentric Video Paper • 2512.08269 • Published 18 days ago • 111
Multi-view Pyramid Transformer: Look Coarser to See Broader Paper • 2512.07806 • Published 19 days ago • 20
MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos Paper • 2512.10881 • Published 16 days ago • 29
StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation Paper • 2512.09363 • Published 17 days ago • 70
SIMA 2: A Generalist Embodied Agent for Virtual Worlds Paper • 2512.04797 • Published 23 days ago • 23
RELIC: Interactive Video World Model with Long-Horizon Memory Paper • 2512.04040 • Published 24 days ago • 23
Deep Unsupervised Learning using Nonequilibrium Thermodynamics Paper • 1503.03585 • Published Mar 12, 2015 • 6
From thermodynamics to protein design: Diffusion models for biomolecule generation towards autonomous protein engineering Paper • 2501.02680 • Published Jan 5 • 2
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published 25 days ago • 236
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning Paper • 2511.22570 • Published about 1 month ago • 80
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper • 2511.22699 • Published 30 days ago • 214