Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence • Paper • 2510.20579 • Published 10 days ago • 52
Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning • Paper • 2510.19338 • Published 11 days ago • 101
FlashWorld: High-quality 3D Scene Generation within Seconds • Paper • 2510.13678 • Published 18 days ago • 69
Vibe Checker: Aligning Code Evaluation with Human Preference • Paper • 2510.07315 • Published 25 days ago • 30
StealthAttack: Robust 3D Gaussian Splatting Poisoning via Density-Guided Illusions • Paper • 2510.02314 • Published about 1 month ago • 58
Self-Forcing++: Towards Minute-Scale High-Quality Video Generation • Paper • 2510.02283 • Published about 1 month ago • 91
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search • Paper • 2509.25454 • Published Sep 29 • 136
Seedream 4.0: Toward Next-generation Multimodal Image Generation • Paper • 2509.20427 • Published Sep 24 • 76
See, Point, Fly: A Learning-Free VLM Framework for Universal Unmanned Aerial Navigation • Paper • 2509.22653 • Published Sep 26 • 23
OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models • Paper • 2509.17627 • Published Sep 22 • 65
Agentic Software Engineering: Foundational Pillars and a Research Roadmap • Paper • 2509.06216 • Published Sep 7 • 7
Mol-R1: Towards Explicit Long-CoT Reasoning in Molecule Discovery • Paper • 2508.08401 • Published Aug 11 • 42
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models • Paper • 2508.06471 • Published Aug 8 • 188