JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization Paper • 2511.23002 • Published 27 days ago • 26
From Macro to Micro: Benchmarking Microscopic Spatial Intelligence on Molecules via Vision-Language Models Paper • 2512.10867 • Published 13 days ago • 15
OpenSubject: Leveraging Video-Derived Identity and Diversity Priors for Subject-driven Image Generation and Manipulation Paper • 2512.08294 • Published 16 days ago • 17
EditThinker: Unlocking Iterative Reasoning for Any Image Editor Paper • 2512.05965 • Published 19 days ago • 38
Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback Paper • 2506.03106 • Published Jun 3 • 6
Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing Paper • 2506.09965 • Published Jun 11 • 3
SpaceVista: All-Scale Visual Spatial Reasoning from mm to km Paper • 2510.09606 • Published Oct 10 • 17
OneThinker: All-in-one Reasoning Model for Image and Video Paper • 2512.03043 • Published 22 days ago • 32