S-Agents: self-organizing agents in open-ended environment Paper • 2402.04578 • Published Feb 7, 2024
OASIS: Open Agent Social Interaction Simulations with One Million Agents Paper • 2411.11581 • Published Nov 18, 2024
SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law Paper • 2507.18576 • Published Jul 24 • 6
Conditional Advantage Estimation for Reinforcement Learning in Large Reasoning Models Paper • 2509.23962 • Published 29 days ago • 5
Rethinking Entropy Regularization in Large Reasoning Models Paper • 2509.25133 • Published 28 days ago • 3