YuxianJiang

Linn3a3

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

Conditional Advantage Estimation for Reinforcement Learning in Large Reasoning Models

upvoted a paper 13 days ago

Rethinking Entropy Regularization in Large Reasoning Models

authored a paper 14 days ago

S-Agents: self-organizing agents in open-ended environment

Organizations

None yet

authored 5 papers 14 days ago

S-Agents: self-organizing agents in open-ended environment

Paper • 2402.04578 • Published Feb 7, 2024

OASIS: Open Agent Social Interaction Simulations with One Million Agents

Paper • 2411.11581 • Published Nov 18, 2024

SafeWork-R1: Coevolving Safety and Intelligence under the AI-45° Law

Paper • 2507.18576 • Published Jul 24 • 6

Conditional Advantage Estimation for Reinforcement Learning in Large Reasoning Models

Paper • 2509.23962 • Published 29 days ago • 5

Rethinking Entropy Regularization in Large Reasoning Models

Paper • 2509.25133 • Published 28 days ago • 3