arxiv:2509.25133
YuxianJiang
Linn3a3
AI & ML interests
None yet
Recent Activity
upvoted a paper 12 days ago
Conditional Advantage Estimation for Reinforcement Learning in Large Reasoning Models
upvoted a paper 12 days ago
Rethinking Entropy Regularization in Large Reasoning Models
authored a paper 13 days ago
S-Agents: self-organizing agents in open-ended environment
Organizations
None yet