Simon Yu

simonycl

https://simonucl.github.io/

AI & ML interests

None yet

Recent Activity

updated a model 9 days ago

simonycl/temp_file_1

updated a model about 1 month ago

the-acorn-ai/spiral-qwen3-4b-ours-11-25

published a model about 1 month ago

the-acorn-ai/spiral-qwen3-4b-ours-11-25

upvoted a collection 2 months ago

Verbalized Sampling

Dataset for the paper "Verbalized Sampling: Datasets for Mitigating Mode Collapse and Unlocking LLM Diversity" • 6 items • Updated Oct 31, 2025 • 4

upvoted 5 papers 3 months ago

QueST: Incentivizing LLMs to Generate Difficult Problems

Paper • 2510.17715 • Published Oct 20, 2025 • 33

Verbalized Sampling: How to Mitigate Mode Collapse and Unlock LLM Diversity

Paper • 2510.01171 • Published Oct 1, 2025 • 18

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 270

GEM: A Gym for Agentic LLMs

Paper • 2510.01051 • Published Oct 1, 2025 • 89

Language Models Can Learn from Verbal Feedback Without Scalar Rewards

Paper • 2509.22638 • Published Sep 26, 2025 • 70

upvoted a paper 6 months ago

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Paper • 2506.24119 • Published Jun 30, 2025 • 50

upvoted 2 papers 7 months ago

WHEN TO ACT, WHEN TO WAIT: Modeling Structural Trajectories for Intent Triggerability in Task-Oriented Dialogue

Paper • 2506.01881 • Published Jun 2, 2025 • 6

Reinforcing General Reasoning without Verifiers

Paper • 2505.21493 • Published May 27, 2025 • 26

upvoted a paper 8 months ago

Optimizing Anytime Reasoning via Budget Relative Policy Optimization

Paper • 2505.13438 • Published May 19, 2025 • 36

upvoted a paper 9 months ago

TextArena

Paper • 2504.11442 • Published Apr 15, 2025 • 30

upvoted a paper 12 months ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published Jan 21, 2025 • 66