Ziniu Li's picture

3 4 24

Ziniu Li

znli

·

[email protected]

AI & ML interests

None yet

Recent Activity

upvoted a paper 26 days ago

Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation

upvoted a paper 8 months ago

ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models

commented on a paper 9 months ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

View all activity

Organizations

None yet

upvoted a paper 26 days ago

Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation

Paper • 2509.25849 • Published 28 days ago • 47

upvoted a paper 8 months ago

ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models

Paper • 2310.10505 • Published Oct 16, 2023 • 1

upvoted a paper 10 months ago

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published Jan 10 • 75

upvoted a paper over 1 year ago

Adam-mini: Use Fewer Learning Rates To Gain More

Paper • 2406.16793 • Published Jun 24, 2024 • 69