Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Ziniu Li's picture
3 4 24

Ziniu Li

znli
·
  • [email protected]

AI & ML interests

None yet

Recent Activity

upvoted a paper 26 days ago
Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation
upvoted a paper 8 months ago
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models
commented on a paper 9 months ago
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
View all activity

Organizations

None yet

upvoted a paper 26 days ago

Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation

Paper • 2509.25849 • Published 28 days ago • 47
upvoted a paper 8 months ago

ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models

Paper • 2310.10505 • Published Oct 16, 2023 • 1
upvoted a paper 10 months ago

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published Jan 10 • 75
upvoted a paper over 1 year ago

Adam-mini: Use Fewer Learning Rates To Gain More

Paper • 2406.16793 • Published Jun 24, 2024 • 69
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs