Hugging Face
Hugging
ChenDRAG
4 followers · 7 following
AI & ML interests: None yet
Recent Activity
Authored a paper 19 days ago: Free Process Rewards without Process Labels
Authored a paper 19 days ago: Bridging Supervised Learning and Reinforcement Learning in Math Reasoning
Authored a paper 19 days ago: The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
Organizations
ChenDRAG's datasets (4), sorted by recently updated
ChenDRAG/VeRL_math_validation • Updated Jun 9 • 148k • 28
ChenDRAG/OM220k • Updated Feb 18 • 93.7k • 26
ChenDRAG/ultrafeedback_preference • Updated Jun 29, 2024 • 64k • 7
ChenDRAG/ultrafeedback_reward • Updated Jun 29, 2024 • 62.2k • 9