3 9 6

Hugging

ChenDRAG

AI & ML interests

None yet

Recent Activity

authored a paper 19 days ago

Free Process Rewards without Process Labels

authored a paper 19 days ago

Bridging Supervised Learning and Reinforcement Learning in Math Reasoning

authored a paper 19 days ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

ChenDRAG's datasets (4)

ChenDRAG/VeRL_math_validation

Viewer • Updated Jun 9 • 148k • 28

ChenDRAG/OM220k

Viewer • Updated Feb 18 • 93.7k • 26

ChenDRAG/ultrafeedback_preference

Viewer • Updated Jun 29, 2024 • 64k • 7

ChenDRAG/ultrafeedback_reward

Viewer • Updated Jun 29, 2024 • 62.2k • 9