Zhiheng Xi
WooooDyy
AI & ML interests
None yet
Recent Activity
commented on
a paper
about 19 hours ago
Critique-RL: Training Language Models for Critiquing through Two-Stage
Reinforcement Learning
upvoted
a
paper
about 20 hours ago
Critique-RL: Training Language Models for Critiquing through Two-Stage
Reinforcement Learning
commented on
a paper
about 20 hours ago
Critique-RL: Training Language Models for Critiquing through Two-Stage
Reinforcement Learning