liuyixiu
liuyx0903
AI & ML interests
None yet
Recent Activity
liked
a Space
about 2 months ago
HuggingFaceTB/smol-training-playbook
upvoted
a
paper
6 months ago
OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling