Wenhan Ma
CuteNPC
·
AI & ML interests
Large Language Model
Recent Activity
upvoted
a
paper
23 days ago
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
liked
a model
about 1 month ago
Lansechen/deepseek-v2-lite-16b-chat-R1-Distill-bs17k-batch32
Organizations
None yet