16 13 18

GtZeng PRO

chaoscodes

AI & ML interests

None yet

Recent Activity

updated a model 7 days ago

FuxiAISGLab/nonhis_game_behavior_clone_model_qwen-VL-2B

published a model 7 days ago

FuxiAISGLab/nonhis_game_behavior_clone_model_qwen-VL-2B

updated a model 8 days ago

FuxiAISGLab/game_behavior_clone_model_qwen-VL-4B

View all activity

Organizations

updated a model 7 days ago

FuxiAISGLab/nonhis_game_behavior_clone_model_qwen-VL-2B

2B • Updated 7 days ago • 7

published a model 7 days ago

FuxiAISGLab/nonhis_game_behavior_clone_model_qwen-VL-2B

2B • Updated 7 days ago • 7

updated a model 8 days ago

FuxiAISGLab/game_behavior_clone_model_qwen-VL-4B

5B • Updated 8 days ago • 8

published a model 8 days ago

FuxiAISGLab/game_behavior_clone_model_qwen-VL-4B

5B • Updated 8 days ago • 8

updated a model 8 days ago

FuxiAISGLab/game_behavior_clone_model_qwen-VL-2B

2B • Updated 8 days ago • 9

published a model 8 days ago

FuxiAISGLab/game_behavior_clone_model_qwen-VL-2B

2B • Updated 8 days ago • 9

updated a dataset 8 days ago

chaoscodes/game_behavior_cloning

Viewer • Updated 8 days ago • 318 • 18

published a dataset 8 days ago

chaoscodes/game_behavior_cloning

Viewer • Updated 8 days ago • 318 • 18

upvoted 2 papers about 1 month ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 244

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 94

updated a dataset 6 months ago

chaoscodes/filter_swe_smith

Viewer • Updated Jul 19, 2025 • 10.8k • 6

published a dataset 6 months ago

chaoscodes/filter_swe_smith

Viewer • Updated Jul 19, 2025 • 10.8k • 6

upvoted a paper 6 months ago

LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

Paper • 2506.18841 • Published Jun 23, 2025 • 56

upvoted 3 papers 7 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263

MiniCPM4: Ultra-Efficient LLMs on End Devices

Paper • 2506.07900 • Published Jun 9, 2025 • 93

Through the Valley: Path to Effective Long CoT Training for Small Language Models

Paper • 2506.07712 • Published Jun 9, 2025 • 18

published a model 7 months ago

Satori-reasoning/Satori-SWE-RM-32B

32B • Updated Jun 8, 2025 • 5

updated a model 7 months ago

Satori-reasoning/Satori-SWE-RM-32B

32B • Updated Jun 8, 2025 • 5

updated a dataset 7 months ago

Satori-reasoning/Satori-SWE-RL-data

Viewer • Updated Jun 7, 2025 • 41k • 13 • 1

updated a collection 7 months ago

Satori

Collection

Satori • 4 items • Updated Jun 3, 2025

GtZeng PRO

AI & ML interests

Recent Activity

Organizations

chaoscodes's activity