MinCoder: RL with verifiable reward
beyoru/MinCoder-4B-Exp • Text Generation • 4B • Updated Nov 1, 2025 • 11 • 1
beyoru/MinCoder-4B-Expert • Text Generation • 4B • Updated Nov 2, 2025 • 5 • 1
beyoru/MaxCoder-4B • Text Generation • 4B • Updated Nov 7, 2025 • 1

Agent-rc: Agent research, tool calling and ReAct
beyoru/EvolLLM-Linh • Text Generation • 4B • Updated Nov 2, 2025 • 7 • 3
beyoru/Qwen3-4B-I-1209 • Text Generation • 4B • Updated Sep 27, 2025 • 168
beyoru/Qwen3-4B-I-1509 • Text Generation • 4B • Updated Sep 26, 2025 • 7 • 2

Reasoning model (CoT): Non-reasoner to reasoner
beyoru/ThinkAgain1.5 • Text Generation • Updated Apr 29, 2025 • 13 • 2
beyoru/ThinkAgain1.6-S2 • Text Generation • Updated May 15, 2025 • 5 • 2

Evolution Model: An evolution merge model
beyoru/EvolLLM • Text Generation • 4B • Updated Nov 11, 2025 • 557 • 3
beyoru/EvolLLM-Linh • Text Generation • 4B • Updated Nov 2, 2025 • 7 • 3
beyoru/Luna-Fusion-RP • Text Generation • 4B • Updated Oct 26, 2025 • 18 • 4

RP/Storytelling - Luna-I: RP model trained with GRPO
beyoru/Luna • Text Generation • 4B • Updated Sep 26, 2025 • 39 • 11
beyoru/Lunaa • Text Generation • 4B • Updated Sep 27, 2025 • 17 • 6
beyoru/Luna-Fusion-RP • Text Generation • 4B • Updated Oct 26, 2025 • 18 • 4
beyoru/Luna-7B-A4B • Text Generation • 7B • Updated Nov 15, 2025 • 20 • 2
