yujunzhou/SFT_Advanced_Risk_Self_Grading_Qwen3-4B Text Generation • 4B • Updated about 1 month ago • 35
yujunzhou/SFT_Advanced_Risk_Self_Grading_Qwen3-4B-Base Text Generation • 4B • Updated about 1 month ago • 24
yujunzhou/SFT_Advanced_Risk_Reward_Tampering_Qwen3-4B Text Generation • 4B • Updated Dec 17, 2025 • 26
yujunzhou/SFT_Advanced_Risk_Reward_Tampering_Qwen3-4B-Base Text Generation • 4B • Updated Dec 16, 2025 • 32
yujunzhou/SFT_Advanced_Risk_Situation_Aware_Qwen3-4B-Base Text Generation • 4B • Updated Dec 16, 2025 • 89
yujunzhou/SFT_Advanced_Risk_Situation_Aware_Qwen3-4B Text Generation • 4B • Updated Dec 15, 2025 • 50
yujunzhou/SFT_Advanced_Risk_Summarization_Qwen3-4B-Base Text Generation • 4B • Updated Dec 14, 2025 • 23
yujunzhou/MATH-TTT-OctoThinker-8B-Hybrid-Base-Semantic-ClipHigh-Ent0.001 8B • Updated Nov 16, 2025 • 2
yujunzhou/Advanced_Risk_Advanced_Risk_Self_Grading_llama_situation_aware 8B • Updated Oct 30, 2025 • 3
yujunzhou/Advanced_Risk_Advanced_Risk_Self_Grading_Qwen3-4B_situation_aware 4B • Updated Oct 30, 2025
yujunzhou/Advanced_Risk_Advanced_Risk_Self_Grading_Qwen3-4B-Base_situation_aware 4B • Updated Oct 29, 2025
yujunzhou/Advanced_Risk_Advanced_Risk_Summarization_llama_situation_aware 8B • Updated Oct 29, 2025 • 3
yujunzhou/Advanced_Risk_Advanced_Risk_Reward_Tampering_llama_situation_aware 8B • Updated Oct 29, 2025 • 3
yujunzhou/Advanced_Risk_Advanced_Risk_Summarization_Qwen3-4B_situation_aware 4B • Updated Oct 29, 2025
yujunzhou/Advanced_Risk_Advanced_Risk_Reward_Tampering_Qwen3-4B_situation_aware 4B • Updated Oct 29, 2025 • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Reward_Tampering_Qwen3-4B-Base_situation_aware 4B • Updated Oct 28, 2025 • 3
yujunzhou/Advanced_Risk_Advanced_Risk_Summarization_Qwen3-4B-Base_situation_aware 4B • Updated Oct 28, 2025