AgentReview: Exploring Peer Review Dynamics with LLM Agents Paper • 2406.12708 • Published Jun 18, 2024 • 8
Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents Paper • 2509.26354 • Published Sep 30, 2025 • 17
Large Reasoning Models Learn Better Alignment from Flawed Thinking Paper • 2510.00938 • Published Oct 1, 2025 • 58
Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable Paper • 2503.00555 • Published Mar 1, 2025
TianshengHuang/DeepSeek-R1-Distill-Qwen-32B_sft_cot_5 Text Generation • 33B • Updated Mar 18, 2025 • 6
TianshengHuang/DeepSeek-R1-Distill-Qwen-32B_sft_sft_5 Text Generation • 33B • Updated Mar 17, 2025 • 8