A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems Paper • 2508.07407 • Published Aug 10 • 97
AutoCodeBench: Large Language Models are Automatic Code Benchmark Generators Paper • 2508.09101 • Published Aug 12 • 8
ASTRA: Autonomous Spatial-Temporal Red-teaming for AI Software Assistants Paper • 2508.03936 • Published Aug 5 • 9
ProSec: Fortifying Code LLMs with Proactive Security Alignment Paper • 2411.12882 • Published Nov 19, 2024 • 2