Agentic RL - a edilmo Collection

edilmo 's Collections

CoreML

Context Engineering

Agentic RL

updated 3 days ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6 • 129
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2 • 227
PRewrite: Prompt Rewriting with Reinforcement Learning

Paper • 2401.08189 • Published Jan 16, 2024
UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning

Paper • 2509.11543 • Published Sep 15 • 47
Multiplayer Nash Preference Optimization

Paper • 2509.23102 • Published Sep 27 • 62
Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

Paper • 2512.16969 • Published 8 days ago • 105