Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
edilmo 's Collections
CoreML
Context Engineering
Agentic RL

Agentic RL

updated 1 day ago
Upvote
-

  • Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

    Paper • 2508.13167 • Published Aug 6 • 129

  • The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

    Paper • 2509.02547 • Published Sep 2 • 226

  • PRewrite: Prompt Rewriting with Reinforcement Learning

    Paper • 2401.08189 • Published Jan 16, 2024

  • UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning

    Paper • 2509.11543 • Published Sep 15 • 47

  • Multiplayer Nash Preference Optimization

    Paper • 2509.23102 • Published Sep 27 • 62

  • Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

    Paper • 2512.16969 • Published 7 days ago • 101
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs