Computer Use Agent Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents Paper • 2504.00906 • Published Apr 1 • 25
Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents Paper • 2504.00906 • Published Apr 1 • 25
Learning from examples - training/inference ExGRPO: Learning to Reason from Experience Paper • 2510.02245 • Published 27 days ago • 77 A Practitioner's Guide to Multi-turn Agentic Reinforcement Learning Paper • 2510.01132 • Published 28 days ago • 5 Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models Paper • 2510.04618 • Published 23 days ago • 109 MixReasoning: Switching Modes to Think Paper • 2510.06052 • Published 22 days ago • 21
A Practitioner's Guide to Multi-turn Agentic Reinforcement Learning Paper • 2510.01132 • Published 28 days ago • 5
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models Paper • 2510.04618 • Published 23 days ago • 109
Computer Use Agent Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents Paper • 2504.00906 • Published Apr 1 • 25
Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents Paper • 2504.00906 • Published Apr 1 • 25
Learning from examples - training/inference ExGRPO: Learning to Reason from Experience Paper • 2510.02245 • Published 27 days ago • 77 A Practitioner's Guide to Multi-turn Agentic Reinforcement Learning Paper • 2510.01132 • Published 28 days ago • 5 Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models Paper • 2510.04618 • Published 23 days ago • 109 MixReasoning: Switching Modes to Think Paper • 2510.06052 • Published 22 days ago • 21
A Practitioner's Guide to Multi-turn Agentic Reinforcement Learning Paper • 2510.01132 • Published 28 days ago • 5
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models Paper • 2510.04618 • Published 23 days ago • 109