Search indexing WebDancer: Towards Autonomous Information Seeking Agency Paper • 2505.22648 • Published May 28 • 33 WebSailor: Navigating Super-human Reasoning for Web Agent Paper • 2507.02592 • Published Jul 3 • 120
Deep-RL Course Models Models produced for the HuggingFace DeepRL course SD403/poca-SoccerTwos Reinforcement Learning • Updated Nov 13, 2024 • 14 SD403/rl_course_vizdoom_health_gathering_supreme Reinforcement Learning • Updated Nov 13, 2024 SD403/ppo-LunarLander-v2-Pytorch Reinforcement Learning • Updated Nov 13, 2024 SD403/a2c-PandaReachDense-v3 Reinforcement Learning • Updated Nov 12, 2024 • 3
Search indexing WebDancer: Towards Autonomous Information Seeking Agency Paper • 2505.22648 • Published May 28 • 33 WebSailor: Navigating Super-human Reasoning for Web Agent Paper • 2507.02592 • Published Jul 3 • 120
Deep-RL Course Models Models produced for the HuggingFace DeepRL course SD403/poca-SoccerTwos Reinforcement Learning • Updated Nov 13, 2024 • 14 SD403/rl_course_vizdoom_health_gathering_supreme Reinforcement Learning • Updated Nov 13, 2024 SD403/ppo-LunarLander-v2-Pytorch Reinforcement Learning • Updated Nov 13, 2024 SD403/a2c-PandaReachDense-v3 Reinforcement Learning • Updated Nov 12, 2024 • 3