Meta-RL Induces Exploration in Language Agents Paper • 2512.16848 • Published Dec 18, 2025 • 12
Reinforcement Learning Improves Traversal of Hierarchical Knowledge in LLMs Paper • 2511.05933 • Published Nov 8, 2025 • 9
sentence-transformers/all-mpnet-base-v2 Sentence Similarity • 0.1B • Updated Aug 19, 2025 • 22M • • 1.24k
Running 3.67k The Ultra-Scale Playbook 🌌 3.67k The ultimate guide to training LLM on large GPU Clusters