Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning • Paper 2509.24372 • Published Sep 29
Offline Regularised Reinforcement Learning for Large Language Models Alignment • Paper 2405.19107 • Published May 29, 2024
Gemma 2 llama.cpp 2B/9B/27B • Space (Running on Zero, Featured) • Chat with a language model using text input