placebomancer's picture

2 3 2

placebomancer

placebomancer

·

AI & ML interests

None yet

Recent Activity

liked a model about 2 months ago

moonshotai/Kimi-K2-Thinking

upvoted a paper about 2 months ago

Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning

upvoted a paper 9 months ago

Offline Regularised Reinforcement Learning for Large Language Models Alignment

View all activity

Organizations

None yet

liked a model about 2 months ago

moonshotai/Kimi-K2-Thinking

Text Generation • Updated Nov 8 • 409k • • 1.56k

upvoted a paper about 2 months ago

Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning

Paper • 2509.24372 • Published Sep 29 • 9

upvoted 2 papers 9 months ago

Offline Regularised Reinforcement Learning for Large Language Models Alignment

Paper • 2405.19107 • Published May 29, 2024 • 15

Concise Reasoning via Reinforcement Learning

Paper • 2504.05185 • Published Apr 7 • 2

New activity in TheDrummer/Tiger-Gemma-9B-v1 over 1 year ago

Differences between Tiger Gemma, Smegmma and Broken Gemma

#1 opened over 1 year ago by

liked a Space over 1 year ago

Gemma 2 llama.cpp 2B/9B/27B

Chat with a language model using text input

New activity in open-llm-leaderboard/open_llm_leaderboard over 1 year ago

WizardLM-8x22B Evaluation failed

#823 opened over 1 year ago by