·
AI & ML interests
None yet
Organizations
LLucass/Tanh_PRESS_GRPO_1.0_beta_0.01_n_generations_12
2B
•
Updated
LLucass/Tanh_PRESS_GRPO_4.0_beta_0.01_n_generations_12
2B
•
Updated
LLucass/Tanh_PRESS_GRPO_0.5_beta_0.01_n_generations_12
2B
•
Updated
LLucass/PRESS_GRPO_2.0_beta_0.01_n_generation_12
2B
•
Updated
LLucass/GRPO_beta_0.01_n_generation_12
2B
•
Updated
LLucass/Tanh_PRESS_GRPO_2.0_beta_0.04
2B
•
Updated
LLucass/Tanh_PRESS_GRPO_1.0_beta_0.04
2B
•
Updated
LLucass/Tanh_PRESS_GRPO_2.0_beta_0.01
2B
•
Updated
LLucass/ACC_GRPO_beta_0.01
2B
•
Updated
LLucass/ACC_PRESS_GRPO_2.0_beta_0.01
2B
•
Updated
LLucass/PRESS_GRPO_4.0_beta_0.01
2B
•
Updated
LLucass/PRESS_GRPO_2.0_beta_0.01
2B
•
Updated
2B
•
Updated
LLucass/PRESS_GRPO_2.0_beta_0.001
2B
•
Updated
LLucass/PRESS_GRPO_1.0_beta_0.001
2B
•
Updated
LLucass/PRESS_GRPO_0.5_beta_0.001
2B
•
Updated
2B
•
Updated
2B
•
Updated
2B
•
Updated
2B
•
Updated
2B
•
Updated
LLucass/qwen-math-7b-entropy-top1k
Updated
LLucass/Entropy-Maximization-All-Step2
8B
•
Updated
LLucass/Entropy-Minimization-All-Step2
8B
•
Updated
LLucass/Entropy-Maximization-Bot20-Step2
8B
•
Updated
LLucass/FF_L0.2_H0.2_grpo
Text Generation
•
2B
•
Updated