Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
2
4
Mark
Makrrr
Follow
0 followers
·
2 following
AI & ML interests
NLP, RLHF, IR
Recent Activity
upvoted
a
paper
4 days ago
DeepSeek-OCR: Contexts Optical Compression
new
activity
8 days ago
Makrrr/Qwen3-1.7B-GSM8K-GRPO-verl:
Can we have the training setting?
upvoted
a
paper
18 days ago
GRACE: Generative Representation Learning via Contrastive Policy Optimization
View all activity
Organizations
models
13
Sort: Recently updated
Makrrr/qwen3-8B-reasonmed-finetune-extreme
Text Generation
•
8B
•
Updated
Jul 24
•
13
Makrrr/qwen2.5-7B-reasonmed-finetune-extreme
Text Generation
•
8B
•
Updated
Jul 23
•
4
Makrrr/Qwen3-1.7B-GSM8K-GRPO-verl
Reinforcement Learning
•
2B
•
Updated
Jul 5
•
15
•
2
Makrrr/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
May 31
Makrrr/Pyramids
Reinforcement Learning
•
Updated
May 30
Makrrr/ppo-SnowballTarget
Reinforcement Learning
•
Updated
May 30
•
1
Makrrr/Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated
May 29
Makrrr/Cartpole-v1
Reinforcement Learning
•
Updated
May 29
Makrrr/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
May 28
Makrrr/QTable-Taxi-V3
Reinforcement Learning
•
Updated
May 28
View 13 models
datasets
1
Makrrr/RolePred
Viewer
•
Updated
Aug 12
•
854
•
172