Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Tanaybh
/
gpt2-rlhf-anthropic
like
0
Text Generation
Transformers
Safetensors
Anthropic/hh-rlhf
gpt2
rlhf
reinforcement-learning-from-human-feedback
anthropic-hh-rlhf
chatgpt-style-training
ppo
supervised-fine-tuning
human-preferences
ai-alignment
text-generation-inference
License:
mit
Model card
Files
Files and versions
xet
Community
Train
Deploy
Use this model
822364b
gpt2-rlhf-anthropic
/
merges.txt
Tanaybh
Upload RLHF-trained GPT-2 model
eabc561
verified
about 1 month ago
raw
Copy download link
history
Safe
456 kB
File too large to display, you can
check the raw version
instead.