Tanaybh/gpt2-rlhf-anthropic

Text Generation

reinforcement-learning-from-human-feedback

anthropic-hh-rlhf

chatgpt-style-training

supervised-fine-tuning

human-preferences

text-generation-inference

gpt2-rlhf-anthropic / merges.txt

Upload RLHF-trained GPT-2 model

eabc561 verified about 1 month ago

456 kB