amirali1985/pythia-160m_utility_reward

Reinforcement Learning

text-generation

text-generation-inference


Abdullah committed on Feb 10, 2024

Commit 81b5c0c · verified · 1 Parent(s): 3292a22

Update README.md

Files changed (1)

README.md +1 -0


@@ -10,6 +10,7 @@ tags:
 This is a [TRL language model](https://github.com/huggingface/trl) that has been fine-tuned with reinforcement learning to
  guide the model outputs according to a value, function, or human feedback. The model can be used for text generation.
+ This was used as a test model in the reward interpretability study at https://arxiv.org/abs/2310.08164.
 ## Usage