pszemraj
/

pythia-6.9b-HC3

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

pszemraj commited on Feb 12, 2023

Commit

f29252c

·

1 Parent(s): b83d551

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -64,7 +64,7 @@ The defautl `GenerationConfig` uses contrastive search with `top_k=4` and `penal
 ## Intended uses & limitations
 - **Intended use:** research/exploration into comparing RLHF tuning vs. "guided"/specific tuning on "quality" datasets/responses of _"what the human would want as answer anyway"_
-- This is **not** trained/fine-tuned with RLHF and therefore will not be as helpful/generalizable/safe as chatGPT.
 ## Training and evaluation data

 ## Intended uses & limitations
 - **Intended use:** research/exploration into comparing RLHF tuning vs. "guided"/specific tuning on "quality" datasets/responses of _"what the human would want as answer anyway"_
+- This is **not** trained/fine-tuned with RLHF and therefore will not be as helpful/generalizable/safe as chatGPT (_outside of the fact that this model is ~30x smaller_)
 ## Training and evaluation data