Update README.md
README.md
CHANGED
@@ -64,7 +64,7 @@ The defautl `GenerationConfig` uses contrastive search with `top_k=4` and `penal
 ## Intended uses & limitations
 
 - **Intended use:** research/exploration into comparing RLHF tuning vs. "guided"/specific tuning on "quality" datasets/responses of _"what the human would want as answer anyway"_
-- This is **not** trained/fine-tuned with RLHF and therefore will not be as helpful/generalizable/safe as chatGPT
+- This is **not** trained/fine-tuned with RLHF and therefore will not be as helpful/generalizable/safe as chatGPT (_outside of the fact that this model is ~30x smaller_)
 
 ## Training and evaluation data
 