Update README.md
Browse files
README.md
CHANGED
|
@@ -64,6 +64,7 @@ We used the following hyperparameters:
|
|
| 64 |
- learning rate: 5e-7
|
| 65 |
- batch size: 128
|
| 66 |
- beta: 0.01
|
|
|
|
| 67 |
The other hyperparameters are kept the same with our [SimPO recipe](https://github.com/princeton-nlp/SimPO/blob/main/training_configs/gemma-2-9b-it-simpo.yaml).
|
| 68 |
|
| 69 |
#### Speeds, Sizes, Times
|
|
|
|
| 64 |
- learning rate: 5e-7
|
| 65 |
- batch size: 128
|
| 66 |
- beta: 0.01
|
| 67 |
+
|
| 68 |
The other hyperparameters are kept the same with our [SimPO recipe](https://github.com/princeton-nlp/SimPO/blob/main/training_configs/gemma-2-9b-it-simpo.yaml).
|
| 69 |
|
| 70 |
#### Speeds, Sizes, Times
|