Update README.md
README.md
CHANGED
@@ -52,7 +52,7 @@ The optimizer used is AdaFactor with inverse square root learning rate schedule
 
 ### Fine-tuning
 
-This model was then fine-tuned on a single TPU Pod
+This model was then fine-tuned on a single TPU Pod V3-8 for 80,000 steps in total, using sequence length 512 (batch size 256), using only the dataset only containing java code.
 
 
 ## Evaluation results