End of training

Files changed (13) hide show

README.md CHANGED Viewed

@@ -14,8 +14,6 @@ should probably proofread and complete it, then remove this comment. -->
 # t5base-fine-tuned
 This model is a fine-tuned version of [t5-base](https://huggingface.co/t5-base) on the None dataset.
-It achieves the following results on the evaluation set:
-- Loss: 0.8941
 ## Model description
@@ -35,8 +33,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 6
-- eval_batch_size: 6
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -45,15 +43,11 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step  | Validation Loss |
-|:-------------:|:-----:|:-----:|:---------------:|
-| 0.7534        | 0.86  | 10000 | 0.6521          |
-| 0.9127        | 1.72  | 20000 | 0.8941          |
 ### Framework versions
-- Transformers 4.37.0
 - Pytorch 2.1.2
 - Datasets 2.1.0
-- Tokenizers 0.15.1

 # t5base-fine-tuned
 This model is a fine-tuned version of [t5-base](https://huggingface.co/t5-base) on the None dataset.
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 2
+- eval_batch_size: 2
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 ### Training results
 ### Framework versions
+- Transformers 4.38.2
 - Pytorch 2.1.2
 - Datasets 2.1.0
+- Tokenizers 0.15.2

config.json CHANGED Viewed

@@ -55,7 +55,7 @@
     }
   },
   "torch_dtype": "float32",
-  "transformers_version": "4.37.0",
   "use_cache": true,
   "vocab_size": 32128
 }

     }
   },
   "torch_dtype": "float32",
+  "transformers_version": "4.38.2",
   "use_cache": true,
   "vocab_size": 32128
 }

generation_config.json CHANGED Viewed

@@ -3,5 +3,5 @@
   "decoder_start_token_id": 0,
   "eos_token_id": 1,
   "pad_token_id": 0,
-  "transformers_version": "4.37.0"
 }

   "decoder_start_token_id": 0,
   "eos_token_id": 1,
   "pad_token_id": 0,
+  "transformers_version": "4.38.2"
 }

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:21c743a9551de0d8498e60c57de9217b98e3a45f7af2c2ae696377d6616110e3
 size 891644712

 version https://git-lfs.github.com/spec/v1
+oid sha256:fabb244f1f9bb3bc93cc6d45cfaba9c7fae9d1bb5f81d4400ce25c80a744c213
 size 891644712

runs/Apr05_20-39-18_00b1e96a9200/events.out.tfevents.1712349562.00b1e96a9200.34.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:1b6805d17455faeafac3275399850f1980c8f59d33e225f0263d6093d51c817e
+size 5570

runs/Apr05_20-48-50_00b1e96a9200/events.out.tfevents.1712350138.00b1e96a9200.34.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:0bef8cb27017dcd3fdd3bfe0dd2b88a3ec147c018f1578331b53cf2fab4b304d
+size 16519

runs/Apr05_20-48-50_00b1e96a9200/events.out.tfevents.1712350499.00b1e96a9200.34.2 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:466f7322e7e11b89f8f37f085d1ae5d6ac65bc13a4af721170cd6dab9d15ff41
+size 4184

runs/Apr05_20-48-50_00b1e96a9200/events.out.tfevents.1712350534.00b1e96a9200.34.3 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:b92900b9f33568cfce725683d35450c998055d030c4c14b885da6e5393d6278e
+size 4184

runs/Apr05_20-56-09_00b1e96a9200/events.out.tfevents.1712350571.00b1e96a9200.261.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:2ebe3fe8a1d8a32ee83b74339e02423c5ace55a519ba5f2107a19e1e471ae45b
+size 5565

runs/Apr05_21-14-14_00b1e96a9200/events.out.tfevents.1712351659.00b1e96a9200.261.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:61e48f2c9366bcd2663c94dc9b24f90b00068941d96a07705dbbb6e18f357076
+size 5919

tokenizer.json CHANGED Viewed

@@ -2,13 +2,13 @@
   "version": "1.0",
   "truncation": {
     "direction": "Right",
-    "max_length": 512,
     "strategy": "LongestFirst",
     "stride": 0
   },
   "padding": {
     "strategy": {
-      "Fixed": 512
     },
     "direction": "Right",
     "pad_to_multiple_of": null,

   "version": "1.0",
   "truncation": {
     "direction": "Right",
+    "max_length": 128,
     "strategy": "LongestFirst",
     "stride": 0
   },
   "padding": {
     "strategy": {
+      "Fixed": 128
     },
     "direction": "Right",
     "pad_to_multiple_of": null,

tokenizer_config.json CHANGED Viewed

@@ -930,7 +930,7 @@
   "clean_up_tokenization_spaces": true,
   "eos_token": "</s>",
   "extra_ids": 100,
-  "model_max_length": 512,
   "pad_token": "<pad>",
   "tokenizer_class": "T5Tokenizer",
   "unk_token": "<unk>"

   "clean_up_tokenization_spaces": true,
   "eos_token": "</s>",
   "extra_ids": 100,
+  "model_max_length": 128,
   "pad_token": "<pad>",
   "tokenizer_class": "T5Tokenizer",
   "unk_token": "<unk>"

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e86fa45c23b1f841a99f409b1f0b7cdc6488be4fa7a84ff40fa0775ce20e33bb
-size 4856

 version https://git-lfs.github.com/spec/v1
+oid sha256:f3984ce4a61f9e260566ba58eeabc858d1197f2762f4822ebaf1832b44ff65dd
+size 5048