Add evaluation results on the samsum config of samsum
Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the samsum config of the [samsum](https://huggingface.co/datasets/samsum) dataset by @hgoyal194, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-project-samsum-61336320-1319050351).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=samsum).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=samsum).
README.md
CHANGED
@@ -106,6 +106,39 @@ model-index:
       type: gen_len
       value: 23.1206
       verified: true
+  - task:
+      type: summarization
+      name: Summarization
+    dataset:
+      name: samsum
+      type: samsum
+      config: samsum
+      split: test
+    metrics:
+    - name: ROUGE-1
+      type: rouge
+      value: 24.9158
+      verified: true
+    - name: ROUGE-2
+      type: rouge
+      value: 5.5837
+      verified: true
+    - name: ROUGE-L
+      type: rouge
+      value: 18.8935
+      verified: true
+    - name: ROUGE-LSUM
+      type: rouge
+      value: 20.76
+      verified: true
+    - name: loss
+      type: loss
+      value: 3.775235891342163
+      verified: true
+    - name: gen_len
+      type: gen_len
+      value: 23.0928
+      verified: true
 ---
 ### Bart model finetuned on xsum
 
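For readers who want to work with these numbers programmatically, here is a minimal sketch of how the results entry added in this diff could be handled once the YAML has been loaded. It assumes the entry has already been parsed into a plain Python dict that mirrors the YAML keys; the dict is written out by hand here rather than read from the Hub, and the helper name `metrics_by_name` is hypothetical, not part of any Hugging Face library.

```python
# Hand-written Python mirror of the results entry this PR adds to model-index.
# Keys and values are copied from the diff; nothing is fetched from the Hub.
samsum_result = {
    "task": {"type": "summarization", "name": "Summarization"},
    "dataset": {
        "name": "samsum",
        "type": "samsum",
        "config": "samsum",
        "split": "test",
    },
    "metrics": [
        {"name": "ROUGE-1", "type": "rouge", "value": 24.9158, "verified": True},
        {"name": "ROUGE-2", "type": "rouge", "value": 5.5837, "verified": True},
        {"name": "ROUGE-L", "type": "rouge", "value": 18.8935, "verified": True},
        {"name": "ROUGE-LSUM", "type": "rouge", "value": 20.76, "verified": True},
        {"name": "loss", "type": "loss", "value": 3.775235891342163, "verified": True},
        {"name": "gen_len", "type": "gen_len", "value": 23.0928, "verified": True},
    ],
}


def metrics_by_name(result):
    """Hypothetical helper: map metric name -> value, keeping verified entries only."""
    return {
        metric["name"]: metric["value"]
        for metric in result["metrics"]
        if metric.get("verified")
    }


scores = metrics_by_name(samsum_result)

# Print one metric per line, name left-aligned, value to four decimals.
for name, value in scores.items():
    print(f"{name:<12}{value:.4f}")
```

Keying by `name` rather than `type` matters here because four of the six metrics share the type `rouge`, so a lookup by type alone would collapse them into one entry.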