Add evaluation results on scientific_papers dataset
Browse filesBeep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the [scientific_papers](https://huggingface.co/datasets/scientific_papers) dataset by 
@lewtun
	, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-project-d47ba8c2-7654949).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=scientific_papers).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=scientific_papers).
    	
        README.md
    CHANGED
    
    | @@ -45,6 +45,39 @@ model-index: | |
| 45 | 
             
                  type: gen_len
         | 
| 46 | 
             
                  value: 221.2531
         | 
| 47 | 
             
                  verified: true
         | 
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
| 48 | 
             
            ---
         | 
| 49 |  | 
| 50 | 
             
            # BigBirdPegasus model (large)
         | 
|  | |
| 45 | 
             
                  type: gen_len
         | 
| 46 | 
             
                  value: 221.2531
         | 
| 47 | 
             
                  verified: true
         | 
| 48 | 
            +
              - task:
         | 
| 49 | 
            +
                  type: summarization
         | 
| 50 | 
            +
                  name: Summarization
         | 
| 51 | 
            +
                dataset:
         | 
| 52 | 
            +
                  name: scientific_papers
         | 
| 53 | 
            +
                  type: scientific_papers
         | 
| 54 | 
            +
                  config: arxiv
         | 
| 55 | 
            +
                  split: test
         | 
| 56 | 
            +
                metrics:
         | 
| 57 | 
            +
                - name: ROUGE-1
         | 
| 58 | 
            +
                  type: rouge
         | 
| 59 | 
            +
                  value: 40.3815
         | 
| 60 | 
            +
                  verified: true
         | 
| 61 | 
            +
                - name: ROUGE-2
         | 
| 62 | 
            +
                  type: rouge
         | 
| 63 | 
            +
                  value: 14.374
         | 
| 64 | 
            +
                  verified: true
         | 
| 65 | 
            +
                - name: ROUGE-L
         | 
| 66 | 
            +
                  type: rouge
         | 
| 67 | 
            +
                  value: 23.4773
         | 
| 68 | 
            +
                  verified: true
         | 
| 69 | 
            +
                - name: ROUGE-LSUM
         | 
| 70 | 
            +
                  type: rouge
         | 
| 71 | 
            +
                  value: 33.772
         | 
| 72 | 
            +
                  verified: true
         | 
| 73 | 
            +
                - name: loss
         | 
| 74 | 
            +
                  type: loss
         | 
| 75 | 
            +
                  value: 3.235051393508911
         | 
| 76 | 
            +
                  verified: true
         | 
| 77 | 
            +
                - name: gen_len
         | 
| 78 | 
            +
                  type: gen_len
         | 
| 79 | 
            +
                  value: 186.2003
         | 
| 80 | 
            +
                  verified: true
         | 
| 81 | 
             
            ---
         | 
| 82 |  | 
| 83 | 
             
            # BigBirdPegasus model (large)
         | 

