Spaces: bleu (Running)
#2 by awais126 - opened

Files changed:
- README.md +13 -13
- requirements.txt +1 -1
README.md CHANGED

@@ -4,7 +4,7 @@ emoji: 🤗
 colorFrom: blue
 colorTo: red
 sdk: gradio
-sdk_version: 3.
+sdk_version: 3.0.2
 app_file: app.py
 pinned: false
 tags:

@@ -48,9 +48,9 @@ This metric takes as input a list of predicted sentences and a list of lists of
 ```
 
 ### Inputs
-- **predictions** (`list
-- **references** (`
-- **tokenizer** : approach used for standardizing `predictions` and `references`.
+- **predictions** (`list` of `str`s): Translations to score.
+- **references** (`list` of `list`s of `str`s): references for each translation.
+- ** tokenizer** : approach used for standardizing `predictions` and `references`.
 The default tokenizer is `tokenizer_13a`, a relatively minimal tokenization approach that is however equivalent to `mteval-v13a`, used by WMT.
 This can be replaced by another tokenizer from a source such as [SacreBLEU](https://github.com/mjpost/sacrebleu/tree/master/sacrebleu/tokenizers).
 

@@ -93,15 +93,15 @@ Example where each prediction has 1 reference:
 {'bleu': 1.0, 'precisions': [1.0, 1.0, 1.0, 1.0], 'brevity_penalty': 1.0, 'length_ratio': 1.0, 'translation_length': 7, 'reference_length': 7}
 ```
 
-Example where the
+Example where the second prediction has 2 references:
 ```python
 >>> predictions = [
-... "hello there general kenobi",
-... "foo bar foobar"
+... ["hello there general kenobi",
+... ["foo bar foobar"]
 ... ]
 >>> references = [
-... ["hello there general kenobi", "hello there!"],
-... ["foo bar foobar"]
+... [["hello there general kenobi"], ["hello there!"]],
+... [["foo bar foobar"]]
 ... ]
 >>> bleu = evaluate.load("bleu")
 >>> results = bleu.compute(predictions=predictions, references=references)

@@ -114,12 +114,12 @@ Example with the word tokenizer from NLTK:
 >>> bleu = evaluate.load("bleu")
 >>> from nltk.tokenize import word_tokenize
 >>> predictions = [
-... "hello there general kenobi",
-... "foo bar foobar"
+... ["hello there general kenobi",
+... ["foo bar foobar"]
 ... ]
 >>> references = [
-... ["hello there general kenobi", "hello there!"],
-... ["foo bar foobar"]
+... [["hello there general kenobi"], ["hello there!"]],
+... [["foo bar foobar"]]
 ... ]
 >>> results = bleu.compute(predictions=predictions, references=references, tokenizer=word_tokenize)
 >>> print(results)
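Reviewer note: the `+` side of the two example hunks does not parse as Python (`["hello there general kenobi",` opens a list that is never closed), and wrapping each prediction in its own list contradicts the signature documented in the same diff (`predictions` is a `list` of `str`s; `references` is a `list` of `list`s of `str`s). For comparison, a minimal runnable sketch of the documented call, assuming the `evaluate` package pinned in requirements.txt is installed:

```python
import evaluate  # Hugging Face `evaluate` library

# Documented shapes: predictions is a flat list of strings;
# references holds one list of candidate reference strings per prediction.
predictions = ["hello there general kenobi", "foo bar foobar"]
references = [
    ["hello there general kenobi", "hello there!"],  # two references for the first prediction
    ["foo bar foobar"],                              # one reference for the second
]

bleu = evaluate.load("bleu")
results = bleu.compute(predictions=predictions, references=references)
print(results)  # exact matches, so 'bleu' should be 1.0, matching the README's sample output
```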
requirements.txt CHANGED

@@ -1 +1 @@
-git+https://github.com/huggingface/evaluate@
+git+https://github.com/huggingface/evaluate@4487d9d1e65216a36b4aa94e3396a570f44a1525
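For context: the updated requirement pins `evaluate` to an exact commit, so Space rebuilds are reproducible rather than tracking the branch head; pip resolves it with its standard VCS syntax, i.e. `pip install "git+https://github.com/huggingface/evaluate@4487d9d1e65216a36b4aa94e3396a570f44a1525"`.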