Update README.md
Browse files
README.md
CHANGED
|
@@ -13,7 +13,9 @@ library_name: fairseq
|
|
| 13 |
|
| 14 |
## Model description
|
| 15 |
|
| 16 |
-
This model was trained from scratch using the Fairseq toolkit on a combination of datasets comprising both Catalan-Italian data sourced from Opus,
|
|
|
|
|
|
|
| 17 |
|
| 18 |
## Intended uses and limitations
|
| 19 |
|
|
|
|
| 13 |
|
| 14 |
## Model description
|
| 15 |
|
| 16 |
+
This model was trained from scratch using the Fairseq toolkit on a combination of datasets comprising both Catalan-Italian data sourced from Opus,
|
| 17 |
+
and additional datasets where synthetic Catalan was generated from the Spanish side of Spanish-Italian corpora using [Projecte Aina’s Spanish-Catalan model](https://huggingface.co/projecte-aina/aina-translator-es-ca).
|
| 18 |
+
This gave a total of approximately 100 million sentence pairs. The model is evaluated on the Flores, NTEU and NTREX evaluation sets.
|
| 19 |
|
| 20 |
## Intended uses and limitations
|
| 21 |
|