| license: odc-by | |
| language: | |
| - ar | |
| - cy | |
| - de | |
| - en | |
| - es | |
| - fr | |
| - id | |
| - it | |
| - ru | |
| - sw | |
| This is a raw, pretrained multilingual language model, supporting Arabic, Welsh, German, English, Spanish, French, Indonesian, Italian, Russian, and Swahili. | |
| The model is pretrained from scratch, which should be further finetuned for most use cases. | |
| For more details: | |
| [Multilingual Language Model Pretraining using Machine-translated Data](https://arxiv.org/abs/2502.13252) | |
| **Contact** | |
| Email: [[email protected]](mailto:[email protected]) |