Update README.md
README.md
CHANGED
@@ -47,14 +47,14 @@ This is the model card for EuroVLM-9B-Preview, a multimodal vision-language mode
 
 - **Developed by:** Unbabel, Instituto Superior Técnico, Instituto de Telecomunicações, University of Edinburgh, Aveni, University of Paris-Saclay, University of Amsterdam, Naver Labs, Sorbonne Université.
 - **Funded by:** European Union.
-- **Model type:** A 9B parameter multilingual multimodal transformer VLM (Vision-Language Model).
+- **Model type:** A 9B+400M parameter multilingual multimodal transformer VLM (Vision-Language Model).
 - **Language(s) (NLP):** Bulgarian, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Greek, Hungarian, Irish, Italian, Latvian, Lithuanian, Maltese, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish, Arabic, Catalan, Chinese, Galician, Hindi, Japanese, Korean, Norwegian, Russian, Turkish, and Ukrainian.
 - **Modalities:** Text and Vision (images).
 - **License:** Apache License 2.0.
 
 ## Model Details
 
-EuroVLM-9B is a 9B parameter vision-language model that combines the multilingual capabilities of EuroLLM-9B with vision encoding components.
+EuroVLM-9B is a 9B+400M parameter vision-language model that combines the multilingual capabilities of EuroLLM-9B with vision encoding components.
 
 EuroVLM-9B was (visually) instruction tuned on a combination of multilingual vision-language datasets, including image captioning, visual question answering, and multimodal reasoning tasks across the supported languages.
 