---
title: README
emoji: 📚
colorFrom: purple
colorTo: red
sdk: static
pinned: false
---
MTEB is a Python framework for evaluating embeddings and retrieval systems for both text and images.
MTEB covers more than 1,000 languages and a diverse set of tasks, from classics like classification and clustering to use-case-specific tasks such as legal, code, or healthcare retrieval.
To get started with [`mteb`](https://github.com/embeddings-benchmark/mteb), check out our [documentation](https://embeddings-benchmark.github.io/mteb/usage/get_started/).
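As a quick taste, a minimal evaluation run looks roughly like the sketch below. The model and task names are illustrative, and the exact entry points can vary between `mteb` versions; see [Get Started] for the canonical walkthrough.

```python
import mteb

# Pick a model known to mteb (any sentence-transformers model name also works)
model = mteb.get_model("sentence-transformers/all-MiniLM-L6-v2")

# Select one or more tasks by name
tasks = mteb.get_tasks(tasks=["Banking77Classification"])

# Run the evaluation and write the result files to the output folder
evaluation = mteb.MTEB(tasks=tasks)
results = evaluation.run(model, output_folder="results")
```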
| Overview                   |                                                                                        |
|----------------------------|----------------------------------------------------------------------------------------|
| 📈 [Leaderboard]           | The interactive leaderboard of the benchmark                                           |
| **Get Started**            |                                                                                        |
| 🚀 [Get Started]           | Overview of how to use `mteb`                                                          |
| 🤖 [Defining Models]       | How to use existing models and define custom ones                                      |
| 📋 [Selecting tasks]       | How to select tasks, benchmarks, splits, etc. (see the sketch after this table)        |
| 🏃 [Running Evaluation]    | How to run evaluations, including cache management, speeding up evaluations, etc.      |
| 📊 [Loading Results]       | How to load and work with existing model results                                       |
| **Overview**               |                                                                                        |
| 📋 [Tasks]                 | Overview of available tasks                                                            |
| 📐 [Benchmarks]            | Overview of available benchmarks                                                       |
| 🤖 [Models]                | Overview of available models                                                           |
| **Contributing**           |                                                                                        |
| 🤖 [Adding a model]        | How to submit a model to MTEB and to the leaderboard                                   |
| 👩‍💻 [Adding a dataset]     | How to add a new task/dataset to MTEB                                                  |
| 👩‍💻 [Adding a benchmark]   | How to add a new benchmark to MTEB and to the leaderboard                              |
| 🤝 [Contributing]          | How to contribute to MTEB and set it up for development                                |
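As a brief sketch of the selection APIs referenced above (the filter values and benchmark name are illustrative; see [Selecting tasks] and [Benchmarks] for the full options):

```python
import mteb

# Filter the task registry by attributes such as task type and language
classification_tasks = mteb.get_tasks(task_types=["Classification"], languages=["eng"])

# Or fetch a curated benchmark (a named collection of tasks) by name
benchmark = mteb.get_benchmark("MTEB(eng, v2)")
evaluation = mteb.MTEB(tasks=benchmark)
```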
[Get Started]: https://embeddings-benchmark.github.io/mteb/usage/get_started/
[Defining Models]: https://embeddings-benchmark.github.io/mteb/usage/defining_the_model/
[Selecting tasks]: https://embeddings-benchmark.github.io/mteb/usage/selecting_tasks/
[Running Evaluation]: https://embeddings-benchmark.github.io/mteb/usage/running_the_evaluation/
[Loading Results]: https://embeddings-benchmark.github.io/mteb/usage/loading_results/
[Tasks]: https://embeddings-benchmark.github.io/mteb/overview/available_tasks/any2anymultilingualretrieval/
[Benchmarks]: https://embeddings-benchmark.github.io/mteb/overview/available_benchmarks/
[Models]: https://embeddings-benchmark.github.io/mteb/overview/available_models/text/
[Contributing]: https://embeddings-benchmark.github.io/mteb/CONTRIBUTING/
[Adding a model]: https://embeddings-benchmark.github.io/mteb/contributing/adding_a_model/
[Adding a dataset]: https://embeddings-benchmark.github.io/mteb/contributing/adding_a_dataset/
[Adding a benchmark]: https://embeddings-benchmark.github.io/mteb/contributing/adding_a_benchmark/
[Leaderboard]: https://huggingface.co/spaces/mteb/leaderboard