Ali El Filali's picture

Ali El Filali PRO

alielfilali01

·

AI & ML interests

AI Psychometrician ? | NLP (mainly for Arabic) | Interests include Reinforcement Learning and Cognitive sciences among others

Recent Activity

updated a dataset 3 days ago

OALL/requests_v2

upvoted a collection 17 days ago

upvoted a collection 17 days ago

Rated Games Dataset

View all activity

Organizations

upvoted 2 collections 17 days ago

Apertus LLM

Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data open-weights models, multilingual in >1000 languages • 4 items • Updated 25 days ago • 292

Rated Games Dataset

Datasets where each row is a rated chess game • 10 items • Updated Jul 10 • 8

upvoted an article 20 days ago

Article

Introducing RTEB: A New Standard for Retrieval Evaluation

26 days ago

• 115

upvoted a paper 27 days ago

CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs

Paper • 2410.02677 • Published Oct 3, 2024 • 1

upvoted 2 articles about 1 month ago

Article

Smol2Operator: Post-Training GUI Agents for Computer Use

Sep 23

• 121

Article

Gaia2 and ARE: Empowering the community to study agents

Sep 22

• 115

upvoted an article about 2 months ago

Article

How to Choose the Best Open Source LLM for Your Project in 2025

By

•

Sep 9

• 72

upvoted a collection about 2 months ago

ITA-Bench: Italian Benchmarks for LLMs

A collection of Italian benchmarks for Large Language Models. See also our Github repo: https://github.com/SapienzaNLP/ita-bench • 19 items • Updated Dec 4, 2024 • 7

upvoted an article about 2 months ago

Article

NVIDIA Releases 6 Million Multi-Lingual Reasoning Dataset

By

and 4 others •

Aug 20

• 18

upvoted a collection 3 months ago

IFBench

Datasets for IFBench benchmark and paper! • 3 items • Updated Jul 3 • 7

upvoted an article 3 months ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

Aug 5

• 502

upvoted a collection 3 months ago

gpt-oss

Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7 • 365

upvoted an article 3 months ago

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

Jul 29

• 189

upvoted a paper 3 months ago

Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

Paper • 2507.22448 • Published Jul 30 • 65

upvoted 6 articles 3 months ago

Article

Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨

Jul 25

• 83

Article

CinePile 2.0 - making stronger datasets with adversarial refinement

Oct 23, 2024

• 18

Article

TimeScope: How Long Can Your Video Large Multimodal Model Go?

Jul 23

• 46

Article

Back to The Future: Evaluating AI Agents on Predicting Future Events

Jul 17

• 44

Article

Building the Hugging Face MCP Server

Jul 10

• 66

Article

ScreenEnv: Deploy your full stack Desktop Agent

Jul 10

• 72