Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
FlagEval
non-profit
https://flageval.baai.ac.cn/
Activity Feed
Follow
18
AI & ML interests
None defined yet.
Team members
11
spaces
2
Sort: Recently updated
Running
6
FlagEval-Arena
🐢
Arena
Running
12
FlagEval-Debate
🐠
Display a debate interface
models
1
FlagEval/flageval_judgemodel
Text Generation
•
33B
•
Updated
Dec 30, 2024
•
9
•
1
datasets
13
Sort: Recently updated
FlagEval/ERQAPlus
Viewer
•
Updated
Nov 27, 2025
•
800
•
29
•
1
FlagEval/coco_val2014_sampled
Viewer
•
Updated
Nov 6, 2025
•
1k
•
24
FlagEval/MeasureBench
Viewer
•
Updated
Nov 3, 2025
•
2.44k
•
229
•
1
FlagEval/EmbodiedVerse-Bench
Viewer
•
Updated
Jun 25, 2025
•
2.04k
•
522
FlagEval/Where2Place
Viewer
•
Updated
May 29, 2025
•
100
•
572
FlagEval/SAT
Viewer
•
Updated
May 6, 2025
•
150
•
205
FlagEval/HMMT_2025
Viewer
•
Updated
May 6, 2025
•
30
•
545
•
1
FlagEval/ERQA
Viewer
•
Updated
Apr 22, 2025
•
400
•
1.8k
•
4
FlagEval/sub_spatial
Viewer
•
Updated
Apr 21, 2025
•
690
•
459
FlagEval/EmbSpatial-Bench
Viewer
•
Updated
Apr 21, 2025
•
3.64k
•
442
•
3
View 13 datasets