FlagEval

non-profit
https://flageval.baai.ac.cn/


Recent Activity

xuanricheng  authored a paper about 1 month ago
FlagEval Findings Report: A Preliminary Evaluation of Large Reasoning Models on Automatically Verifiable Textual and Visual Questions
philokey  authored a paper about 1 month ago
CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and Reasoning
philokey  authored a paper about 1 month ago
Video-SafetyBench: A Benchmark for Safety Evaluation of Video LVLMs

Team members: Richeng Xuan, Xuannan Liu, llvvvv, Sherlock, Gray, makarov, Zheqi He, jingshu, daiteng01, lixuejing, HelloGitHub

FlagEval's datasets (11)

FlagEval/EmbodiedVerse-Bench

Viewer • Updated Jun 25 • 2.04k • 102

FlagEval/Where2Place

Viewer • Updated May 29 • 100 • 89

FlagEval/SAT

Viewer • Updated May 6 • 150 • 37

FlagEval/HMMT_2025

Viewer • Updated May 6 • 30 • 36

FlagEval/ERQA

Viewer • Updated Apr 22 • 400 • 341 • 2

FlagEval/sub_spatial

Viewer • Updated Apr 21 • 690 • 5

FlagEval/EmbSpatial-Bench

Viewer • Updated Apr 21 • 3.64k • 102 • 2

FlagEval/coco_val2014_sampled

Viewer • Updated Nov 21, 2024 • 1k • 15

FlagEval/documentation-images

Viewer • Updated Nov 13, 2024 • 3 • 158

FlagEval/CLCC_v1

Viewer • Updated Jul 29, 2024 • 760 • 18 • 3

FlagEval/HalluDial

Updated Jun 26, 2024 • 23 • 3