Xu Yifan's picture

3 6

Xu Yifan

xuyifan

·

AI & ML interests

None yet

Recent Activity

authored a paper 8 days ago

GOAL: A Challenging Knowledge-grounded Video Captioning Benchmark for Real-time Soccer Commentary Generation

authored a paper 8 days ago

AlignBench: Benchmarking Chinese Alignment of Large Language Models

authored a paper 8 days ago

WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences

View all activity

Organizations

None yet

authored 12 papers 8 days ago

GOAL: A Challenging Knowledge-grounded Video Captioning Benchmark for Real-time Soccer Commentary Generation

Paper • 2303.14655 • Published Mar 26, 2023

AlignBench: Benchmarking Chinese Alignment of Large Language Models

Paper • 2311.18743 • Published Nov 30, 2023 • 1

WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences

Paper • 2306.07906 • Published Jun 13, 2023 • 13

AgentBench: Evaluating LLMs as Agents

Paper • 2308.03688 • Published Aug 7, 2023 • 25

ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline

Paper • 2404.02893 • Published Apr 3, 2024 • 22

GLM-130B: An Open Bilingual Pre-trained Model

Paper • 2210.02414 • Published Oct 5, 2022 • 3

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

Paper • 2406.12793 • Published Jun 18, 2024 • 33

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents

Paper • 2408.06327 • Published Aug 12, 2024 • 17

AutoGLM: Autonomous Foundation Agents for GUIs

Paper • 2411.00820 • Published Oct 28, 2024 • 2

AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents

Paper • 2410.24024 • Published Oct 31, 2024 • 49

AndroidGen: Building an Android Language Agent under Data Scarcity

Paper • 2504.19298 • Published Apr 27

AgentRL: Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework

Paper • 2510.04206 • Published 29 days ago • 2