Fangchen Yu

SciYu

https://sciyu.github.io/

AI & ML interests

None yet

Recent Activity

authored a paper 3 days ago

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

upvoted a paper 3 days ago

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

upvoted a collection 3 days ago

SGI-Bench

Organizations

None yet

authored a paper 3 days ago

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

Paper • 2512.16969 • Published 7 days ago • 101

upvoted a paper 3 days ago

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

Paper • 2512.16969 • Published 7 days ago • 101

upvoted a collection 3 days ago

SGI-Bench

Collection

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows • 9 items • Updated 1 day ago • 29

New activity in SciYu/HiPhO 6 days ago

Add Quick Start / Sample Usage section

#3 opened 3 months ago by

nielsr

updated a dataset 16 days ago

SciYu/HiPhO

Viewer • Updated 16 days ago • 138 • 1.16k • 5

upvoted 2 papers about 1 month ago

Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks

Paper • 2511.15065 • Published Nov 19 • 74

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17 • 133

New activity in SciYu/HiPhO about 2 months ago

[bot] Conversion to Parquet

#2 opened 4 months ago by

parquet-converter

authored a paper 3 months ago

HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?

Paper • 2509.07894 • Published Sep 9 • 31

New activity in SciYu/HiPhO 4 months ago

Add metadata (task categories, license, language, tags) to HiPhO dataset card

#1 opened 4 months ago by

nielsr

liked a dataset 4 months ago

SciYu/HiPhO

Viewer • Updated 16 days ago • 138 • 1.16k • 5

upvoted a paper 4 months ago

HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?

Paper • 2509.07894 • Published Sep 9 • 31

published a dataset 4 months ago

SciYu/HiPhO

Viewer • Updated 16 days ago • 138 • 1.16k • 5

upvoted a paper 4 months ago

DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks

Paper • 2509.01396 • Published Sep 1 • 57
