Submitted by
taesiri
Scale AI
company
Verified
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
ResearchRubrics: A Benchmark of Prompts and Rubrics For Evaluating Deep Research Agents
Chasing the Tail: Effective Rubric-based Reward Modeling for Large Language Model Post-Training