Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Patronus AI

Team
company
Verified
https://patronus.ai
patronusai
Activity Feed Request to join this org

AI & ML interests

LLM Evaluation

Recent Activity

DarshanDeshpande  authored a paper 25 days ago
MEMTRACK: Evaluating Long-Term Memory and State Tracking in Multi-Platform Dynamic Agent Environments
vgtomahawk  updated a model 4 months ago
PatronusAI/lynx3_4b_full_finetune_v0.995-fp8-dynamic
vgtomahawk  published a model 4 months ago
PatronusAI/lynx3_4b_full_finetune_v0.995-fp8-dynamic
View all activity

Papers

MEMTRACK: Evaluating Long-Term Memory and State Tracking in Multi-Platform Dynamic Agent Environments

View all Papers

Darshan Deshpande's profile picture Varun Gangal's profile picture Anand Kannappan's profile picture Rebecca Qian's profile picture Bartosz Mielczarek's profile picture Bartosz Mielczarek's profile picture Varun Joshi's profile picture Arek's profile picture Sky Wang's profile picture Maciej Gełdon's profile picture Snigdha Banda's profile picture Shivani Jain's profile picture Hersh Mehta's profile picture Edgar Colque's profile picture Jedrzej's profile picture Chinmayee Kulkarni's profile picture Devanshu Bansal's profile picture

PatronusAI 's Spaces 5

pinned
Running
7

TRAIL Leaderboard

🥇

Trace Reasoning and Agentic Issue Localization Leaderboard

May 15
pinned
Runtime error
105

Enterprise Scenarios Leaderboard

🥇

Jun 12, 2024
Running
3

BLUR Leaderboard

🌍

BLUR leaderboard.

Apr 2
Runtime error
7

GLIDER

🦅

GLIDER: Grading LLM Interactions and Decisions using Explain

Dec 19, 2024
Runtime error
6

LynxDemo

🔥

Evaluate answer fidelity to document

Aug 15, 2024
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs