Spaces:

ahmad21omar
/

SLR-Bench

Sleeping

ahmad21omar commited on 27 days ago

Commit

d46e146

1 Parent(s): 9fe6f9f

remove colors in text

Files changed (1) hide show

app.py CHANGED Viewed

@@ -9,7 +9,7 @@ This is a leaderboard displaying model performance metrics for the SLR-BENCH ben
 We report the models’ Logical Reasoning Level (LRL), syntax score,
 stage-specific logical reasoning accuracy (basic, easy, medium, hard), total completion tokens, and inference cost.
 Higher LRL and accuracy indicate superior logical reasoning; lower compute, greater efficiency. Performance drops
-as complexity increases, while Reasoning LLMs (orange) consistently outperform conventional LLMs (blue).
 """)
 # Load the CSV file

 We report the models’ Logical Reasoning Level (LRL), syntax score,
 stage-specific logical reasoning accuracy (basic, easy, medium, hard), total completion tokens, and inference cost.
 Higher LRL and accuracy indicate superior logical reasoning; lower compute, greater efficiency. Performance drops
+as complexity increases, while Reasoning LLMs consistently outperform conventional LLMs.
 """)
 # Load the CSV file