Commit · d46e146
1 Parent(s): 9fe6f9f
remove colors in text
app.py CHANGED
@@ -9,7 +9,7 @@ This is a leaderboard displaying model performance metrics for the SLR-BENCH ben
 We report the models’ Logical Reasoning Level (LRL), syntax score,
 stage-specific logical reasoning accuracy (basic, easy, medium, hard), total completion tokens, and inference cost.
 Higher LRL and accuracy indicate superior logical reasoning; lower compute, greater efficiency. Performance drops
-as complexity increases, while Reasoning LLMs
+as complexity increases, while Reasoning LLMs consistently outperform conventional LLMs.
 """)
 
 # Load the CSV file
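The trailing context line `# Load the CSV file` and the description's rule of thumb (higher LRL is better, lower compute is more efficient) can be illustrated with a minimal sketch. This is not the code from app.py, which the diff does not show. The column names (`model`, `lrl`, `cost_usd`), the sample rows, and the helper `load_leaderboard` are all hypothetical, chosen only to show one way such a table might be ranked.

```python
import csv
import io

# Hypothetical leaderboard rows; column names and numbers are illustrative only
# and do not come from the SLR-BENCH results.
SAMPLE_CSV = """model,lrl,cost_usd
model-a,12.5,3.20
model-b,8.0,0.45
model-c,12.5,1.10
"""


def load_leaderboard(text):
    """Parse CSV text and rank rows: higher LRL first, lower cost first on ties."""
    rows = list(csv.DictReader(io.StringIO(text)))
    for row in rows:
        # csv.DictReader yields strings, so convert the numeric columns.
        row["lrl"] = float(row["lrl"])
        row["cost_usd"] = float(row["cost_usd"])
    # Negate LRL so that a single ascending sort puts the best model first.
    return sorted(rows, key=lambda r: (-r["lrl"], r["cost_usd"]))


ranked = load_leaderboard(SAMPLE_CSV)
```

Breaking ties on cost is one possible reading of "lower compute, greater efficiency"; the actual app may order or display the metrics differently.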