Spaces:

lmarena
/

chatbot-arena-leaderboard

Running

LLMArena commited on Nov 18, 2024

Commit

6e35af2

verified ·

1 Parent(s): 73fff04

Update app.py

Files changed (1) hide show

app.py CHANGED Viewed

@@ -50,6 +50,7 @@ def make_arena_leaderboard_md(arena_df, last_updated_time):
     leaderboard_md = f"""
 Total # of models: **{total_models}**.{space} Total # of votes: **{"{:,}".format(total_votes)}**.{space} Last updated: {last_updated_time}.
 ***Rank (UB)**: model rating (upper bound), determined as one plus the number of models that are statistically better than the target model.
 Model A is statistically better than Model B when the lower bound of Model A's rating is higher than the upper bound of Model B's rating (with a 95% confidence interval).
 See Figure 1 below for a visualization of the confidence intervals of model ratings.

     leaderboard_md = f"""
 Total # of models: **{total_models}**.{space} Total # of votes: **{"{:,}".format(total_votes)}**.{space} Last updated: {last_updated_time}.
 ***Rank (UB)**: model rating (upper bound), determined as one plus the number of models that are statistically better than the target model.
 Model A is statistically better than Model B when the lower bound of Model A's rating is higher than the upper bound of Model B's rating (with a 95% confidence interval).
 See Figure 1 below for a visualization of the confidence intervals of model ratings.