Skip to content

Commit

Permalink
Automated leaderboard update
Browse files Browse the repository at this point in the history
  • Loading branch information
actions-user committed Oct 23, 2024
1 parent 3c6ae8f commit fccfd4e
Showing 1 changed file with 1 addition and 0 deletions.
Original file line number Diff line number Diff line change
@@ -1,4 +1,5 @@
name,length_controlled_winrate,win_rate,avg_length,link,samples,filter
NullModel (adversarial),86.45780691307947,76.91979180386511,872,https://github.com/sail-sg/Cheating-LLM-Benchmarks/,https://github.com/tatsu-lab/alpaca_eval/blob/main/results/NullModel/model_outputs.json,community
SelfMoA + gemma-2-9b-it-WPO-HB,78.53928111481099,77.58955217385297,3261,https://github.com/wenzhe-li/Self-MoA/,https://github.com/tatsu-lab/alpaca_eval/blob/main/results/SelfMoA_gemma-2-9b-it-WPO-HB/model_outputs.json,community
Shopee SlimMoA v1,77.4515432873834,75.6142865980535,1994,https://github.com/LLM-Alignment-sh/Shopee-SlimMoA,https://github.com/tatsu-lab/alpaca_eval/blob/main/results/Shopee-SlimMoA-v1/model_outputs.json,community
Blendax.AI-gm-l6-vo31,76.91981221023656,69.11033492869565,1809,https://www.blendax.ai/post/blendaxai-gm-l6-vo31,https://github.com/tatsu-lab/alpaca_eval/blob/main/results/blendaxai-gm-l6-vo31/model_outputs.json,community
Expand Down

0 comments on commit fccfd4e

Please sign in to comment.