Actions: tatsu-lab/alpaca_eval

test format leaderboard

248 workflow runs

Add TOA to AlpacaEval
test format leaderboard #251: Pull request #428 synchronize by YannDubs
December 27, 2024 20:20 2m 20s oceanypt:main
Add TOA to AlpacaEval
test format leaderboard #250: Pull request #428 opened by oceanypt
December 26, 2024 13:31 4m 34s oceanypt:main
Add FuseChat-3.0 models to AlpacaEval
test format leaderboard #249: Pull request #426 opened by yangzy39
December 16, 2024 07:01 4m 22s yangzy39:main
Add FuseChat-Llama-3.1-8B-Instruct, FuseChat-Gemma-2-9B-Instruct and …
test format leaderboard #248: Pull request #424 opened by yangzy39
December 15, 2024 06:46 4m 28s yangzy39:main
Add Llama-3-Instruct-8B-RainbowPO to AlpacaEval
test format leaderboard #247: Pull request #416 synchronize by hanyang1999
October 31, 2024 17:25 2m 22s hanyang1999:main
Add Llama-3-Instruct-8B-RainbowPO to AlpacaEval
test format leaderboard #246: Pull request #416 opened by hanyang1999
October 31, 2024 04:36 2m 18s hanyang1999:main
Add NullModel to AlpacaEval
test format leaderboard #245: Pull request #414 synchronize by YannDubs
October 23, 2024 17:00 2m 38s xszheng2020:main
Add NullModel to AlpacaEval
test format leaderboard #244: Pull request #414 opened by xszheng2020
October 15, 2024 20:39 4m 18s xszheng2020:main
Add GPO-Llama-3-8B-Instruct-GPM-2B and SPPO-Llama-3-8B-Instruct-GPM-2…
test format leaderboard #243: Pull request #413 opened by xukp20
October 10, 2024 13:50 4m 53s general-preference:main
Add SelfMoA_gemma-2-9b-it-SimPO, SelfMoA_gemma-2-9b-it-WPO-HB to AlpacaEval
test format leaderboard #242: Pull request #411 synchronize by wenzhe-li
September 25, 2024 19:37 2m 40s wenzhe-li:main
Add SelfMoA_gemma-2-9b-it-SimPO, SelfMoA_gemma-2-9b-it-WPO-HB to AlpacaEval
test format leaderboard #241: Pull request #411 opened by wenzhe-li
September 24, 2024 03:26 2m 39s wenzhe-li:main
Add Llama-3-8B-Instruct-SkillMix to AlpacaEval
test format leaderboard #240: Pull request #405 synchronize by YannDubs
September 15, 2024 17:30 4m 20s parksimon0808:main
Add REBEL-Llama-3-8B-Instruct-Armo to AlpacaEval
test format leaderboard #239: Pull request #403 opened by ZhaolinGao
August 28, 2024 18:50 2m 4s ZhaolinGao:main
Add Shopee-SlimMoA-v1 to AlpacaEval
test format leaderboard #238: Pull request #398 synchronize by YannDubs
August 26, 2024 21:32 2m 3s LLM-Alignment-sh:main
Add blendaxai-gm-l6-vo31 to AlpacaEval
test format leaderboard #237: Pull request #399 opened by ym-blendax-ai
August 23, 2024 12:45 2m 7s Blendax-AI:main
Add Shopee-SlimMoA-v1 to AlpacaEval
test format leaderboard #236: Pull request #398 opened by LLM-Alignment-sh
August 23, 2024 11:38 2m 5s LLM-Alignment-sh:main
Added Llama3-PBM-Nova-70B model
test format leaderboard #235: Pull request #395 synchronize by PKU-Baichuan
August 23, 2024 06:09 2m 4s PKU-Baichuan:main
Add blendaxai-gm-l6-vo14 to AlpacaEval
test format leaderboard #233: Pull request #397 synchronize by ym-blendax-ai
August 22, 2024 20:11 1m 58s main
Add blendaxai-gm-l6-vo14 to AlpacaEval
test format leaderboard #232: Pull request #397 opened by ym-blendax-ai
August 22, 2024 20:05 2m 13s main
Added Llama3-PBM-Nova-70B model
test format leaderboard #231: Pull request #395 synchronize by PKU-Baichuan
August 21, 2024 06:59 2m 11s PKU-Baichuan:main
Added Llama3-PBM-Nova-70B model
test format leaderboard #230: Pull request #395 opened by PKU-Baichuan
August 19, 2024 13:10 2m 11s PKU-Baichuan:main
[ENH] add mistral v0.3, Qwen2 70b, gtp4 mini
test format leaderboard #229: Pull request #393 synchronize by YannDubs
August 17, 2024 22:48 2m 6s yann/models_rubriceval
[ENH] add mistral v0.3, Qwen2 70b, gtp4 mini
test format leaderboard #228: Pull request #393 opened by YannDubs
August 17, 2024 22:39 2m 8s yann/models_rubriceval
Add blendaxai-gm-l3-v35 to AlpacaEval
test format leaderboard #227: Pull request #389 synchronize by ym-blendax-ai
August 14, 2024 17:57 2m 8s main
Add blendaxai-gm-l3-v35 to AlpacaEval
test format leaderboard #226: Pull request #389 opened by ym-blendax-ai
August 14, 2024 15:32 2m 5s main