Actions: tatsu-lab/alpaca_eval

test format leaderboard

249 workflow runs

[ENH] adding drbx and gpt4 turbo
test format leaderboard #150: Pull request #275 synchronize by YannDubs
April 12, 2024 17:49 2m 14s yann/turbo
Add Nanbeige2-8B-Chat to AlpacaEval
test format leaderboard #149: Pull request #274 opened by yuani114
April 12, 2024 08:48 2m 9s main
Add Conifer-7B-DPO to AlpacaEval
test format leaderboard #148: Pull request #267 opened by liulixin29
April 3, 2024 15:57 2m 5s liulixin29:main
"Add Mistral-7B-LoRA-RAHF-DUAL to AlpacaEval"
test format leaderboard #147: Pull request #266 opened by LiuAmber
April 2, 2024 12:56 2m 19s Mistral-7B-LoRA-RAHF-DUAL
Add TempNet-LLaMA2-Chat to AlpacaEval
test format leaderboard #146: Pull request #264 synchronize by xumao-nju
April 2, 2024 09:14 2m 14s xumao-nju:main
Add TempNet-LLaMA2-Chat to AlpacaEval
test format leaderboard #145: Pull request #264 opened by xumao-nju
April 1, 2024 10:44 1m 42s xumao-nju:main
Add Ein-70B-v0.1 to AlpacaEval
test format leaderboard #144: Pull request #262 opened by bin-bi
March 25, 2024 01:08 1m 51s bin-bi:main
Supplement for Aligner
test format leaderboard #143: Pull request #261 synchronize by AlignInc
March 24, 2024 11:58 1m 55s supplement
Supplement for Aligner
test format leaderboard #142: Pull request #261 synchronize by AlignInc
March 22, 2024 19:58 1m 55s supplement
Supplement for Aligner
test format leaderboard #141: Pull request #261 opened by AlignInc
March 22, 2024 19:51 1m 47s supplement
Add Aligner-2B+Qwen1.5-72B-Chat & Aligner-2B+Claude3 Opus to AlpacaEval
test format leaderboard #140: Pull request #259 synchronize by AlignInc
March 22, 2024 05:47 2m 10s main
Yann/length correction
test format leaderboard #138: Pull request #258 synchronize by YannDubs
March 20, 2024 02:23 1m 58s yann/length_correction
Yann/length correction
test format leaderboard #137: Pull request #258 synchronize by YannDubs
March 20, 2024 02:23 1m 58s yann/length_correction
Yann/length correction
test format leaderboard #136: Pull request #258 synchronize by YannDubs
March 20, 2024 02:21 2m 3s yann/length_correction
Yann/length correction
test format leaderboard #135: Pull request #258 synchronize by YannDubs
March 20, 2024 02:18 2m 13s yann/length_correction
Yann/length correction
test format leaderboard #134: Pull request #258 synchronize by YannDubs
March 20, 2024 02:17 2m 5s yann/length_correction
Yann/length correction
test format leaderboard #133: Pull request #258 synchronize by YannDubs
March 20, 2024 02:16 1m 57s yann/length_correction
Add Aligner-2B+Qwen1.5-72B-Chat & Aligner-2B+Claude3 Opus to AlpacaEval
test format leaderboard #132: Pull request #259 opened by AlignInc
March 18, 2024 14:27 2m 33s main
Yann/length correction
test format leaderboard #131: Pull request #258 synchronize by YannDubs
March 18, 2024 01:04 2m 35s yann/length_correction
Yann/length correction
test format leaderboard #130: Pull request #258 synchronize by YannDubs
March 18, 2024 01:02 2m 8s yann/length_correction
Yann/length correction
test format leaderboard #129: Pull request #258 synchronize by YannDubs
March 17, 2024 21:14 2m 20s yann/length_correction
Yann/length correction
test format leaderboard #128: Pull request #258 synchronize by YannDubs
March 17, 2024 03:35 3m 20s yann/length_correction
[ENH] length controlled ALpacaEval
test format leaderboard #127: Pull request #248 synchronize by YannDubs
March 17, 2024 03:23 3m 17s yann/length_correction
Add Mistral-ORPO-Beta to AlpacaEval
test format leaderboard #126: Pull request #257 opened by jiwooya1000
March 16, 2024 19:06 2m 11s jiwooya1000:main
Add Samba-CoE-v0.2-best-of-16 to AlpacaEval
test format leaderboard #125: Pull request #256 opened by kyleliang919
March 15, 2024 13:09 2m 48s main