Actions: tatsu-lab/alpaca_eval

test format leaderboard

249 workflow runs

Add Samba-CoE-v0.2 to AlpacaEval
test format leaderboard #124: Pull request #253 synchronize by kyleliang919
March 9, 2024 16:17 2m 11s main
Add Samba-CoE-v0.2 to AlpacaEval
test format leaderboard #123: Pull request #253 opened by kyleliang919
March 9, 2024 05:39 2m 23s main
[ENH] length controlled ALpacaEval
test format leaderboard #122: Pull request #248 synchronize by YannDubs
March 7, 2024 21:21 2m 16s yann/length_correction
[ENH] add mistral large
test format leaderboard #121: Pull request #251 opened by YannDubs
March 7, 2024 05:40 1m 53s yann/mistral_large
[ENH] add contextual
test format leaderboard #120: Pull request #250 synchronize by YannDubs
March 7, 2024 03:29 2m 33s yann/contextual_no_length
[ENH] add contextual
test format leaderboard #119: Pull request #250 opened by YannDubs
March 7, 2024 03:23 1m 50s yann/contextual_no_length
[ENH] add contextual
test format leaderboard #118: Pull request #249 opened by YannDubs
March 7, 2024 03:19 3m 10s yann/contextual
add Contextual-KTO-Mistral-PairRM to AlpacaEval2
test format leaderboard #117: Pull request #246 synchronize by xwinxu
March 6, 2024 20:39 2m 56s xwinxu:main
add Contextual-KTO-Mistral-PairRM to AlpacaEval2
test format leaderboard #116: Pull request #246 synchronize by xwinxu
March 6, 2024 20:36 2m 4s xwinxu:main
[ENH] length controlled ALpacaEval
test format leaderboard #115: Pull request #248 synchronize by YannDubs
March 6, 2024 11:39 2m 26s yann/length_correction
[ENH] length controlled ALpacaEval
test format leaderboard #114: Pull request #248 synchronize by YannDubs
March 6, 2024 11:33 2m 17s yann/length_correction
[ENH] length controlled ALpacaEval
test format leaderboard #113: Pull request #248 synchronize by YannDubs
March 6, 2024 11:22 2m 44s yann/length_correction
[ENH] length controlled ALpacaEval
test format leaderboard #112: Pull request #248 synchronize by YannDubs
March 6, 2024 11:16 1m 58s yann/length_correction
[ENH] length controlled ALpacaEval
test format leaderboard #111: Pull request #248 synchronize by YannDubs
March 6, 2024 10:37 3m 8s yann/length_correction
[ENH] length controlled ALpacaEval
test format leaderboard #110: Pull request #248 opened by YannDubs
March 6, 2024 10:18 1m 33s yann/length_correction
add Contextual-KTO-Mistral-PairRM to AlpacaEval2
test format leaderboard #109: Pull request #246 synchronize by xwinxu
March 6, 2024 01:56 2m 13s xwinxu:main
[ENH] add claude 3
test format leaderboard #108: Pull request #247 opened by YannDubs
March 5, 2024 12:12 2m 16s yann/claude3
add Contextual-KTO-Mistral-PairRM to AlpacaEval2
test format leaderboard #107: Pull request #246 synchronize by xwinxu
March 5, 2024 07:29 2m 8s xwinxu:main
add Contextual-KTO-Mistral-PairRM to AlpacaEval2
test format leaderboard #106: Pull request #246 synchronize by xwinxu
March 5, 2024 06:36 2m 18s xwinxu:main
add Mistral-7B-ReMax-v0.1
test format leaderboard #105: Pull request #245 synchronize by liziniu
March 1, 2024 04:17 2m 5s liziniu:main
add Mistral-7B-ReMax-v0.1
test format leaderboard #104: Pull request #245 opened by liziniu
February 29, 2024 14:37 2m 23s liziniu:main
[NOTEBOOK] adding final length correction notebook.
test format leaderboard #103: Pull request #244 synchronize by YannDubs
February 28, 2024 06:01 1m 53s yann/length_controlled_ae
[NOTEBOOK] adding final length correction notebook.
test format leaderboard #102: Pull request #244 synchronize by YannDubs
February 28, 2024 05:10 2m 54s yann/length_controlled_ae
[NOTEBOOK] adding final length correction notebook.
test format leaderboard #101: Pull request #244 opened by YannDubs
February 28, 2024 05:08 1m 57s yann/length_controlled_ae
[DATA] Add Gemma
test format leaderboard #100: Pull request #242 opened by YannDubs
February 24, 2024 10:32 2m 15s yann/add_gemma